Cross-posted from “Extension showing similar repositories on Github” by @Mubelotix@jlai.lu in !firefox_addons@lemmy.ml
I created an extension adding a “Similar repositories” section to the sidebar of any public GitHub repository with more than 150 stars. It allows you to discover related open-source projects without leaving the page.
It’s also open-source! https://github.com/Mubelotix/SimRepo
Why is this 110MB though?
Because it stores the ML model. Everything is local
Thx for your answer, makes sense! I was already wondering how the recommendations are generated. Maybe you can put this in the readme as well? I think that will make it more clear what a user can expect.
Great idea, thank you for your feedback! I generated embeddings for repositories using a SVC model trained on a dataset containing almost all stars of Github. Recommendations are produced by finding the nearest neighbors in the resulting vectorial space. I might be improving this even further by including forks and watcher data in the future, but it’s becoming expensive to train