GitHub Repository Finder
I wrote a console tool in Python for searching GitHub repositories based on semantic descriptions. The program solves a simple problem — the standard GitHub search only looks for exact words in the title, while this tool understands the meaning of the query.
The user enters a short description in 2–3 words, such as "data visualization" or "telegram bot," and the program finds the most suitable repositories by itself. Under the hood, TF-IDF and Cosine Similarity algorithms work — they convert text into numerical vectors and compare them with each other.
The project is written in pure Python without third-party libraries. The code is divided into modules with a separation of responsibilities: a separate file for algorithms, for working with the API, for error handling with logging. The GitHub token is optional — it works without it, but is faster with it.
The user enters a short description in 2–3 words, such as "data visualization" or "telegram bot," and the program finds the most suitable repositories by itself. Under the hood, TF-IDF and Cosine Similarity algorithms work — they convert text into numerical vectors and compare them with each other.
The project is written in pure Python without third-party libraries. The code is divided into modules with a separation of responsibilities: a separate file for algorithms, for working with the API, for error handling with logging. The GitHub token is optional — it works without it, but is faster with it.