The goal of the project was to create a data collection system and implement an AI module for moderating NSFW content.
The result is a clean database in Excel format, free of duplicates and irrelevant content.
Stack: Python, Selenium, PyTorch, Transformers, Pandas, ImageHash.
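
The duplicate removal mentioned above could look roughly like the sketch below. It is not the project's actual code. It assumes each collected image already has a perceptual hash stored as a hex string (for example, the string form of a hash produced by ImageHash's `imagehash.phash`), and it uses only the standard library so it runs on its own. The function names and the `max_distance` threshold are hypothetical.

```python
def hamming_distance(hash_a: str, hash_b: str) -> int:
    """Count the differing bits between two equal-length hex hash strings."""
    if len(hash_a) != len(hash_b):
        raise ValueError("hashes must have the same length")
    return bin(int(hash_a, 16) ^ int(hash_b, 16)).count("1")


def drop_near_duplicates(records, max_distance=4):
    """Keep the first record from each group of near-identical hashes.

    records: iterable of (item_id, hex_hash) pairs.
    max_distance: assumed threshold in bits; it would need tuning on real data.
    """
    kept = []
    for item_id, hex_hash in records:
        # A record survives only if it is far enough from everything kept so far.
        if all(hamming_distance(hex_hash, h) > max_distance for _, h in kept):
            kept.append((item_id, hex_hash))
    return kept


# Made-up hashes for illustration only.
records = [
    ("a.jpg", "ffd8e0c0a0908880"),
    ("a_copy.jpg", "ffd8e0c0a0908881"),  # differs from a.jpg by one bit
    ("b.jpg", "0123456789abcdef"),
]
unique = drop_near_duplicates(records)
print([item_id for item_id, _ in unique])
```

The pairwise comparison is quadratic in the number of records, which is acceptable for small collections; a larger dataset would likely need hash bucketing or an index.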