Smart PDF to Excel Parser & Web Automation Tool
Developed a Python-based solution for automating routine tasks in processing technical documentation and interacting with web resources. The program reduces hours of manual catalog checking to seconds of automated work.
Key functionality:
Intelligent parsing (PDF Extraction): Automatically scans complex PDF files and extracts item numbers (NSN) together with the corresponding product descriptions.
Smart Excel generation: Creates structured tables with embedded hyperlinks for instant access to the request form.
Web Automation (Playwright): Automates interaction with the website: the script opens the browser on its own and fills in the required forms, removing manual data entry and the errors that come with it.
Graphical User Interface (GUI): A user-friendly Tkinter control window with a progress bar for monitoring processing status in real time.
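The parsing and Excel steps above could be sketched roughly as follows. This is a minimal illustration, not the tool's actual code: it assumes NSN codes follow the 13-digit 4-2-3-4 layout, that each NSN sits on the same line as its description, and that the request form accepts the NSN as a URL parameter. The names extract_items, hyperlink_formula, pdf_to_excel and the FORM_URL address are all hypothetical.

```python
import re

# Assumption: NSN codes use the 13-digit 4-2-3-4 layout, e.g. 5905-01-234-5678,
# and the description follows the code on the same line.
NSN_LINE = re.compile(r"\b(\d{4}-\d{2}-\d{3}-\d{4})\b\s*(.*)")

# Hypothetical request-form address; the real URL is not given in this description.
FORM_URL = "https://example.com/request?nsn={nsn}"

COLUMNS = ["NSN", "Description", "Request form"]


def extract_items(text):
    """Return (nsn, description) pairs, one per line that contains an NSN."""
    items = []
    for line in text.splitlines():
        match = NSN_LINE.search(line)
        if match:
            items.append((match.group(1), match.group(2).strip()))
    return items


def hyperlink_formula(nsn):
    """Build an Excel HYPERLINK formula that points at the request form."""
    return '=HYPERLINK("{}", "Open form")'.format(FORM_URL.format(nsn=nsn))


def pdf_to_excel(pdf_path, xlsx_path):
    """Read every PDF page with pdfplumber and write the rows with pandas."""
    # Imported lazily so the pure helpers above work without these packages.
    import pdfplumber
    import pandas as pd

    rows = []
    with pdfplumber.open(pdf_path) as pdf:
        for page in pdf.pages:
            # extract_text() can return None for pages without a text layer.
            page_text = page.extract_text() or ""
            for nsn, description in extract_items(page_text):
                rows.append({
                    "NSN": nsn,
                    "Description": description,
                    "Request form": hyperlink_formula(nsn),
                })

    # Writing .xlsx needs an Excel engine such as openpyxl, which stores
    # strings that start with "=" as formulas, so the links are clickable.
    pd.DataFrame(rows, columns=COLUMNS).to_excel(xlsx_path, index=False)
```

Keeping the text-level helpers separate from the file handling means the extraction rule can be checked on plain strings before any PDF is involved; a call such as pdf_to_excel("catalog.pdf", "catalog.xlsx") would then run the whole step.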
Technology stack:
Language: Python
Libraries: Playwright (browser automation), pandas (data manipulation), pdfplumber (PDF text extraction), Tkinter (interface).
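To illustrate how the Playwright part of this stack might drive a form, here is a short sketch using Playwright's synchronous API. The selectors (#nsn, #description, the submit button) and the function names build_form_values and submit_request are assumptions made for the example, since the actual form layout is not described here.

```python
def build_form_values(nsn, description):
    """Map one catalog row to form fields (hypothetical CSS selectors)."""
    return {"#nsn": nsn, "#description": description}


def submit_request(form_url, nsn, description, headless=False):
    """Open the form in Chromium, fill it in, and send it."""
    # Imported lazily so build_form_values can be used without Playwright.
    from playwright.sync_api import sync_playwright

    with sync_playwright() as p:
        # headless=False keeps the browser window visible to the user.
        browser = p.chromium.launch(headless=headless)
        page = browser.new_page()
        page.goto(form_url)
        for selector, value in build_form_values(nsn, description).items():
            page.fill(selector, value)
        # Assumption: the form is sent with a standard submit button.
        page.click("button[type=submit]")
        browser.close()
```

Separating the field mapping from the browser session keeps the selectors in one place, which makes them easy to adjust if the form changes.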