Parsing the website conrad.pl into XML on a schedule
The site needs to be parsed regularly, on a schedule. No translation or formatting is required; another person will handle that.
The following fields must be parsed: name, photos, price without discount, price with discount, availability, product code, description, and characteristics.
Each group is parsed separately, from its own link.
In each link the category and the necessary filters are already selected; all product cards from all result pages must be parsed.
There will be about 12 such links, one per main group. Not all subgroups are needed, only some, so either a subgroup filter is required, or the filter can be skipped by supplying more links, one for each subgroup that is needed.
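The "more links, one per subgroup" option can be sketched as follows. This is only an illustration: it starts from the example link given in this brief and swaps the categoryId parameter while keeping the filters. The subgroup ids (t0101, t0105) are hypothetical placeholders, and whether subgroups are addressed through categoryId in this way is an assumption to check on the site.

```python
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit

# Example link from this brief; its query string already carries the filters.
BASE_URL = (
    "https://www.conrad.pl/pl/search.html?categoryId=t01"
    "&tfr_price=0.112~~~141218.7&tfo_flags=priceReducedProduct"
    "&tfo_availabilityColor=green"
)


def with_params(url: str, **overrides: str) -> str:
    """Return url with the given query parameters replaced or added."""
    parts = urlsplit(url)
    query = dict(parse_qsl(parts.query, keep_blank_values=True))
    query.update(overrides)
    # safe="~" keeps the "~~~" range separator unescaped.
    return urlunsplit(parts._replace(query=urlencode(query, safe="~")))


# Hypothetical subgroup ids; the real ones must be read from the site.
WANTED_SUBGROUPS = ["t0101", "t0105"]

urls = [with_params(BASE_URL, categoryId=cat) for cat in WANTED_SUBGROUPS]

for url in urls:
    print(url)
```

Keeping the wanted subgroups in a plain list means the selection can be changed without touching the parser itself.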

For this project I rent a separate server at https://hyperhost.ua/; you will need to configure it.
This is a small marketplace with over 1 million products. With the subgroup filters applied, there will be up to 200 thousand.
1. There are only 12 main categories, so there will be 12 links to parse.

Here is the link for the first group as an example:
https://www.conrad.pl/pl/search.html?categoryId=t01&tfr_price=0.112~~~141218.7&tfo_flags=priceReducedProduct&tfo_availabilityColor=green
The link already contains the filters I need: price, new products and promotions, availability.
However, only selected subgroups need to be parsed, not all of them.
2. For photos, only links to the images on the site are needed.
3. The description must keep its HTML markup.
All the data that needs to be parsed.


4. Parsing should preferably run once a day, or at least once every 2 days. It would probably be good to do it this way:
if the product has been parsed before, just update the price, discount, and quantity;
if the product is new, parse all the data.
The site returns only 2000 products per listing, while many subgroups have far more, so this limit needs to be worked around. It may be necessary to go deeper, into subgroups at level 3 or 4, or perhaps you can find another way around it.
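Besides going deeper into subgroups, one possible way around the 2000-product limit is to narrow the price filter until each window stays under the cap. The sketch below only shows the splitting logic. It assumes that the tfr_price parameter in the example link is a "min~~~max" range and that the site reports a total result count per query; neither has been verified. The fake_count function stands in for a real request and is purely hypothetical.

```python
from typing import Callable, List, Tuple

CAP = 2000  # result cap reported in this brief


def split_price_range(
    lo: float,
    hi: float,
    count: Callable[[float, float], int],
    cap: int = CAP,
    min_width: float = 0.01,
) -> List[Tuple[float, float]]:
    """Bisect [lo, hi] until every non-empty window has at most cap results."""
    n = count(lo, hi)
    if n == 0:
        return []  # nothing to fetch in this window
    if n <= cap or hi - lo <= min_width:
        return [(lo, hi)]  # small enough, or cannot be split further
    mid = (lo + hi) / 2
    return (
        split_price_range(lo, mid, count, cap, min_width)
        + split_price_range(mid, hi, count, cap, min_width)
    )


# Stand-in data: 10000 fake prices from 0.5 to 5000.0.
fake_prices = [i * 0.5 for i in range(1, 10001)]


def fake_count(lo: float, hi: float) -> int:
    """Hypothetical replacement for asking the site how many results match."""
    return sum(lo <= p <= hi for p in fake_prices)


# Price bounds taken from the example link in this brief.
windows = split_price_range(0.112, 141218.7, fake_count)

for lo, hi in windows:
    # Assumed format of the price filter value.
    print(f"tfr_price={lo}~~~{hi}", fake_count(lo, hi))
```

Adjacent windows share a boundary price, so a product priced exactly on a boundary can appear twice; deduplicating by product code covers that case.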
THE SITE IS PROTECTED BY Cloudflare.
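The update rule from item 4 (refresh only price, discount, and quantity for known products; parse everything for new ones) can be sketched like this. The class and function names are illustrative, not part of any existing code. It is an assumption that price and availability can be read from the listing pages without opening each product card, and fetch_full is a hypothetical callback that would load a full product card. Availability is modelled as a simple flag here; a quantity field would work the same way.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, Iterable, List


@dataclass
class ListingRow:
    """Fields assumed to be visible on a listing page."""
    code: str
    price: float       # price without discount
    sale_price: float  # price with discount
    available: bool


@dataclass
class Product:
    """Full record with the fields requested in this brief."""
    code: str
    name: str = ""
    price: float = 0.0
    sale_price: float = 0.0
    available: bool = False
    pictures: List[str] = field(default_factory=list)  # photo links only
    description_html: str = ""
    characteristics: Dict[str, str] = field(default_factory=dict)


def sync(
    store: Dict[str, Product],
    rows: Iterable[ListingRow],
    fetch_full: Callable[[str], Product],
) -> List[str]:
    """Update known products in place; fully fetch new ones. Returns new codes."""
    new_codes = []
    for row in rows:
        product = store.get(row.code)
        if product is None:
            product = fetch_full(row.code)  # new product: parse all data
            store[row.code] = product
            new_codes.append(row.code)
        # Known or new, the volatile fields come from the listing row.
        product.price = row.price
        product.sale_price = row.sale_price
        product.available = row.available
    return new_codes


def fake_fetch(code: str) -> Product:
    """Hypothetical stand-in for parsing a full product card."""
    return Product(code=code, name=f"Fetched {code}")


store = {"A1": Product(code="A1", name="Old item", price=10.0, sale_price=9.0, available=True)}
rows = [ListingRow("A1", 10.0, 8.5, True), ListingRow("B2", 20.0, 15.0, False)]
new_codes = sync(store, rows, fake_fetch)
print(new_codes)
```

With this split, a daily run opens full product cards only for products it has not seen before, which keeps the number of requests low.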
The parser must be installed on a VPN server and run automatically on a schedule via cron.
The result of the work is several YML (XML) files in the PROM format - description of the format here
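A minimal sketch of writing such a file with the Python standard library is shown below. The element names (yml_catalog, shop, categories, offers, offer, price, oldprice, currencyId, categoryId, picture, vendorCode, description, param) follow the common YML layout and must be checked against the PROM format description; the PLN currency and all sample values are assumptions made for illustration.

```python
import xml.etree.ElementTree as ET
from datetime import datetime
from typing import Dict, List


def build_yml(categories: Dict[str, str], offers: List[dict], currency: str = "PLN") -> ET.Element:
    """Build a yml_catalog tree from category names and offer dicts."""
    root = ET.Element("yml_catalog", date=datetime.now().strftime("%Y-%m-%d %H:%M"))
    shop = ET.SubElement(root, "shop")
    cats_el = ET.SubElement(shop, "categories")
    for cat_id, cat_name in categories.items():
        ET.SubElement(cats_el, "category", id=cat_id).text = cat_name
    offers_el = ET.SubElement(shop, "offers")
    for o in offers:
        offer = ET.SubElement(
            offers_el, "offer",
            id=o["code"], available="true" if o["available"] else "false",
        )
        ET.SubElement(offer, "name").text = o["name"]
        ET.SubElement(offer, "price").text = f'{o["sale_price"]:.2f}'  # with discount
        ET.SubElement(offer, "oldprice").text = f'{o["price"]:.2f}'    # without discount
        ET.SubElement(offer, "currencyId").text = currency
        ET.SubElement(offer, "categoryId").text = o["category_id"]
        for link in o["pictures"]:
            ET.SubElement(offer, "picture").text = link  # links only, no downloads
        ET.SubElement(offer, "vendorCode").text = o["code"]
        # ElementTree escapes the HTML markup, which keeps the XML valid.
        ET.SubElement(offer, "description").text = o["description_html"]
        for key, value in o["characteristics"].items():
            ET.SubElement(offer, "param", name=key).text = value
    return root


# Sample data, invented for illustration only.
sample_offer = {
    "code": "1234567",
    "name": "Sample product",
    "price": 99.90,
    "sale_price": 79.90,
    "available": True,
    "category_id": "t01",
    "pictures": ["https://example.com/a.jpg", "https://example.com/b.jpg"],
    "description_html": "<p>Sample</p>",
    "characteristics": {"Color": "black"},
}

root = build_yml({"t01": "Sample group"}, [sample_offer])
xml_text = ET.tostring(root, encoding="unicode")
print(xml_text)

# To write a file instead:
# ET.ElementTree(root).write("group_t01.xml", encoding="utf-8", xml_declaration=True)
```

For the schedule, a crontab entry along the lines of "0 3 * * * python3 /opt/conrad/run.py" would start a run every day at 03:00; the script path is hypothetical.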
Client's review of cooperation with Artem Plakha
I recommend for collaboration )
Freelancer's review of cooperation with Dmitry Chenkov
This is the first time we are working with Mr. Dmytro, everyone is satisfied. Thank you for the clearly defined task, open communication, and generous tips.