Hybrid Workflow (Scraping + Claude 3.5 API) for mass generation of product cards
I'm looking for an experienced n8n developer/integrator to create an automated pipeline for e-commerce.
Task essence: The system should take minimal input data from Google Sheets (links to donors + raw text), perform parsing, process the array through a multi-agent AI system, and output a ready XML/YML feed for import to the marketplace.
Technical stack: n8n (on VPS), Google Sheets / Drive API, Claude 3.5 Sonnet API, scraping gateways (ScrapeOps / ZenRows or similar).
Short work algorithm (4 stages):
Initialization: Reading a row in the table, creating folders for media on Google Drive.
Parsing: Bypassing Cloudflare protection on donor sites. Collecting photos, brands, and characteristics (in a key-value array). Collecting competitor texts.
AI generation (Claude API): The concatenated data array goes through 3 sequential AI nodes (working strictly in JSON mode). The AI extracts facts, matches categories according to our dictionaries, writes SEO-optimized HTML text in two languages, and generates tags.
Export: Assembling all data into a ready XML/YML feed (or a separate Google Sheets sheet) with strict commercial defaults (currency, balance 0, etc.). Setting up Error Handling (so that in case of a parser error, the workflow logs the error and continues instead of failing).
Conditions: I have a fully written, maximally detailed technical specification. You won't have to invent logic, matching architecture, or write system prompts for the AI - all of this is already prepared and will be handed over to the executor. Your task is to correctly assemble these nodes in n8n, set up stable data transfer, and connect servers/proxies.
Provide your Telegram and answer two short questions:
What service/approach would you use in n8n for parsing a site protected by Cloudflare?
How would you configure the request to the Claude API to ensure it always returns a stable data structure (for example, for tags or categories) that won't break subsequent nodes in n8n?
I look forward to your feedback with a preliminary estimate of timelines and budget!
Applications 1
Client's review of cooperation with Illia Dunaiev
Hybrid Workflow (Scraping + Claude 3.5 API) for mass generation of product cards-
I don't pass Telegram - both parties can get blocked on the marketplace for that =/ It's better to keep everything here for now.
Regarding Cloudflare, I wouldn't do the main parsing with a local browser on a VPS. I would connect ZenRows or ScrapeOps through HTTP Request with JS rendering in n8n, using managed proxies, limits, retries, and proper logging. If the donors are very different, I would move the parser to a separate service and keep n8n as the orchestrator.
For the Claude API, I wouldn't rely solely on the prompt. Stability is achieved through tool use with JSON schema, temperature 0, a list of allowed categories and tags, schema validation in n8n, retrying the request on error, and quarantining the string if the model returns invalid data again. Then the subsequent nodes receive only validated fields.
The timeline estimate is 10-14 working days after access and test donors. As for the budget, a complete setup of such a conveyor is likely to start from 45,000 UAH. !!6,500 UAH is enough for a short prototype or audit of the technical specifications!!, not for stable configuration with logs, feed, proxies, and error handling.
My approach is to first establish a basic chain with 1-2 donors, check the data schema, then add category dictionaries, generate in two languages, assemble XML/YML, and monitor errors. We won't complicate things where n8n can handle it well, but we will extract code for what will become fragile in the nodes.
…
Need to clarify:
> how many donor sites and products need to be processed per day
> category dictionaries in the technical specifications - are these marketplace IDs or text names that still need to be matched?
Similar examples:
> https://business.ingello.com/prime-eva - e-commerce and product processes, close to feeds and operational automation
> https://business.ingello.com/vorfahr - AI automation with data processing and sequential logic
> https://systems-fl.ingello.com/ua - our FLH landing page on system development, integrations, and automation
-
472 1 0 Доброго дня! 3-тє місце соло на AI Agent Olympics Hackathon (AI Week 2026 Milan, найбільша AI-подія Європи, 731 команда). Production multi-agent + RAG системи на n8n + Claude - основний стек, MSc Strategic PM, PRINCE2.
По питаннях:
Cloudflare bypass: Scrapfly або ZenRows як основний шлюз через HTTP Request ноду (residential IP rotation + browser fingerprinting, ~65% domain coverage в 2026). Bright Data як fallback на складних доменах. ScrapeOps теж робочий варіант, але дешевший Scrapfly зазвичай дає кращу success rate на Cloudflare/DataDome.
Стабільний JSON від Claude: використовую нативний Structured Outputs (output_config.format з JSON schema) - це офіційна фіча Claude API, не prompt engineering. Гарантує валідний JSON під schema, наступні ноди в n8n не падають. Плюс fallback: JSON Parse нода з try/catch + retry з temperature 0. До речі, для production рекомендую Sonnet 4.6 або Opus 4.7 замість 3.5 - кращий JSON compliance та дешевше за токен на 4.6.
Орієнтовно: 7-10 днів, 15 000 грн.
…
Портфоліо у профілі
-
919 4 0 Hello, Georgiy, I implemented my last project using scrapling and LLM integration.
Let's go step by step:
1. For integration with Google Sheets, you can use the node with the official API.
2. Bypassing CAPTCHA, there are currently two approaches that work:
1) Use scrapling with their built-in DynamicFetcher with stealthy settings.
2) Use CDP with CloakBrowser — the most reliable option.
3. You can connect in three ways:
1) Write a lightweight backend and call it (reliable and stable)
2) Write your own node (time-consuming)
… 3) Use the Execute Command node (but this is less stable than the first option)
4. For LLM responses, there are built-in validations for structure output. You can also connect via openrouter (they have additional protection for this).
So, my recommended option is to create your small backend and call it using n8n. This fits well with the logic you described.
I would be happy to work with you!
-
3067 11 0 1 Good day, Georgiy!
I specialize in developing automated AI/workflow systems, integrations, and building pipelines on n8n using LLM models, parsers, and third-party APIs. I work on tasks where data processing stability, scalability, and structure control between nodes are important. Your case looks like a well-thought-out production pipeline, and this is exactly the type of task where n8n fits very well as an orchestration layer.
To provide you with an objective estimate on timelines and costs, I would like to clarify a few points:
— What is the estimated processing volume planned (products/donors per day)?
— Are all donors uniform in structure, or will separate mapping/parsing be needed for different sites?
— Is the XML/YML feed formed for a specific marketplace or under your own standard?
… — Is caching/saving of intermediate results needed (for example, parsed raw data or AI output)?
— Is the VPS already set up for n8n, or does this also need to be configured?
Regarding your questions:
To bypass Cloudflare in n8n, I would use ScrapeOps or ZenRows with proxy rotation and browser rendering. If the donors are more complex — Playwright/Puppeteer through a separate microservice that n8n calls via webhook. This provides a significantly more stable parsing flow in production.
For the Claude API, I would use strict JSON mode under schema validation: clear system prompt + predefined JSON schema + an additional validation layer in n8n (IF/Function node), so that no incorrect response breaks subsequent nodes. For critical structures (categories, tags) — also fallback/retry logic.
After your responses, I will be able to accurately assess the scope of work, propose an optimal architecture, and provide realistic timelines and budget.
Georgiy, I suggest discussing the details in private messages — I can help assemble not just a workflow, but a stable production-ready pipeline that will scale without constant manual adjustments.
-
976 4 0 Доброго дня
Мене звати Дмитро. Маю досвід з n8n, AI automation, Claude API, парсингом, workflow orchestration та e-commerce автоматизаціями.
Ваш pipeline виглядає дуже грамотно побудованим — особливо подобається підхід із JSON-only AI flow та розділенням логіки по етапах.
Можу допомогти з:
— побудовою та стабілізацією n8n workflow
— інтеграцією Google Sheets/Drive API
— Claude API orchestration
— Cloudflare bypass parsing
— XML/YML feed generation
… — error handling / retry logic
— queue processing
— VPS deployment та proxy setup
Відповіді на питання:
1. Для Cloudflare parsing у n8n найкраще використовував би ZenRows або ScrapeOps + fallback strategy через rotating proxies/browser rendering. Для складних кейсів — headless browser node (Playwright/Puppeteer через external worker).
2. Для Claude API використовував би strict JSON schema approach:
— system prompt із жорсткою структурою
— JSON mode / prefilled response template
— validation node після Claude
— fallback parser + auto-retry при invalid JSON
— enums/allowed values для категорій і тегів
Так workflow не буде ламати наступні вузли.
По строках: орієнтовно 1–3 тижні залежно від складності парсерів та кількості donor sources.
Бюджет: після перегляду повного ТЗ.
-
1904 29 1 1 Hello! I have relevant experience with similar tasks, and I can help with your project. I can quickly familiarize myself with the requirements, propose the best solution, and execute the work thoroughly. I am ready to discuss the details and get started.
-
650 2 0 Good day! 👋
Regarding your questions:
To bypass Cloudflare — ZenRows or ScraperAPI via HTTP Request node with proxy rotation.
Stable structure from Claude — system prompt with a strict JSON schema + temperature: 0 + validation in Function node with a retry on invalid JSON.
We will implement everything — all 4 stages from reading Google Sheets to the finished XML/YML feed. We are ready to start immediately. We will discuss the details in private 🤝
-
253 Hello! We are a team of developers with 4 years of experience in bot development, process automation, and professional data parsing in Python. Hybrid workflows based on Claude 3.5 are one of the best ways to effectively extract and structure frequently changing information. We will take on the full cycle: from writing a reliable scraper (scripts to bypass blocks) to integrating a neural network for intelligent analytics of the collected content. Since we are a team, you will receive a tested and fault-tolerant turnkey solution with a well-thought-out architecture. Let's discuss the data volumes and target sites in private messages for precise system configuration!
-
256 Hello! Our team has 4 years of experience in automation, data parsing, and developing intelligent bots. We specialize in Python and building complex information gathering systems, so integrating Claude 3.5 for processing unstructured data in a hybrid workflow is our core task. We understand how to optimize prompts, bypass website protections using proxies, and set up flexible content filtering logic through the Antropic API. We will provide clean code, high parsing speed, data protection, and stable architecture operation regardless of changes in the structure of target resources. We are ready to discuss the pipeline logic and output data format right now. When would it be convenient for you to chat?
-
2008 36 3 Good day, I can implement everything turnkey without any problems. I have over 5 years of experience in automating business solutions using Python, have worked with large brands, and developed more than 50 automations, with and without AI. I would like to take a closer look at the technical task; if it is written using AI, 70% will need to be redone since they often do not select the right technologies or solutions. I will guide you through each stage and answer any questions. Feel free to reach out!
-
9984 117 0 Hello.
I can set up a workflow for you. Write to me, and we will discuss.
-
196 We already have a nearly ready similar solution for AI generation of cards and feeds, which can be quickly adapted to your tables, donors, and marketplace rules. ))
In terms of timing, I would estimate about 5-7 days for the first stable phase.
Regarding the budget - starting from 18,000 UAH for a working conveyor in n8n with logs, structured JSON, and export to XML or YML.
6,500 UAH will likely only be enough for a prototype without a proper error handling buffer for parsing.
I won't be able to switch to Telegram within the first contact - the marketplace may ban both parties.
We can agree on everything here.
… For Cloudflare, I would use ZenRows, ScrapeOps, or Apify with browser rendering, proxy rotation, retries, and result caching.
Regarding the Claude API - due to the strict data schema.
I would set up separate validation nodes after each AI step, JSON schema, a limited list of categories from dictionaries, a retry request for invalid structures, and a fallback scenario when a field is not defined.
This way, the subsequent n8n nodes won't break due to an extra comma or unexpected format.
> Question 1 - Is the final feed format for a specific marketplace already defined in your specifications, or will we need to verify the mandatory fields during implementation?
> Question 2 - How many donors and approximately how many products need to be processed in one run?
Similar cases from Ingello:
> https://business.ingello.com/vorfahr - automation with AI and data processing
> https://business.ingello.com/prime-eva - e-commerce and systematic work with product data
> https://business.ingello.com/fractal - multi-agent processes and complex automations
Main page for FLH:
> https://systems-fl.ingello.com/ua
I can take the first phase - to assemble a working scenario in n8n, connect the table, parser, Claude, JSON validation, error logging, and test feed export.
In touch ))
-
726 9 1 Hello! I have reviewed your project and am ready to start working immediately. I guarantee excellent quality and promptness.
-
1318 2 0 Hello. Four stages with Cloudflare bypass and three consecutive Claude nodes in JSON mode is not a typical n8n flow; here you need to think about retry logic and edge case handling at the data level, not just connecting nodes. Over the past year, I have built two similar pipelines for e-commerce: one using ScrapeOps + n8n for 800+ SKUs daily, and the other with ZenRows + OpenRouter for multi-core parsing. Both produce YML feeds without manual intervention. For Cloudflare - ZenRows with JS rendering, because ScrapeOps often fails on complex challenges. For stable JSON from Claude - a separate system prompt with an example (few-shot) plus a validation node using a zod schema before passing it on; if Claude returns broken JSON - the node catches it, logs it, and retries with a temperature of 0.1. Regarding timelines - 3-5 working days, as the technical specification is already ready, but Cloudflare bypass always requires testing on real donors. Question: how many donor sites are approximately in the pool and are there any among them that use PerimeterX or DataDome? This will affect the choice of gateway and the number of test iterations.
-
Що з бюджетами тут сталось, чому всі хочуть купити хогвартс по ціні однушки у Житомирі...
-
Current freelance projects in the category Databases & SQL
Power BI
16 USD
The dashboard runs on a Google server, we need to transfer all logic + code Required skills: deployment of ETL processes on a Linux server + working with BigQuery and Postgre There is working code that runs on Google, where the database itself is, we need to transfer it to… Python, Databases & SQL ∙ 1 hour 48 minutes back ∙ 3 proposals |
Work with BAS CORPHello! Currently, the following tasks need to be completed: Adjustment of the printed form in BAS KORP according to the client's parameters. Specifically, the task is not to display certain fields. Updating the classifier of professions in BAS KORP for a number of… Payment Systems Integration, Databases & SQL ∙ 18 hours 42 minutes back ∙ 8 proposals |
Need a 1C specialist for refinements and development.I am looking for a 1C specialist for freelance collaboration. I am currently working with a contractor who provides support and maintenance for the 1C system. However, due to the contractor's workload, there is a need for prompt execution of additional tasks, improvements, and… Databases & SQL ∙ 2 days 1 hour back ∙ 11 proposals |
Restoring Instagram after a banOn March 3, 2026, my Instagram was blocked due to a violation of rules. I believe there has been some mistake, as the page was personal with photos and videos of my personal trips. I did not post any comments anywhere. I have sent several appeals. I cannot download my files… Databases & SQL, Web Programming ∙ 2 days 12 hours back ∙ 5 proposals |
Eliminate the issue of incomplete data import from Excel files of Nova Poshta specifications into 1C:
45 USD
Goal: There is a processing in 1C for uploading the specifications of Nova Poshta. For unclear reasons, it has stopped loading some tabular data. We need to find the reason and eliminate the problem of incomplete data import from Excel files of Nova Poshta specifications into… System & Network Administration, Databases & SQL ∙ 5 days 19 hours back ∙ 16 proposals |