Switch to English?
Yes
Переключитись на українську?
Так
Переключиться на русскую?
Да
Przełączyć się na polską?
Tak
Post your project for free and start receiving proposals from freelancers within minutes after publication!

Development of a high-load system with fine-tuning of LLM models

Translated

  1. 5093
     30  0
    Work example:
    Mobile app with admin
    45 days7600 USD

    Look, there's a nuance here - this is not just a chatbot task, but a product search system where the cost of architectural mistakes is high =/

    For the timeline, I would allocate the first safe phase for 45 days and 7600 USD - design, testing multimodal search on your products, a prototype agent in messengers, data schema, photo processing queue, search quality metrics, and a plan for model retraining. Full development will depend on the load and catalog, most likely this will be a separate budget starting from 20000 USD.

    The vision for implementation is as follows -
    > normalize the catalog, attributes, photos, and texts of products
    > build multimodal search - vector index, ranking, filters, relevance checking
    > create an agent in messengers that clarifies the request, shows options, and sends events to the admin panel
    > separately assess search quality and data for retraining, otherwise the model will respond beautifully but sell weaker.

    From you at the start, I need the product catalog, examples of photos, descriptions of target messengers, and at least a rough estimate of peak loads. Overall, it’s fine to start not with a large construction project, but with a verifiable phase - first prove the quality of search on real products.

    I will clarify 2 things -
    > what is the volume of the catalog - products, photos, languages, updates per day
    > is retraining the LLM really necessary or can we start with multimodal embeddings, ranking, and RAG, leaving fine-tuning for after accumulating training examples?

    From recent experience -
    > https://business.ingello.com/vorfahr - AI service, working with photos, content generation, and product logic
    > https://business.ingello.com/fractal - agent architecture and automation of complex workflows
    > https://business.ingello.com/prime-eva - product data, manufacturing, integrations, and accounting

    Main page of the team for FLH -
    > https://systems-fl.ingello.com

  2. 893    1  0
    1 day165 USD

    Good day, Nick.

    In brief:

    Your service will receive a multimodal search capable of simultaneously processing photo and text queries through a personal agent in Telegram or WhatsApp. The search core based on a fine-tuned LLM and vision models will ensure high accuracy of results. We will enhance text search with a vector index pgvector to instantly find similar products even with incomplete descriptions. The system will operate on Kubernetes with automatic scaling, so peak loads will not affect speed. Integration through messengers will allow users to send photos and text in chat, and the assistant will return a selection of products from your catalog. Fine-tuning on your data guarantees accuracy unattainable for general models.

    More details:

    Multimodal search requires deep fine-tuning of the LLM for your catalog; otherwise, relevance will remain at the level of generic models. The search core will combine a base model with additional training on your data and a vector index pgvector for semantic search. The vision component will extract features from photos, while text will clarify the context—both streams will merge into a single ranked result. The architecture on Kubernetes with independent scaling of nodes (vision, text, ranking) will ensure stable latency. An event-driven architecture with message queues will protect against query loss during traffic spikes, and the user will receive product cards with direct links.

    The next step is to increase search accuracy through a hybrid RAG approach with fine-tuning. This will allow the agent to instantly update knowledge about stock and prices without retraining the model, which is critical for a dynamic e-commerce catalog.

    Later on, conversion will increase thanks to an analytical module that will track the most popular visual queries. You will gain a clear understanding of what users are searching for to optimize purchasing and marketing.

    I am ready to discuss the details of the catalog, query volumes, and the choice of the base model for fine-tuning.

  3. 1510    10  0
    45 days1300 USD

    We have experience in building high-load architectures and fine-tuning LLM for multimodal tasks. We implement the system through microservices in Python using vector databases for fast search and integrating messenger APIs for agent operation. We will ensure scalability and accuracy in processing requests for photos and text. We are ready to start designing the architecture.

  4. 196  
    35 days25 000 USD

    We already have a practically ready similar solution for AI search and assistants in messengers, which can be quickly adapted and launched for your catalog ))
    For the task, I would estimate the first working stage at 250,000 UAH and about 35 working days for a prototype with photo and text search, ranking, an agent, and basic integration into messengers.
    Look, there’s a nuance here - for the industrial version, it’s crucial to check in advance !!the load, search quality, and catalog structure!!, otherwise fine-tuning can become an expensive toy without noticeable gains.
    For implementation, I would go through a data layer, vector and text search, a separate ranking layer, agent scenarios, logging requests, and quality metrics on the service side.
    From you, I need the catalog export, photos, descriptions, output rules, a list of messengers, and access to the test API.
    I’d like to clarify two points - what is the size of the catalog and what is the target load in requests per minute?
    Which messengers are needed for the first launch and is there already an API for the catalog?
    Similar examples for AI and agents - https://business.ingello.com/vorfahr and https://business.ingello.com/fractal
    As a close example for e-commerce and catalog logic - https://business.ingello.com/prime-eva
    More about us for the exchange - https://systems-fl.ingello.com
    In touch, we can discuss the details within the project =)

  5. Another 2 proposals concealed

Client
Nick Stolyr
United Kingdom United Kingdom
Project published
1 hour 8 minutes back
77 views
Until closing
13 days 22 hours
Tags