Freelance projects

Freelance projects

Optimization of expenses on LLM API (GPT and others) in real time

AI & Machine Learning, Python — incorrectly specified categories?

600 USD

Project translated automatically. Log in or register, to view the original

It is necessary to reduce the cost of using text LLM models (ChatGPT, GPT-4.1/5 and analogs) by 50–70% from the official price.

The optimization must work in real time — that is, the client sends a request and receives an answer immediately through the optimized layer, without manual adjustments.

1) Have specific experience in optimizing API costs

2) Immediately provide how the solution will be implemented and what will be required from my side.

No fluff or general words — only a specific action plan.

Proposals 8 Discussions 1

Please select

Oleksandr Stinkovyi

117 0

Budget: 600 USD Deadline: 5 days

Hello.

I am a NodeJS developer. I am ready to take it on. Write to me, we will discuss.

Nikita Kliuchnyk

0 0

Projects -
Rating -
Rating 239

Budget: 600 USD Deadline: 5 days

Hello!
I can reduce costs on LLM (ChatGPT, GPT-4.1/5) by 50–70% in real time through an intermediary layer:
Action plan:
Proxy service between the client and LLM.
Query optimization: prompt compression, token limitation, caching of repeated requests.
Choosing cheap models for non-critical tasks.
Integration via REST/WebSocket, token usage analytics.
What I need from you: API keys, access to client infrastructure, and an acceptable compromise between cost and quality.

Symon Baikov

25 1

Budget: 800 USD Deadline: 5 days

Hello, Ivan! 👋

I have reviewed your task and propose a clear action plan for optimizing expenses on LLM API:

Implementation Plan

Analysis of current requests

Study of the frequency, structure, and volume of requests to LLM API.

Identification of redundant tokens and unnecessary calls.

Request optimization

Minimization of prompts through templates and dynamic substitution.

Reduction of tokens through text preprocessing.

Caching of repeated requests in real time.

Optimization architecture

Implementation of a middleware layer between the client and the API.

Realization of buffering and reuse of responses.

Connection of cheaper models (GPT-3.5, fine-tuned models) for simple tasks with fallback to GPT-4.

Integration and testing

Implementation of the layer in a production environment.

Load testing with measurement of actual savings.

Gradual rollout without stopping current services.

Expected result

Reduction of expenses by 50–70% without loss of response quality.

Fully operational in real time — the client receives an optimized response without manual adjustments.

Flexible configuration for specific scenarios.

I am ready to start working immediately and provide a detailed roadmap for implementation after your confirmation.

Tetyana B.

1 0

Projects -
Rating -
Rating 483

Budget: 600 USD Deadline: 15 days

Hi,

I can create a Python/FastAPI proxy that lowers GPT and other LLM API costs by 50–70 % in real time. The layer will cache and reuse responses, compress prompts, and route traffic to cheaper or open-source models when possible, all without adding latency. From you I only need API keys, sample requests, and a server or cloud account. The endpoint will fully replace direct GPT calls so no manual changes are needed on your side. I can start as soon as I review your usage data.

Mikolay Lugovy

1 0

Projects -
Rating -
Rating 488

Budget: 1000 USD Deadline: 14 days

Hello!
My name is Mykola, I represent the ILMOX team — a full-cycle development and support of IT solutions. We help businesses and startups implement any digital projects: from MVP to large-scale systems.

Our main areas:
- Outsourcing / service model — development of websites, web and mobile applications, integrations, automation, support, technical support, consulting, outstaffing.
- Product model — creation of SaaS and mobile applications with various monetization models.
- Partnership and related projects — white label, subcontracting, referral programs.
- UX/UI design, DevOps, marketing support, integration of 3rd-party services.

Why us:
- Flexible terms: Fixed Price or hourly payment.
- Full transparency and quick start of work.
- Experience in various niches and technologies.

If you are looking for a reliable partner for the development or support of your product — we would be happy to discuss the details and send case studies.

Best regards,
Mykola
ILMOX Team

Mykhailo P.

74 1

Budget: 600 USD Deadline: 6 days

Good day. I have experience working with LLM and GPT API. I need to know more specifically the mechanism and essence of the project. I will be happy to help.

Tamara Ibrahim Sule A.

4 0

Budget: 800 USD Deadline: 7 days

Hello!

I can help you reduce training costs for the LLM program by more than half while ensuring prompt response processing, as I have experience solving similar issues.

I only need your API keys and log samples. I will take care of everything else and provide you with a dashboard so you can see the actual savings.

Thank you!

Vadim Tkachenko

1 0

Projects -
Rating -
Rating 739

Budget: 600 USD Deadline: 3 days

Hello, it depends on what you are using LLM for, that is when we can do the optimization itself, the optimization itself will most likely be a reduction of requests to the API itself, but perhaps we can replace the paid model with our own, I just need information from you on what you are using LLM for.

Current freelance projects in the category AI & Machine Learning

Mass processing of product photos using AI

12 proposals 13:53

Not specified
Development of software (ROS 2 / Nav2) for a 4x4 autonomous robot: Computer vision, asymmetric navigation

13 proposals 2 August

Not specified
I am looking for an AI bot developer (ChatGPT/OpenAI)

AI Consulting 80 proposals 1 August

Not specified
Integration of an AI agent in Manychat for processing incoming messages

AI Consulting 50 proposals 31 July

Not specified
Create an SEO system based on n8n

Bot Development 58 proposals 30 July

Not specified

Ivan Petrov
Erevan, Armenia

Projects -
Rating -
Rating 65

Oleksandr Stinkovyi

Nikita Kliuchnyk

Symon Baikov

Tetyana B.

Mikolay Lugovy

Mykhailo P.

Tamara Ibrahim Sule A.

Vadim Tkachenko

Proposals are currently absent

Current freelance projects in the category AI & Machine Learning

Mass processing of product photos using AI

Development of software (ROS 2 / Nav2) for a 4x4 autonomous robot: Computer vision, asymmetric navigation

I am looking for an AI bot developer (ChatGPT/OpenAI)

Integration of an AI agent in Manychat for processing incoming messages

Create an SEO system based on n8n