Financial Voice Agent for Call Center

Voice AI Engineer2025Mykhailo Z.

Voice AI Engineer

Key Expertise

Voice AI AgentsReal-time Audio StreamingAgentic RAG SystemsOn-premise LLM DeploymentSpeech EngineeringMultimodal Document AI

Experience

7+ years

Timezone

CET (GMT +1)

Skills

AI / ML

DeepseekRay.ioTTSNVIDIA NeMoTransformersTriton Inference ServerNVIDIA RivaEmbedding modelsLlamaOllamaWhisperLangGraphMistral/MixtralSTTQwenllama.cppAgentic frameworksRAGOCRGeminiLlamaIndexElevenLabsDiarizationPyannoteKAGMLflowvLLMClaudeLangChainPydantic Agents

Languages

Python

Databases

ChromaDBQdrantMongoDBCosmosDBOpenSearchPineconeElasticsearchRedisPostgreSQLFAISS

Infrastructure

KafkaDocker ComposeLangfuseKubernetesSageMakerDockerPydantic’s LogfireEKSLangSmith

Frameworks

Dagstern8nApache Airflow

Integrations & Protocols

RTP over UDPWebSocketLiveKitAsterisk PBXWebRTC

7-day risk-free trial

Response within 24 hours

View Full Profile

Overview

Voice agent integration for a financial services company with a focus on mobile stability. Focus: Integrated AI agents with telephony infrastructure. Solved architectural challenges regarding vendor integrations. Performance: Focused on maintaining high communication quality over mobile networks.

Achievements

• Reduced voice latency from 200–250ms to 60–80ms through codec optimization and routing. • Successfully deployed real-time voice AI agent over unstable mobile networks in Uganda. • Built a production-grade bridge between Asterisk PBX and OpenAI Realtime API. • Achieved stable call quality using UDP protocol with minimal buffering layers.

Responsibilities

Designed and implemented real-time voice communication architecture: Asterisk PBX ↔AudioSocket ↔ OpenAI Realtime API.
Optimized latency by switching from 16-bit PCM to 8-bit codecs (G.711/μ-law).
Configured intermediate server routing for optimal network paths.
Implemented RTP over UDP for production telephony to minimize delays.
Integrated AI agents with telephony infrastructure, resolving vendor-specific challenges.
Built and maintained voice agent flow: call handling, speech recognition, LLM processing, TTS response.

Technologies Used

PythonAsterisk PBXOpenAIOpenAI APIWebSocketDockerRTP over UDP

Mykhailo Z.

Voice AI Engineer

Key Expertise

Voice AI AgentsReal-time Audio StreamingAgentic RAG SystemsOn-premise LLM DeploymentSpeech EngineeringMultimodal Document AI

Experience

7+ years

Timezone

CET (GMT +1)

Skills

AI / ML

Languages

Python

Databases

ChromaDBQdrantMongoDBCosmosDBOpenSearchPineconeElasticsearchRedisPostgreSQLFAISS

Infrastructure

KafkaDocker ComposeLangfuseKubernetesSageMakerDockerPydantic’s LogfireEKSLangSmith

Frameworks

Dagstern8nApache Airflow

Integrations & Protocols

RTP over UDPWebSocketLiveKitAsterisk PBXWebRTC

7-day risk-free trial

Response within 24 hours

View Full Profile

This project was delivered by

Mykhailo Z.

View Full Profile

More Projects by Mykhailo Z.

2024

Interrogation Transcription System for Law Enforcement

Voice AI Engineer

Automated real-time transcription of interviews to generate official protocols in a secure environment. On-premise (air-gapped) deployment ensuring maximum security and data privacy. Core Model: Python, OpenAI Whisper, Pyannote, Docker, on-premise deployment Orchestration: Custom system for real-time processing (voice detection + chunking + transcription). Supports up to 10 simultaneous sessions. Fine-tuning Pipeline: Created a pipeline for periodic model updates using client-provided datasets (edited transcripts). Focused on adapting to (local dialect) and low-quality audio. Metrics: Used WER (Word Error Rate) and CER (Character Error Rate) to validate model performance. Deployment: On-premise (Air-gapped). All components are deployed locally to ensure maximum security and data privacy.

PythonOpenAIWhisperPyannoteDocker

View Details

2023-2024

RAG for Medical equipment marketplace

AI Engineer

Knowledge base system for medical device documentation with semantic search capabilities. Pipeline: Web scraping of manufacturer manuals for specified medical devices → chunking → indexing with metadata → storage in vector database. Core Functionality: On query, retrieves relevant documentation and specifications for a given medical device.

Ray.ioChromaDBLlamaLangChain

View Details

Ready to Build Your AI Team?

Get matched with the right AI experts for your project. Book a free discovery call to discuss your requirements.

Book a Discovery Call Browse All Experts

We respond within 24 hours.

Financial Voice Agent for Call Center

Overview

Achievements

Responsibilities

Technologies Used

More Projects by Mykhailo Z.

Interrogation Transcription System for Law Enforcement

RAG for Medical equipment marketplace

Ready to Build Your AI Team?

Solutions

Gemini Enterprise

Company