AI Engineer for AI Voice Agent Enhancement: STT, TTS, SPEECH RECOGNITION, VOIP, Voice Technology, Twilio
₹37500-75000 INR
Imefungwa
Imechapishwa 28 days ago
₹37500-75000 INR
Kulipwa wakati wa kufikishwa
Project Overview:
We are [login to view URL], an AI voice agent platform focused on managing business calls efficiently. Our AI mimics human interaction to perform tasks such as booking appointments, answering queries, and handling missed calls. We are hiring an expert in STT (Speech-to-Text), VoIP, TTS (Text-to-Speech), Deepgram, Twilio, and Speech Recognition to enhance and optimize our platform.
Gig Details:
Duration: 1 month
Pay: ₹60,000 INR
Preferred Profiles: Candidates with active Fiverr or Upwork profiles will be given preference.
Tasks and Responsibilities:
1. Latency Optimization:
Reduce latency to 800ms or lower for smooth and real-time conversations.
2. Ambient Sound Integration:
Add natural and high-quality background sounds (e.g., coffee shop, call center, office).
Ensure the sounds are seamless and do not interfere with voice clarity.
3. End Call and Transfer Call Functionality:
Implement dynamic functionality to ensure AI understands when to end a call or transfer it based on user intent.
4. Advanced Settings Implementation:
Interruption Sensitivity:
Add a slider (0-1 scale) in the frontend to allow users to adjust sensitivity.
Provide clear documentation in the UI about the setting's purpose.
Reminder Message Frequency:
Allow users to define how often the AI sends reminders during long silences.
End Call on Silence:
Add a timer slider for users to control how long AI waits before ending a call due to silence.
5. Appointment Booking System:
Integrate with [login to view URL] for seamless appointment booking.
Ensure booking flows are smooth and error-free, resembling human interaction.
Confirm appointment details via email notifications to both users and callers.
Implement editable base prompts accessible in the user dashboard for customization.
6. Conversation Analysis Dashboard:
Build a dashboard to display analytics for each call, including:
Call Summary: Separate AI and user dialogue transcription with detailed logs.
Call Metrics: Status (e.g., Successful, Ended), Sentiment (e.g., Positive), Disconnection Reasons, and End-to-End Latency.
7. Dynamic Responses:
Replace static phrases with context-aware, dynamic dialogues for a more natural user experience.
8. Call Audio Recording:
Ensure high-quality call recordings with clear transcriptions and conversational analysis.
Expectations and Quality Standards:
High-Quality Work: All tasks must meet smooth and professional standards.
Dynamic Enhancements: Any additional requirements needed for task improvement must be incorporated proactively.
Seamless Functionality: No bugs or hiccups in the execution of deliverables.
User-Friendly Design: Ensure the frontend is intuitive and simple for users to interact with advanced settings.
Collaboration: Maintain clear communication regarding progress and challenges throughout the gig.
Preferred Qualifications:
Proven expertise in STT, TTS, VoIP, Deepgram, Twilio, and Speech Recognition.
Strong understanding of real-time systems and latency optimization.
Experience with scalable front-end and back-end system design.
Knowledge of conversational AI and NLP technologies.
Familiarity with API integrations for voice systems.
Note:
An audio sample showcasing the desired conversational quality is available and will be shared during onboarding.
If you have the skills, dedication, and passion to deliver high-quality results within the given timeline, we’d love to collaborate with you!
Hello
I'm an AI and full stack developer with 5 years of experience.
I have rich experience in voice call agent.
I'd like to disscus your project in detail.
I'm looking forward to work with you;.
Thank you
Hi
I am Stefan from Serbia. I have carefully read your job description "AI Engineer for AI Voice Agent Enhancement: STT, TTS, SPEECH RECOGNITION, VOIP, Voice Technology, Twilio". I am confident that I will be able to complete your job perfectly with my proficiency in skills such as AWS Lambda, Speech Synthesis, OpenAI, AI Text-to-speech and Whisper AI as I have worked on similar projects before. I have 6 years of experience in handling your project to your satisfaction within the given timeline. I look forward to the opportunity to discuss your project.
Thank you,
Stefan