OPEN SOURCE

Transformative Extensions Network (TEN)

TEN is an open-source framework for real-time, multimodal conversational AI, supported by Agora and the TEN community.

Build and deploy conversational AI with complete flexibility

Real-time multimodal interaction

Create AI agents that can see, hear, and speak in real time, providing a natural conversational experience.

Vendor-neutral AI

Choose among LLMs such as OpenAI models, Llama, DeepSeek, and Gemini, and bring your preferred technology for speech-to-text, text-to-speech, image generation, AI avatars, and more.

Deploy agents at scale

Deploy agents on your own backend while delivering ultra-low-latency voice and video over Agora’s global Software-Defined Real-Time Network (SD-RTN).

Build faster with Agora's Conversational AI Engine

Agora's hosted platform for voice AI, powered by TEN, means you don't need to worry about deployment and scalability.

TEN Components

Voice Activity Detection (VAD)

Detect whether a human voice is present in an audio frame with a lightweight, pre-trained, deep-learning voice activity detection (VAD) model.
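
To make the idea of a per-frame voice decision concrete, here is a minimal sketch in Python. It is not the TEN VAD model and does not use its API: it swaps in a plain energy threshold in place of a trained network, and the names frame_rms, is_voice, and the 0.02 threshold are assumptions chosen only for illustration.

```python
import math
from typing import Sequence


def frame_rms(frame: Sequence[float]) -> float:
    """Root-mean-square level of one audio frame (samples in [-1.0, 1.0])."""
    if not frame:
        return 0.0
    return math.sqrt(sum(s * s for s in frame) / len(frame))


def is_voice(frame: Sequence[float], threshold: float = 0.02) -> bool:
    """Hypothetical stand-in for a VAD decision: True when the frame is loud enough.

    A learned VAD model would replace this comparison with a model inference
    that scores how speech-like the frame is.
    """
    return frame_rms(frame) >= threshold


# A 10 ms frame at 16 kHz holds 160 samples.
silence = [0.0] * 160
tone = [0.5 * math.sin(2 * math.pi * 200 * n / 16000) for n in range(160)]

print(is_voice(silence))
print(is_voice(tone))
```

An energy threshold cannot tell speech from other loud sounds such as music or keyboard noise, which is the gap a pre-trained model is meant to close.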

Turn Detection

Detect natural turn-taking cues and enable intelligent interruption handling with an advanced turn detection model designed specifically for voice communication between humans and AI agents.

Use cases

Build advanced conversational AI experiences for any use case

The TEN Framework powers a diverse range of conversational AI use cases across industries.
Language learning
Facilitate language learning with AI tutors that reinforce lessons and enable stress-free practice.
In-game voice AI
Add voice-AI-powered characters or companion players to games to increase player engagement.
Emotional companionship
Give AI companions customized personalities and avatars.
Storytelling with visuals
Generate stories accompanied by images for immersive experiences.
AI customer service
Power call centers and customer service with AI agents that answer customer questions in real time.
Multimodal AI assistants
Build AI assistants that can see, hear, and speak when interacting with users.