During the Open AI Spring Update Event with a live demo of GPT-4o realtime conversational speech, Mark Chen called out āIf you are wondering about this wire, it is so we have consistent internetā ā yet the real-world internet is anything but consistent.
The demo itself showcased another major leap forward LLMs. The model is a step toward more natural human-computer interactionāit can be interrupted, and it can even understand emotion in a userās voice. Yet to truly be compelling and valuable to users globally and at scale, it needs to work reliably over challenging real-world internet conditions and across the wide range of mobile devices and networks in use today.
One of the primary challenges is the ālast mileā connection between a userās mobile device and Internet Service Provider (ISP). This connection typically has rapid fluctuations in available bandwidth, signal dropouts, higher congestion and packet-loss making it incredibly difficult to deliver reliable and consistent real-time communication (RTC). The same challenges that exist for human-to-human RTC also exist for communication between humans and AI.
For the past decade, Agora has been focused on building out infrastructure and end-to-end technologies to deliver the real-time internet, optimizing performance for natural conversations despite wireless last mile challenges.
Agoraās infrastructure is used by thousands of app developers globally, powering reliable real-time communications on over 3B mobile devices with over 60B minutes per month of audio and video usage. Natural communication between humans and AI requires a stable and reliable real-time internet. Learn more about how Agora helps developers build conversational AI.