The IEEE Real-Time Communications Conference and HackRTC are globally recognized collaborative events where industry and academia connect. This annual conference brings together technical professionals and business executives from the data and telecommunications industry, standards bodies, policy, and regulatory institutions, and academic educators and researchers to promote an open exchange of ideas to lead to future development in the rapidly changing field of real-time communications.
The advent of multimodal large language models has introduced a new kind of participant into human real-time communication (RTC) infrastructure. While these models lack ears, they can still listen; though they have no mouth, they can speak. This raises a compelling question: how will the interfaces for integrating these models differ from the traditional microphones and speakers we use today? And how will these models, as new "customers" of RTC infrastructure, behave compared to humans?
For example, the codecs designed for human auditory processing may need to evolve. LLMs could "speak" at speeds far beyond human capability or process several seconds of speech in just one second, provided the data arrives simultaneously. What new requirements will these capabilities place on RTC infrastructure, and how will it adapt?
With over 700 billion minutes of real-time audio and video running through Agora’s RTC infrastructure annually, we are finely tuned for human-to-human communication. We invite you to join this keynote as we explore the potential of real-time communication using large language models and examine the possibilities and challenges in this innovative form of conversation.
When: Wednesday, Oct 9 2024 | 2:15pm - 3:00pm CT
As conversational AI comes closer to achieving truly natural real-time interactions via voice and video, there are a few major challenges to rolling this functionality out to the public including the last mile, latency, time-to-market and the cost to scale. This presentation delves into what is necessary to bridge the gap between laboratory performance and reliable, cost-effective real-time conversational AI experiences in diverse real-world environments.
When: Wednesday, Oct 9 2024 | 12:30pm - 1:00pm CT