site-logo

JAWS PANKRATION 2024

Breaking Language Barriers : Revolutionizing Global Business Communication with AI-Powered Voice Translation

Lv200

Lv200

8/24/2024 06:20 (UTC)

Session Info

Our voice translation platform revolutionizes global business communication by breaking language barriers.

Outperforming major commercial APIs in specific domains, it offers multi-platform support for mobile and desktop applications, with offline functionality for mobile users.

We've developed superior speech-to-text and translation models, complemented by a virtual office platform for seamless collaboration.

Utilizing on-premise training servers and leveraging AWS cloud services, we've implemented state-of-the-art models like Whisper and LLaMA to ensure top-notch performance and scalability.

Already adopted by numerous global companies, our platform significantly reduces miscommunication and facilitates business growth.

As a web developer and AI engineer with WebRTC expertise, I've built a reliable infrastructure for this game-changing tool using AWS's robust ecosystem.

In this presentation, I'll cover our technical architecture, including our strategic use of AWS services, and our business model as a sponsor for this event, demonstrating how we're positioned to transform international business communication.

Akinori  Nakajima

Akinori Nakajima



Session Category
Application intergration
Computing
Database
Machine learning


AWS Services

Session Materials


    Session Summary (by Amazon Bedrock)
      Akinori Nakajima, the CEO of Voice Ping, introduced their product, a real-time voice translation system that enables seamless communication for global teams. He explained the challenges of using interpreters for international business meetings, such as high costs, logistical difficulties, and potential misunderstandings due to unfamiliarity with industry-specific terms. Voice Ping aims to solve these problems by offering real-time translation and subtitles that work seamlessly with video conferencing platforms like Zoom and Google Meet. Their system is built using cutting-edge AI models and technology, including whisper, BART, and GPUs, ensuring high accuracy and smooth real-time performance. Nakajima highlighted the advantages of Voice Ping over other translation tools, such as increased accuracy, support for technical terms, and features like automatic summarization and speaker identification. He mentioned that the product has gained traction, with over 40,000 accounts and 6,000 paid active users, growing at 11.1% per month. Voice Ping targets the B2B interpretation market, which is expected to reach $70.5 billion by 2030. The company plans to expand into B2C markets, create a voice API platform, develop AI glasses, and establish its own GPU data center to support training and deployment of AI models. Nakajima emphasized that Voice Ping's mission is to democratize efficient language communication using AI. The company has secured seed funding and has a talented team of AI developers from companies like SoftBank and IBM. In conclusion, Nakajima invited the audience to try Voice Ping's free real-time translation service and offered to answer any technical questions, highlighting the product's potential to revolutionize global communication and business.

    ©JAWS-UG (AWS User Group - Japan). All rights reserved.