© 2024-2025 Craftium.AI.

OpenAI announced new models for voice synthesis and transcription

The new models can convey emotions in voice and better recognize accents, but will not be open source

Eleni Karasidi
Published: 21.03.2025

OpenAI has introduced new generative AI models for transcription and voice synthesis, available through its API. The new models, named gpt-4o-mini-tts and gpt-4o-transcribe, promise to improve on previous versions by offering more realistic sound and the ability to customize speaking styles. For example, developers can instruct the model to speak “like a mad scientist” or with “a calm voice, like a meditation teacher.”

The speech synthesis model, gpt-4o-mini-tts, converts text into speech with greater accuracy and can reproduce emotional nuances in the voice. This can be useful in applications such as customer support, where it is important to convey an apology or empathy through voice. According to OpenAI representatives, this allows users and developers to control not only what is said, but also how it sounds.
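As a rough illustration of controlling both what is said and how it sounds, the sketch below builds a request body for the speech endpoint without sending it. The helper name `build_speech_request` is hypothetical, and the field names (`model`, `input`, `voice`, `instructions`), the `alloy` voice, and the endpoint URL reflect my understanding of OpenAI's speech API rather than anything stated in the article, so check them against the current API reference before use.

```python
import json

# Assumed endpoint for text-to-speech requests; verify in the API reference.
SPEECH_ENDPOINT = "https://api.openai.com/v1/audio/speech"


def build_speech_request(text, style, voice="alloy"):
    """Return a JSON-serializable body for a text-to-speech request.

    text  -- what should be said
    style -- free-form instruction describing how it should sound
    voice -- a preset voice name (assumed default, not from the article)
    """
    return {
        "model": "gpt-4o-mini-tts",
        "input": text,
        "voice": voice,
        "instructions": style,
    }


# Example: a customer-support apology delivered in a calm tone.
payload = build_speech_request(
    "We're sorry for the delay with your order.",
    "Speak in a calm voice, like a meditation teacher.",
)
print(json.dumps(payload, indent=2))
```

The spoken text and the style instruction are kept as separate fields, so the same sentence can be rendered in a different tone by changing only `style`.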

The gpt-4o-transcribe model replaces the previous Whisper model for transcription. It is trained on a diverse set of high-quality audio data, which enables better recognition of accents and varied speech, even in challenging conditions. This significantly reduces the likelihood of errors that previously occurred with Whisper, such as invented words or phrases appearing in transcripts.

Despite the improvements, OpenAI does not plan to release the new transcription models openly. Company representatives note that the new models are significantly larger than Whisper and are not well suited to running locally on ordinary devices. They emphasize a cautious approach to open sourcing, so that any model released openly is suited to a specific need.

TAGGED: API, OpenAI, Voice generation
SOURCES: techcrunch.com
Craftium AI is a team that closely follows the development of generative AI, applies it in their creative work, and eagerly shares their own discoveries.
