We have hosted the application ims toucan in order to run this application in our online workstations with Wine or directly.


Quick description about ims toucan:

IMS-Toucan is a toolkit for training, using, and teaching state-of-the-art text-to-speech systems, built at the Institute for Natural Language Processing (IMS), University of Stuttgart. It is the official home of ToucanTTS, a massively multilingual TTS system designed to support over 7,000 languages with a single unified framework. The toolkit focuses on being fast and controllable while not requiring huge amounts of compute, making it practical for research labs and smaller teams. It includes complete pipelines for preprocessing datasets, training models, and running inference, plus a storage configuration system to manage where models and caches are stored. IMS-Toucan ships with several ready-to-run scripts, including GUIs for interactive demos, prosody override tools, zero-shot language embedding injection, and text-to-audio file generation. Pretrained models are automatically downloaded when needed, and there is an online demo instance hosted on GPU that anyone can try.

Features:
  • Massively multilingual TTS toolkit supporting over 7,000 languages
  • End-to-end training and inference pipeline with dedicated scripts for each stage
  • Interactive GUIs for simple and advanced control over prosody, style, and output
  • Automatic download and management of pretrained models via integrated storage config
  • Optional eSpeak-NG integration for robust phonemization across many languages
  • Hugging Face-hosted demo and dataset for quick experimentation and benchmarking


Programming Language: Python.
Categories:
Text to Speech

Page navigation:

©2024. Winfy. All Rights Reserved.

By OD Group OU – Registry code: 1609791 -VAT number: EE102345621.