We have hosted the application heartmula in order to run this application in our online workstations with Wine or directly.


Quick description about heartmula:

HeartMuLa is the open-source library and reference implementation for the HeartMuLa family of music foundation models, designed to support both music generation and music-related understanding tasks in a cohesive stack. At the center is HeartMuLa, a music language model that generates music conditioned on inputs like lyrics and tags, with multilingual support that broadens the range of lyric-driven use cases. The project also includes HeartCodec, a music codec optimized for high reconstruction fidelity, enabling efficient tokenization and reconstruction workflows that are critical for training and generation pipelines. For text extraction from audio, it provides HeartTranscriptor, a Whisper-based model tuned specifically for lyrics transcription, which helps bridge generated or recorded audio back into structured text. It also introduces HeartCLAP, which aligns audio and text into a shared embedding space.

Features:
  • Music generation model conditioned on lyrics and descriptive tags
  • Multilingual lyric support for broader creative workflows
  • High-fidelity music codec for audio tokenization and reconstruction
  • Lyrics transcription model tuned from a Whisper baseline
  • Audio�text alignment embeddings for cross-modal retrieval
  • Reference library with example workflows for inference and evaluation


Programming Language: Python.
Categories:
AI Models

Page navigation:

©2024. Winfy. All Rights Reserved.

By OD Group OU – Registry code: 1609791 -VAT number: EE102345621.