A webui for different audio related Neural Networks
Updated May 19, 2025 - Python
PALLAIDIUM - a generative AI movie studio integrated in the Blender Video Editor.
🔊 Text-Prompted Generative Audio Model with Gradio
A very lightweight notification gateway that aggregates various push channels. It is deployed serverlessly and runs at almost zero cost.
A self-hosted webui for 30+ generative AI
Plugin that lets you ask questions about your documents including audio and video files.
🔊 Text2Speech, Voice-Cloning and Voice2Voice conversion with the text-prompted generative audio model Bark
A Python Flask-based web UI designed to facilitate the generation of text-to-speech using Suno AI's Bark.
A multimodal Discord bot with machine learning functions, including LLM chat, image generation, and speech generation capabilities
The project focuses on leveraging technology to create new courses, personalize existing ones, and enhance the assessment process, ultimately contributing to the development of 21st-century skills in students.
jBark is a Python library that builds on the original Bark text-to-speech project [https://github.com/suno-ai/bark], adding simple voice conversion features. It provides an interface for generating high-quality speech from text, extracting basic voice characteristics, and more.
A TTS app where you can clone the voices of any person you wish.
This repository integrates both a playground (UI) and API endpoints, allowing us to fully utilize the capabilities of the model from the suno-ai/bark repository, a cutting-edge text-to-audio transformer model. To facilitate easy deployment, I have also written a Dockerfile, enabling us to host the model within a container on our server.