Whale-Dolphin (Whale and Dolphin) / Starred · GitHub
  • Huazhong University of Science and Technology

Highlights

  • Pro

A course on aligning smol models.

Jupyter Notebook 6,028 2,153 Updated Jul 1, 2025

PyTorch implementation of [ThinkSound], a unified framework for generating audio from any modality, guided by Chain-of-Thought (CoT) reasoning.

Python 773 42 Updated Jul 16, 2025
Python 1,820 90 Updated Jul 15, 2025

WTF Solidity: a minimalist introductory tutorial for beginners. Now supports English! Official website: https://wtf.academy

Solidity 12,838 2,220 Updated Jul 9, 2025

An intuitive and low-overhead instrumentation tool for Python

Python 969 34 Updated Jul 8, 2025

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 9,243 734 Updated May 27, 2025

LLM Frontend for Power Users.

JavaScript 16,139 3,579 Updated Jul 16, 2025

Windows Subsystem for Linux

C++ 29,240 1,423 Updated Jul 16, 2025
Python 427 41 Updated May 6, 2025

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Jupyter Notebook 7,896 907 Updated Jul 7, 2025

Chat凉宫春日 (Chat Haruhi Suzumiya), an open-source role-playing chatbot by Cheng Li, Ziang Leng, and others.

Jupyter Notebook 1,989 179 Updated Aug 13, 2024

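🚀 QuickGo direct external links: seamlessly and automatically skips the security-center redirect pages on 50+ sites, including Zhihu, Jianshu, Juejin, CSDN, SSPAI, and Gitee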

TypeScript 711 25 Updated Jun 29, 2025

Tech Enthusiast Weekly, published every Friday

71,370 3,508 Updated Jul 11, 2025

Open CS Application | Open-source CS applications

JavaScript 2,165 251 Updated May 30, 2025

Shared summer research materials from njuphy

HTML 12 3 Updated Oct 25, 2021

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 147,038 29,664 Updated Jul 16, 2025

Open-source multimodal large language model that can hear and talk while thinking, featuring real-time end-to-end speech input and streaming audio output for conversation.

Python 3,369 285 Updated Nov 5, 2024
5 Updated Feb 21, 2025

Official repository for "Craw4LLM: Efficient Web Crawling for LLM Pretraining"

Python 633 57 Updated Feb 24, 2025
Python 12 2 Updated May 26, 2025

Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way.

TypeScript 47,656 6,231 Updated Jul 16, 2025

[AAAI 2025] Friends-MMC: A Dataset for Multi-modal Multi-party Conversation Understanding

Python 7 Updated Dec 24, 2024

Versatile Evaluation of Speech and Audio

Python 300 34 Updated Jul 5, 2025

UMETTS: A Unified Framework for Emotional Text-to-Speech Synthesis with Multimodal Prompts

Python 32 4 Updated Jun 12, 2025

Xiaomi Home Integration for Home Assistant

Python 20,303 1,041 Updated Jul 16, 2025

Genshin Datasets For SVC/SVS/TTS

684 41 Updated May 31, 2025

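AI Vtuber is a virtual streamer [Live2D/UE/xuniren] driven by [ChatterBot/ChatGPT/claude/langchain/chatglm/text-gen-webui/Wenda/Qwen/kimi/ollama] that can interact with viewers in real time during live streams on [Bilibili/Douyin/Kuaishou/WeChat Channels/Pinduoduo/Douyu/YouTube/twitch/TikTok], or chat directly on the local machine…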

Python 3,939 601 Updated Jun 4, 2025

Live2D Library for Python (C Extension): Supports model loading, lip-sync, basic face rigging, and precise click test.

C++ 342 37 Updated Jul 15, 2025