Chen1399 (JIJIN CHEN) / Starred · GitHub
  • HangZhou

Starred repositories

Natural Language Web

Python 4,343 389 Updated May 28, 2025

AudioTrust: Benchmarking the Multi-faceted Trustworthiness of Audio Large Language Models

Python 136 15 Updated May 23, 2025

A named entity recognition and relation extraction system for the cybersecurity domain

Python 131 83 Updated May 15, 2025
Python 96 5 Updated Apr 28, 2025
Python 375 33 Updated May 6, 2025

Speech, Language, Audio, Music Processing with Large Language Model

Python 812 79 Updated Apr 24, 2025

A streamlined and customizable framework for efficient large model evaluation and performance benchmarking

Python 1,047 115 Updated May 28, 2025

The simplest, fastest repository for training/finetuning small-sized VLMs.

Python 2,994 240 Updated May 28, 2025

Update ASR paper everyday

Python 222 12 Updated May 29, 2025

AI Manus is a general-purpose AI Agent system that supports running various tools and operations in a sandbox environment.

Python 436 87 Updated May 26, 2025

TIGER: Time-frequency Interleaved Gain Extraction and Reconstruction for Efficient Speech Separation

Python 255 43 Updated May 22, 2025

The official implementation of "2025 ICLR Dynamic Diffusion Transformer" and "2025 arXiv DyDiT++: Dynamic Diffusion Transformers for Efficient Visual Generation".

Python 35 4 Updated Apr 10, 2025

Kimi-Audio, an open-source audio foundation model excelling in audio understanding, generation, and conversation

Python 3,690 235 Updated May 28, 2025

A TTS model capable of generating ultra-realistic dialogue in one pass.

Python 16,510 1,321 Updated May 28, 2025

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 17,307 2,240 Updated Feb 1, 2025

Speech-to-text, text-to-speech, speaker diarization, speech enhancement, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, HarmonyOS…

C++ 6,151 706 Updated May 29, 2025

SkyReels-A2: Compose anything in video diffusion transformers

Python 526 45 Updated Apr 22, 2025

SkyReels-V2: Infinite-length Film Generative model

Python 2,636 308 Updated May 27, 2025

A collection of MCP servers.

51,922 3,895 Updated May 28, 2025
Jupyter Notebook 120 19 Updated Oct 25, 2021
Python 15 Updated Apr 2, 2025

MediaTek's TFLite delegate

C++ 45 6 Updated Apr 4, 2024

c:geo - The powerful Android geocaching app.

Java 1,431 579 Updated May 28, 2025

"LightRAG: Simple and Fast Retrieval-Augmented Generation"

Python 16,834 2,311 Updated May 27, 2025

Qwen1.5-SFT (Alibaba): fine-tuning (transformers), LoRA (peft), and inference for Qwen_Qwen1.5-2B-Chat/Qwen_Qwen1.5-7B-Chat

Python 61 5 Updated May 17, 2024

[Item-by-item processing complete] A QA dataset of selected Ruozhiba questions, with every entry manually reviewed and revised

200 10 Updated Apr 10, 2025

COG-MHEAR Audio-Visual Speech Enhancement Challenge

Python 40 11 Updated May 7, 2025

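🚀 Reverse-engineered API for Alibaba's Tongyi Qianwen 2.5 large model [specialty: all-rounder]. Supports high-speed streaming output, watermark-free AI image generation, long-document interpretation, image analysis, web search, and multi-turn conversation, with zero-configuration deployment, multi-token support, and automatic cleanup of session traces. For testing only; for commercial use, please go to the official open platform.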

TypeScript 922 250 Updated May 21, 2025

GitHub's official MCP Server

Go 14,552 949 Updated May 29, 2025