Chen1399 (JIJIN CHEN) / Starred · GitHub

JIJIN CHEN Chen1399

Audio Engineer

34 followers · 171 following

HangZhou

Starred repositories

microsoft / NLWeb

Natural Language Web

Python 4,343 389 Updated May 28, 2025

JusperLee / AudioTrust

AudioTrust: Benchmarking the Multi-faceted Trustworthiness of Audio Large Language Models

Python 136 15 Updated May 23, 2025

Sirrrrri / Cyber_NER-RE

A named entity recognition and relation extraction system for the cybersecurity domain

Python 131 83 Updated May 15, 2025

qiuqiangkong / audio_flow

Python 96 5 Updated Apr 28, 2025

maitrix-org / Voila

Python 375 33 Updated May 6, 2025

X-LANCE / SLAM-LLM

Speech, Language, Audio, Music Processing with Large Language Model

Python 812 79 Updated Apr 24, 2025

modelscope / evalscope

A streamlined and customizable framework for efficient large model evaluation and performance benchmarking

Python 1,047 115 Updated May 28, 2025

huggingface / nanoVLM

The simplest, fastest repository for training/finetuning small-sized VLMs.

Python 2,994 240 Updated May 28, 2025

halsay / ASR-TTS-paper-daily

Updates ASR papers every day

Python 222 12 Updated May 29, 2025

Simpleyyt / ai-manus

AI Manus is a general-purpose AI Agent system that supports running various tools and operations in a sandbox environment.

Python 436 87 Updated May 26, 2025

duixcom / Duix.mobile

C++ 6,555 958 Updated May 28, 2025

JusperLee / TIGER

TIGER: Time-frequency Interleaved Gain Extraction and Reconstruction for Efficient Speech Separation

Python 255 43 Updated May 22, 2025

alibaba-damo-academy / DyDiT

The official implementation of "2025 ICLR Dynamic Diffusion Transformer" and "2025 ArXiv DyDiT++: Dynamic Diffusion Transformers for Efficient Visual Generation".

Python 35 4 Updated Apr 10, 2025

MoonshotAI / Kimi-Audio

Kimi-Audio, an open-source audio foundation model excelling in audio understanding, generation, and conversation

Python 3,690 235 Updated May 28, 2025

nari-labs / dia

A TTS model capable of generating ultra-realistic dialogue in one pass.

Python 16,510 1,321 Updated May 28, 2025

deepseek-ai / Janus

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 17,307 2,240 Updated Feb 1, 2025

k2-fsa / sherpa-onnx

Speech-to-text, text-to-speech, speaker diarization, speech enhancement, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, HarmonyOS…

C++ 6,151 706 Updated May 29, 2025

SkyworkAI / SkyReels-A2

SkyReels-A2: Compose anything in video diffusion transformers

Python 526 45 Updated Apr 22, 2025

SkyworkAI / SkyReels-V2

SkyReels-V2: Infinite-length Film Generative model

Python 2,636 308 Updated May 27, 2025

punkpeye / awesome-mcp-servers

A collection of MCP servers.

51,922 3,895 Updated May 28, 2025

BUTSpeechFIT / speakerbeam

Jupyter Notebook 120 19 Updated Oct 25, 2021

BUTSpeechFIT / TS_SUPERB

Python 15 Updated Apr 2, 2025

MediaTek-NeuroPilot / tflite-neuron-delegate

MediaTek's TFLite delegate

C++ 45 6 Updated Apr 4, 2024

cgeo / cgeo

c:geo - The powerful Android geocaching app.

Java 1,431 579 Updated May 28, 2025

HKUDS / LightRAG

"LightRAG: Simple and Fast Retrieval-Augmented Generation"

Python 16,834 2,311 Updated May 27, 2025

yongzhuo / qwen2-sft

Qwen1.5-SFT (Alibaba): fine-tuning (transformers), LoRA (peft), and inference for Qwen_Qwen1.5-2B-Chat/Qwen_Qwen1.5-7B-Chat

Python 61 5 Updated May 17, 2024

FunnySaltyFish / Better-Ruozhiba

[Item-by-item processing complete] A QA dataset of selected Ruozhiba questions, with every entry manually reviewed and revised

200 10 Updated Apr 10, 2025

cogmhear / avse_challenge

Forked from claritychallenge/clarity

COG-MHEAR Audio-Visual Speech Enhancement Challenge

Python 40 11 Updated May 7, 2025

LLM-Red-Team / qwen-free-api

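🚀 Reverse-engineered API for Alibaba's Tongyi Qianwen 2.5 large model [specialty: all-rounder]. Supports high-speed streaming output, watermark-free AI image generation, long-document interpretation, image analysis, web search, and multi-turn conversation, with zero-configuration deployment, multi-token support, and automatic cleanup of session traces. For testing only; for commercial use, please go to the official open platform.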

TypeScript 922 250 Updated May 21, 2025

github / github-mcp-server

GitHub's official MCP Server

Go 14,552 949 Updated May 29, 2025

Starred topics

Tensorflow
