mrjunjieli

😀

Focusing

李俊杰 mrjunjieli

😀

Focusing

speech separation

46 followers · 52 following

TianJin University
TianJin, China

Achievements

Organizations

Lists (3)

Sort

Stars

dmlguq456 / SepReformer

Official repository of SepReformer for speech separation

Python 197 23 Updated Jan 13, 2025

DataoceanAI / Dolphin

Dolphin is a multilingual, multitask ASR model jointly trained by DataoceanAI and Tsinghua University.

Python 476 28 Updated May 6, 2025

datawhalechina / llm-cookbook

面向开发者的 LLM 入门教程，吴恩达大模型系列课程中文版

Jupyter Notebook 19,315 2,330 Updated Feb 25, 2025

Hannibal046 / Awesome-LLM

Awesome-LLM: a curated list of Large Language Model

23,112 1,925 Updated May 9, 2025

Hoper-J / AI-Guide-and-Demos-zh_CN

这是一份入门AI/LLM大模型的逐步指南，包含教程和演示代码，带你从API走进本地大模型部署和微调，代码文件会提供Kaggle或Colab在线版本，即便没有显卡也可以进行学习。项目中还开设了一个小型的代码游乐场🎡，你可以尝试在里面实验一些有意思的AI脚本。同时，包含李宏毅 (HUNG-YI LEE）2024生成式人工智能导论课程的完整中文镜像作业。

Python 2,342 253 Updated Mar 26, 2025

modelscope / FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 10,344 1,039 Updated May 8, 2025

searxng / searxng

SearXNG is a free internet metasearch engine which aggregates results from various search services and databases. Users are neither tracked nor profiled.

Python 18,899 1,924 Updated May 10, 2025

sigsep / open-unmix-pytorch

Open-Unmix - Music Source Separation for PyTorch

Python 1,368 196 Updated Jun 17, 2024

lucidrains / BS-RoFormer

Implementation of Band Split Roformer, SOTA Attention network for music source separation out of ByteDance AI Labs

Python 536 18 Updated Jan 9, 2025

TaoRuijie / SEANet

Code for Audio-Visual Target Speaker Extraction with Selective Auditory Attention (TASLP)

Python 17 Updated Feb 28, 2025

davidmrau / mixture-of-experts

PyTorch Re-Implementation of "The Sparsely-Gated Mixture-of-Experts Layer" by Noam Shazeer et al. https://arxiv.org/abs/1701.06538

Python 1,103 108 Updated Apr 19, 2024

lucidrains / mixture-of-experts

A Pytorch implementation of Sparsely-Gated Mixture of Experts, for massively increasing the parameter count of language models

Python 739 59 Updated Sep 13, 2023

camel-ai / owl

🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation

Python 16,264 1,919 Updated May 9, 2025

feizc / DiT-MoE

Scaling Diffusion Transformers with Mixture of Experts

Python 318 14 Updated Sep 9, 2024

sthalles / SimCLR

PyTorch implementation of SimCLR: A Simple Framework for Contrastive Learning of Visual Representations

Jupyter Notebook 2,395 478 Updated Mar 4, 2024

lucidrains / denoising-diffusion-pytorch

Implementation of Denoising Diffusion Probabilistic Model in Pytorch

Python 9,306 1,135 Updated Oct 9, 2024

lmxue / Audio-FLAN

Audio-FLAN

144 4 Updated Mar 6, 2025

deepseek-ai / open-infra-index

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

7,755 275 Updated Apr 14, 2025

AntixK / PyTorch-VAE

A Collection of Variational Autoencoders (VAE) in PyTorch.

Python 7,130 1,124 Updated Mar 21, 2025

modelscope / 3D-Speaker

A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization

Python 1,992 168 Updated Apr 19, 2025

deepcam-cn / FaceQuality

An implementation of EQFace: A Simple Explicit Quality Network for Face Recognition (https://arxiv.org/abs/2105.00634, CVPRW 2021)

Python 191 33 Updated May 14, 2021

MishaLaskin / vqvae

A pytorch implementation of the vector quantized variational autoencoder (https://arxiv.org/abs/1711.00937)

Jupyter Notebook 752 93 Updated Dec 8, 2022

ZhikangNiu / encodec-pytorch

unofficial implementation of the High Fidelity Neural Audio Compression

Python 155 14 Updated Aug 15, 2024

descriptinc / descript-audio-codec

State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.

Python 1,433 139 Updated Jul 11, 2024

seasonSH / Probabilistic-Face-Embeddings

(ICCV 2019) Uncertainty-aware Face Representation and Recognition

Python 345 59 Updated Aug 8, 2019

mk-minchul / AdaFace

Jupyter Notebook 753 133 Updated Jul 10, 2024

JusperLee / SPMamba

Python 163 23 Updated Dec 5, 2024

ASLP-lab / OSUM

OSUM: Open Speech Understanding Model, open-sourced by ASLP@NPU.

Python 362 24 Updated May 10, 2025

facebookresearch / encodec

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Python 3,681 325 Updated Jan 4, 2024

ccfddl / ccf-deadlines

⏰ Collaboratively track deadlines of conferences recommended by CCF (Website, Python Cli, Wechat Applet) / If you find it useful, please star this project, thanks~

Vue 7,363 497 Updated May 10, 2025

李俊杰 mrjunjieli

Organizations

Lists (3)

🔮 Future ideas

✨ Inspiration

🚀 My stack

Stars