mrjunjieli (李俊杰) / Starred · GitHub
😀
Focusing
  • TianJin University
  • TianJin, China

Organizations

@TJUCocktailParty


Official repository of SepReformer for speech separation

Python 197 23 Updated Jan 13, 2025

Dolphin is a multilingual, multitask ASR model jointly trained by DataoceanAI and Tsinghua University.

Python 476 28 Updated May 6, 2025

An introductory LLM tutorial for developers: the Chinese edition of Andrew Ng's (吴恩达) large language model course series

Jupyter Notebook 19,315 2,330 Updated Feb 25, 2025

Awesome-LLM: a curated list of Large Language Models

23,112 1,925 Updated May 9, 2025

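A step-by-step guide to getting started with AI and large language models, with tutorials and demo code that take you from API usage to local model deployment and fine-tuning. The code files come with online Kaggle or Colab versions, so you can follow along even without a GPU. The project also includes a small code playground 🎡 where you can experiment with some interesting AI scripts. It additionally contains a complete Chinese mirror of the assignments from Hung-yi Lee's (李宏毅) 2024 Introduction to Generative AI course.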

Python 2,342 253 Updated Mar 26, 2025

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 10,344 1,039 Updated May 8, 2025

SearXNG is a free internet metasearch engine which aggregates results from various search services and databases. Users are neither tracked nor profiled.

Python 18,899 1,924 Updated May 10, 2025

Open-Unmix - Music Source Separation for PyTorch

Python 1,368 196 Updated Jun 17, 2024

Implementation of Band Split Roformer, SOTA Attention network for music source separation out of ByteDance AI Labs

Python 536 18 Updated Jan 9, 2025

Code for Audio-Visual Target Speaker Extraction with Selective Auditory Attention (TASLP)

Python 17 Updated Feb 28, 2025

PyTorch Re-Implementation of "The Sparsely-Gated Mixture-of-Experts Layer" by Noam Shazeer et al. https://arxiv.org/abs/1701.06538

Python 1,103 108 Updated Apr 19, 2024

A Pytorch implementation of Sparsely-Gated Mixture of Experts, for massively increasing the parameter count of language models

Python 739 59 Updated Sep 13, 2023

🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation

Python 16,264 1,919 Updated May 9, 2025

Scaling Diffusion Transformers with Mixture of Experts

Python 318 14 Updated Sep 9, 2024

PyTorch implementation of SimCLR: A Simple Framework for Contrastive Learning of Visual Representations

Jupyter Notebook 2,395 478 Updated Mar 4, 2024

Implementation of Denoising Diffusion Probabilistic Model in Pytorch

Python 9,306 1,135 Updated Oct 9, 2024

Audio-FLAN

144 4 Updated Mar 6, 2025

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

7,755 275 Updated Apr 14, 2025

A Collection of Variational Autoencoders (VAE) in PyTorch.

Python 7,130 1,124 Updated Mar 21, 2025

A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization

Python 1,992 168 Updated Apr 19, 2025

An implementation of EQFace: A Simple Explicit Quality Network for Face Recognition (https://arxiv.org/abs/2105.00634, CVPRW 2021)

Python 191 33 Updated May 14, 2021

A pytorch implementation of the vector quantized variational autoencoder (https://arxiv.org/abs/1711.00937)

Jupyter Notebook 752 93 Updated Dec 8, 2022

Unofficial implementation of High Fidelity Neural Audio Compression

Python 155 14 Updated Aug 15, 2024

State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.

Python 1,433 139 Updated Jul 11, 2024

(ICCV 2019) Uncertainty-aware Face Representation and Recognition

Python 345 59 Updated Aug 8, 2019

Jupyter Notebook 753 133 Updated Jul 10, 2024

Python 163 23 Updated Dec 5, 2024

OSUM: Open Speech Understanding Model, open-sourced by ASLP@NPU.

Python 362 24 Updated May 10, 2025

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Python 3,681 325 Updated Jan 4, 2024

⏰ Collaboratively track deadlines of conferences recommended by CCF (Website, Python Cli, Wechat Applet) / If you find it useful, please star this project, thanks~

Vue 7,363 497 Updated May 10, 2025