TemuujinE

RazyDave TemuujinE

8 followers · 24 following

Starred repositories

linkedin / Liger-Kernel

Efficient Triton Kernels for LLM Training

Python 5,275 358 Updated Jun 27, 2025

QwenLM / Qwen3

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 22,247 1,501 Updated Jun 26, 2025

stanfordnlp / dspy

DSPy: The framework for programming—not prompting—language models

Python 25,912 1,991 Updated Jun 27, 2025

lark-parser / lark

Lark is a parsing toolkit for Python, built with a focus on ergonomics, performance and modularity.

Python 5,360 441 Updated May 8, 2025

sammcj / vlm-ui

Web Interface for Vision Language Models Including InternVLM2

Python 22 3 Updated Jul 29, 2024

langchain-ai / agent-protocol

Python 398 29 Updated May 15, 2025

allenai / olmocr

Toolkit for linearizing PDFs for LLM datasets/training

Python 13,060 939 Updated Jun 27, 2025

deepseek-ai / DeepSeek-V3

Python 97,892 15,927 Updated Jun 27, 2025

fixie-ai / ultravox

A fast multimodal LLM for real-time voice

Python 4,061 318 Updated Feb 14, 2025

microsoft / DNS-Challenge

This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.

Python 1,237 430 Updated Jul 25, 2024

karolpiczak / ESC-50

ESC-50: Dataset for Environmental Sound Classification

Python 1,589 302 Updated Mar 20, 2024

Alpha-Innovator / ChartVLM

Official Repository of ChartX & ChartVLM: A Versatile Benchmark and Foundation Model for Complicated Chart Reasoning

Python 226 20 Updated Sep 26, 2024

gokayfem / awesome-vlm-architectures

Famous Vision Language Models and Their Architectures

Markdown 894 43 Updated Feb 24, 2025

SamuelSchmidgall / AgentLaboratory

Agent Laboratory is an end-to-end autonomous research workflow meant to assist you as the human researcher toward implementing your research ideas

Python 4,568 656 Updated Mar 27, 2025

CorentinJ / Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Python 54,589 9,018 Updated May 30, 2025

wenet-e2e / wespeaker

Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit

Python 944 144 Updated May 19, 2025

iperov / DeepFaceLab

DeepFaceLab is the leading software for creating deepfakes.

Python 18,227 577 Updated Nov 13, 2024

desh2608 / diarizer

Clustering-based methods for overlapping diarization

Python 80 9 Updated Jan 12, 2024

hcook / gmm

A specializer for Gaussian Mixture Models, based on the ASP framework

Python 43 13 Updated Aug 2, 2012

tango4j / Python-Speaker-Diarization

Python3 code for the IEEE SPL paper "Auto-Tuning Spectral Clustering for SpeakerDiarization Using Normalized Maximum Eigengap"

Python 12 5 Updated Apr 6, 2020

tango4j / Auto-Tuning-Spectral-Clustering

This repo is for the SPL paper "Auto-Tuning Spectral Clustering for Speaker Diarization Using Normalized Maximum Eigengap"

Python 121 15 Updated Apr 8, 2022

arcee-ai / mergekit

Tools for merging pretrained large language models.

Python 5,897 569 Updated Jun 19, 2025

openai / whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Python 84,037 10,224 Updated Jun 26, 2025

lablab-ai / Whisper-transcription_and_diarization-speaker-identification-

How to use OpenAIs Whisper to transcribe and diarize audio files

Jupyter Notebook 345 46 Updated Oct 12, 2022

m-bain / whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 16,470 1,761 Updated Jun 27, 2025

wq2012 / SpectralCluster

Python re-implementation of the (constrained) spectral clustering algorithms used in Google's speaker diarization papers.

Python 533 71 Updated Sep 25, 2024

wq2012 / awesome-diarization

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

1,767 231 Updated Oct 16, 2024

MahmoudAshraf97 / whisper-diarization

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

Jupyter Notebook 4,670 431 Updated Apr 22, 2025

google / speaker-id

This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.

Python 421 38 Updated Mar 31, 2025

modelscope / 3D-Speaker

A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization

Python 2,143 186 Updated Jun 6, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly