8000 TemuujinE (RazyDave) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View TemuujinE's full-sized avatar

Block or report TemuujinE

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Efficient Triton Kernels for LLM Training

Python 5,275 358 Updated Jun 27, 2025

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 22,247 1,501 Updated Jun 26, 2025

DSPy: The framework for programming—not prompting—language models

Python 25,912 1,991 Updated Jun 27, 2025

Lark is a parsing toolkit for Python, built with a focus on ergonomics, performance and modularity.

Python 5,360 441 Updated May 8, 2025

Web Interface for Vision Language Models Including InternVLM2

Python 22 3 Updated Jul 29, 2024

Toolkit for linearizing PDFs for LLM datasets/training

Python 13,060 939 Updated Jun 27, 2025

A fast multimodal LLM for real-time voice

Python 4,061 318 Updated Feb 14, 2025

This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.

Python 1,237 430 Updated Jul 25, 2024

ESC-50: Dataset for Environmental Sound Classification

Python 1,589 302 Updated Mar 20, 2024

Official Repository of ChartX & ChartVLM: A Versatile Benchmark and Foundation Model for Complicated Chart Reasoning

Python 226 20 Updated Sep 26, 2024

Famous Vision Language Models and Their Architectures

Markdown 894 43 Updated Feb 24, 2025

Agent Laboratory is an end-to-end autonomous research workflow meant to assist you as the human researcher toward implementing your research ideas

Python 4,568 656 Updated Mar 27, 2025

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Python 54,589 9,018 Updated May 30, 2025

Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit

Python 944 144 Updated May 19, 2025

DeepFaceLab is the leading software for creating deepfakes.

Python 18,227 577 Updated Nov 13, 2024

Clustering-based methods for overlapping diarization

Python 80 9 Updated Jan 12, 2024

A specializer for Gaussian Mixture Models, based on the ASP framework

Python 43 13 Updated Aug 2, 2012

Python3 code for the IEEE SPL paper "Auto-Tuning Spectral Clustering for SpeakerDiarization Using Normalized Maximum Eigengap"

Python 12 5 Updated Apr 6, 2020

This repo is for the SPL paper "Auto-Tuning Spectral Clustering for Speaker Diarization Using Normalized Maximum Eigengap"

Python 121 15 Updated Apr 8, 2022

Tools for merging pretrained large language models.

Python 5,897 569 Updated Jun 19, 2025

Robust Speech Recognition via Large-Scale Weak Supervision

Python 84,037 10,224 Updated Jun 26, 2025

How to use OpenAIs Whisper to transcribe and diarize audio files

Jupyter Notebook 345 46 Updated Oct 12, 2022

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 16,470 1,761 Updated Jun 27, 2025

Python re-implementation of the (constrained) spectral clustering algorithms used in Google's speaker diarization papers.

Python 533 71 Updated Sep 25, 2024

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

1,767 231 Updated Oct 16, 2024

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

Jupyter Notebook 4,670 431 Updated Apr 22, 2025

This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.

Python 421 38 Updated Mar 31, 2025

A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization

Python 2,143 186 Updated Jun 6, 2025
Next
0