DCNemesis / Starred · GitHub

phoneme tokenizer and grapheme-to-phoneme model for 8k languages

Python 162 17 Updated Jun 9, 2023

Scalable and memory-optimized training of diffusion models

Python 1,206 130 Updated Jun 4, 2025

a curated list of speech datasets (110+ datasets, 75+ easy to download)

137 8 Updated Feb 15, 2023

Refine high-quality datasets and visual AI models

Python 9,675 647 Updated Jul 4, 2025

Official code for VEnhancer: Generative Space-Time Enhancement for Video Generation

Python 547 29 Updated Sep 16, 2024
Python 367 59 Updated Sep 3, 2024

This is the official implementation of the SEMamba paper. (Accepted to IEEE SLT 2024)

Python 204 22 Updated Jun 5, 2025

AI powered speech denoising and enhancement

Python 1,862 221 Updated Dec 3, 2024

ColabKit is a Python library designed to enhance the experience of working in Google Colab environments. With ColabKit, you can simplify common tasks, manipulate media, record audio, and create int…

Python 2 Updated May 30, 2024

Universal multilingual automatic speech transcription into IPA

Python 65 11 Updated Feb 28, 2025

🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning

Python 15,523 2,087 Updated Jul 5, 2025

TensorFlow code and pre-trained models for BERT

Python 39,296 9,687 Updated Jul 23, 2024

Code for team "techies" to run POS tagger during afternoon activity

Jupyter Notebook 3 3 Updated Feb 2, 2024

[NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment

Python 3,406 254 Updated May 8, 2025

The Huskylens library ported to Java for the 2024 FRC Season

Java 4 1 Updated Jan 24, 2024

Robot telemetry application

TypeScript 232 74 Updated Jun 30, 2025

Finetune VITS and MMS using HuggingFace's tools

Python 158 53 Updated Mar 31, 2024

A Java library to manage USB devices like Controllers, Arduinos, IMUs, GPS, etc.

C 4 Updated Apr 29, 2018

AprilTag tracking and pose estimation in python for FRC

Python 27 4 Updated Jan 12, 2024

Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch

Python 2,562 276 Updated Jan 12, 2025

Python package and data files for manipulating phonological segments (phones, phonemes) in terms of universal phonological features.

Python 265 50 Updated Jun 12, 2025

Universal Romanizer that can convert any unicode script to roman (latin) script

Perl 213 22 Updated Jul 26, 2024

Phonetisaurus G2P

Shell 482 125 Updated Jun 1, 2024

ROS files for full SLAM navigation for FRC robots. This requires a Jetson TX2 with Jetpack 3.3, Ubuntu 16.04, and ROS Kinetic.

C++ 16 1 Updated Feb 17, 2021

FRC library with V-SLAM, trajectory generation, and LIDAR object detection capabilities

Java 19 8 Updated Mar 26, 2021

Unofficial implementation of NVIDIA P-Flow TTS paper

Python 225 32 Updated Dec 24, 2024
Jupyter Notebook 67 9 Updated Apr 4, 2024

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Python 54,639 9,021 Updated May 30, 2025

prompt2model - Generate Deployable Models from Natural Language Instructions

Python 2,003 186 Updated Dec 29, 2024