DaehanKim (DaehanKim) / Starred · GitHub

Organizations

@TmaxEdu


pyCrossfade is the result of a personal project to use beat matching, gradual bpm shift on bars, and EQ modification to provide smooth and tunable transitions between music files.

Python 128 15 Updated Dec 27, 2024

Agentless🐱: an agentless approach to automatically solve software development problems

Python 1,670 177 Updated Dec 22, 2024

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 17,257 2,231 Updated Feb 1, 2025

We study toy models of skill learning.

Jupyter Notebook 26 2 Updated Jan 20, 2025

LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis

Python 556 39 Updated Apr 8, 2025

AAAI 2025: Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model

Python 209 13 Updated Apr 20, 2025

This repository contains LLM (large language model) interview questions asked at top companies such as Google, Nvidia, Meta, Microsoft, and Fortune 500 companies.

1,271 298 Updated Feb 12, 2025

Run Streamlit Apps as serverless on AWS with HTTPS

Python 12 6 Updated Jan 28, 2025

Simple Python library/structure to ablate features in LLMs which are supported by TransformerLens

Python 463 67 Updated Jun 11, 2024

The Universe of Evaluation. All about the evaluation for LLMs.

Python 225 25 Updated Jul 9, 2024

Universal Romanizer that can convert any unicode script to roman (latin) script

Perl 197 18 Updated Jul 26, 2024

Large Reasoning Models

Python 805 45 Updated Dec 3, 2024
Jupyter Notebook 7,495 1,353 Updated Sep 22, 2024

Official implementation for the paper Fine-grained style control in transformer-based text-to-speech synthesis.

Python 88 11 Updated Mar 5, 2022

Official PyTorch implementation of BigVGAN (ICLR 2023)

Python 1,023 130 Updated Sep 5, 2024

StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion

178 13 Updated Sep 27, 2024

g2pK: g2p module for Korean

Python 250 43 Updated Mar 1, 2022

(NeurIPS 2024 Oral 🔥) Improved Distribution Matching Distillation for Fast Image Synthesis

Python 773 42 Updated Mar 5, 2025

Train high-quality text-to-image diffusion models in a data & compute efficient manner

Python 495 36 Updated Mar 27, 2025

Let us democratise high-resolution generation! (CVPR 2024)

Jupyter Notebook 2,010 225 Updated Apr 15, 2024

All generative models in one for a better TTS model

Python 71 8 Updated Sep 8, 2024

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 14,416 2,864 Updated May 16, 2025

OneDiff: An out-of-the-box acceleration library for diffusion models.

Jupyter Notebook 1,884 124 Updated May 8, 2025

Automatically update text-to-speech (TTS) papers daily using GitHub Actions (updates every 12 hours)

Python 433 24 Updated May 16, 2025

UNet diffusion model in pure CUDA

Cuda 604 28 Updated Jun 28, 2024

A library for mechanistic interpretability of GPT-style language models

Python 2,158 379 Updated May 15, 2025

maximal update parametrization (µP)

Jupyter Notebook 1,512 99 Updated Jul 17, 2024

YaFSDP: Yet another Fully Sharded Data Parallel

Python 964 49 Updated May 14, 2025