8000 savageyusuff (Mulliana Yusuff) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View savageyusuff's full-sized avatar
  • Panasonic Research and Development Center Singapore
  • Singapore

Block or report savageyusuff

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

视觉(语义) SLAM 相关研究跟踪

1,813 406 Updated Jul 5, 2021

PyTorch implementation of MobileFaceNets

Python 89 30 Updated May 18, 2022

MobileFaceNets: Efficient CNNs for Accurate Real-Time Face Verification on Mobile 10000 Devices

Python 381 89 Updated Jan 4, 2024

🐳 A curated list of Docker resources and projects

32,415 3,096 Updated May 18, 2025

A feature-rich command-line audio/video downloader

Python 114,176 9,011 Updated Jun 3, 2025

target speaker extraction and verification for multi-talker speech

Python 178 31 Updated Jan 24, 2021

speech emotion recognition using a convolutional recurrent networks based on IEMOCAP

Python 398 141 Updated Jul 8, 2019

Python library for Room Impulse Response (RIR) simulation with GPU acceleration

Cuda 528 95 Updated Apr 22, 2025

SLAM - Simultaneous localization and mapping using OpenCV and NumPy.

Python 151 42 Updated Apr 10, 2022

Speech Toolkit for Malaysian language, https://malaya-speech.readthedocs.io/

Jupyter Notebook 256 45 Updated May 18, 2025

This repository contains a multi-fisheye camera SLAM. The underlying SLAM system is based on ORB-SLAM.

C++ 659 224 Updated Aug 5, 2020

Robust Pose Graph Optimization

C++ 512 135 Updated Mar 27, 2025

MelGAN vocoder (compatible with NVIDIA/tacotron2)

Python 645 114 Updated Oct 3, 2020

DSO with SIM(3) pose graph optimization and loop closure

C++ 690 238 Updated Jul 30, 2020

FBOW (Fast Bag of Words) is an extremmely optimized version of the DBow2/DBow3 libraries.

C++ 600 145 Updated Nov 22, 2021

Large, modern dataset for speech recognition

Shell 677 62 Updated Feb 26, 2024

Speaker embedding (d-vector) trained with GE2E loss

Python 282 46 Updated Jan 8, 2024

A Python library for audio data augmentation. Useful for making audio ML models work well in the real world, not just in the lab.

Python 2,057 203 Updated May 26, 2025

speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognition

Python 484 120 Updated Jul 1, 2021

A dataset for estimation of hand pose and shape from single color images.

Python 402 81 Updated Jan 21, 2022

Open-Source Toolkit for End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.

Python 620 192 Updated May 27, 2023

Diarization scoring tools.

Python 247 43 Updated Mar 28, 2023

[Unofficial] PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)

Python 1,032 184 Updated Dec 22, 2023

Generating room impulse responses

C++ 452 148 Updated Dec 20, 2023

Tools for handling multimodal data in machine learning projects.

Python 1,024 233 Updated May 22, 2025
Python 493 48 Updated Jun 25, 2024

A librosa STFT/Fbank/mfcc feature extration written up in PyTorch using 1D Convolutions.

Python 75 11 Updated Aug 19, 2022

Official repository for RawNet, RawNet2, and RawNet3

Python 378 56 Updated Mar 21, 2024
Next
0