-
09:45
(UTC -07:00)
Highlights
- Pro
Stars
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
A feature-rich command-line audio/video downloader
Command-line program to download videos from YouTube.com and other video sites
MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting
Awesome Deep Learning papers for industrial Search, Recommendation and Advertisement. They focus on Embedding, Matching, Ranking (CTR/CVR prediction), Post Ranking, Large Model (Generative Recommen…
An extremely fast Python package and project manager, written in Rust.
Unofficial Bitwarden compatible server written in Rust, formerly known as bitwarden_rs
⭐Github Ranking⭐ Github stars and forks ranking list. Github Top100 stars list of different languages. Automatically update daily. | Github仓库排名,每日自动更新
A curated collection of open-source macOS applications built with Swift
The home of the Web Applets spec, demo and SDK
Example Jupyter notebooks for OpenMC
A curated and opinionated list of resources for Chief Technology Officers, with the emphasis on startups
[pip install medmnist] 18x Standardized Datasets for 2D and 3D Biomedical Image Classification
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
DiffFace: Diffusion-based Face Swapping with Facial Guidance
Our implementation of Text Style Brush architecture.
Image transformations designed for Scene Text Recognition (STR) data augmentation. Published at ICCV 2021 Workshop on Interactive Labeling and Data Augmentation for Vision.
画像データ拡張ライブラリAlbumentationsのJupyter上での実行例。
Papers, Datasets, Algorithms, SOTA for STR. Long-time Maintaining
Text recognition (optical character recognition) with deep learning methods, ICCV 2019
Implementation of Bidirectional Scene Text Recognition with a Single Decoder
An implementation of "CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model".
Awesome multilingual OCR and Document Parsing toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools,…
Official PyTorch implementation for Multi-modal Text Recognition Networks: Interactive Enhancements between Visual and Semantic Features (MATRN) in ECCV 2022.