Lists (3)
Sort Name ascending (A-Z)
Stars
[LREC-COLING 2024 (Oral), Interspeech 2024 (Oral), NAACL 2025, ACL 2025] A Series of Multilingual Multitask Medical Speech Processing
VertexAI Kubeflow pipeline for training a Fashion MNIST classifier
This is the code repository for Deep Learning with Hadoop, published by Packt
LlamaIndex is the leading framework for building LLM-powered agents over your data.
🦜🔗 Build context-aware reasoning applications
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
Invoke is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The …
[ICCV 2023 Oral] Text-to-Image Diffusion Models are Zero-Shot Video Generators
CLI platform to experiment with codegen. Precursor to: https://lovable.dev
[CVPR 2024] Official repository for "MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model"
A MIT-licensed, deployable starter kit for building and customizing your own version of AI town - a virtual town where AI characters live, chat and socialize.
Workshop "Introduction to Observational Seismology" hosted at Hanoi University of Science from 21 to 25 April 2025.
Video-Infinity generates long videos quickly using multiple GPUs without extra training.
Industry leading face manipulation platform
im2recipe Pytorch implementation
An Application for Generating a cooking recipe consist of title, ingredients and instructions from an food image using Deep Learning.
Stable Diffusion web UI
Converter Video to Anime. Based on "AnimeGAN" repository.
A neural network to generate captions for an image using CNN and RNN with BEAM Search.
Bản dịch tiếng Việt của https://teachyourselfcs.com
Comparative Analysis of Techniques for Forecasting Time Series in Financial Markets
Implementation of the deepmind Flamingo vision-language model, based on Hugging Face language models and ready for training
🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.
视频的文本摘要(标注),输入一段视频,通过深度学习网络和人工智能程序识别视频主要表达的意思(Input a video output a txt decribing the video)。