PyTorch code and models for VJEPA2 self-supervised learning from video.
Official repository for the paper "SANSA: Unleashing the Hidden Semantics in SAM2 for Few-Shot Segmentation."
[CVPR25] Official repository for the paper: "SAMWISE: Infusing Wisdom in SAM2 for Text-Driven Video Segmentation"
Robustness in Both Domains: CLIP Needs a Robust Text Encoder
Interactive Pytorch forward pass visualization in notebooks
Turn any computer or edge device into a command center for your computer vision projects.
Get started with building Fullstack Agents using Gemini 2.5 and LangGraph
Run Kokoro TTS locally on device using Expo & ONNX Runtime
react-native-mlkit - The definitive MLKit wrapper for React Native and Expo
High-efficiency floating-point neural network inference operators for mobile, server, and Web
Swift Package to implement a transformers-like API in Swift
On-device AI across mobile, embedded and edge for PyTorch
Declarative way to run AI models in React Native on device, powered by ExecuTorch.
The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.
A Comprehensive Evaluation Benchmark for Open-Vocabulary Detection (AAAI 2024)
OpenVINO™ is an open source toolkit for optimizing and deploying AI inference
📚 Jupyter notebook tutorials for OpenVINO™
Embedding Atlas is a tool that provides interactive visualizations for large embeddings. It allows you to visualize, cross-filter, and search embeddings and metadata.
LiteRT is the new name for TensorFlow Lite (TFLite). While the name is new, it's still the same trusted, high-performance runtime for on-device AI, now with an expanded vision.
A gallery that showcases on-device ML/GenAI use cases and allows people to try and use models locally.
(CVPR 2025 highlight✨) Official repository of paper "LLMDet: Learning Strong Open-Vocabulary Object Detectors under the Supervision of Large Language Models"
Real-time webcam demo with SmolVLM and llama.cpp server
[CVPR 2025 Best Paper Award Candidate] VGGT: Visual Geometry Grounded Transformer
Code for training and testing the CountGD model from the paper "CountGD: Multi-Modal Open-World Counting."
Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning, achieving state-of-the-art performance on 38 of 60 public benchmarks.