mrdbourke (Daniel Bourke) / Starred · GitHub
PyTorch code and models for VJEPA2 self-supervised learning from video.

Python 1,013 65 Updated Jun 13, 2025

Official repository for the paper "SANSA: Unleashing the Hidden Semantics in SAM2 for Few-Shot Segmentation."

70 Updated May 29, 2025

[CVPR25] Official repository for the paper: "SAMWISE: Infusing Wisdom in SAM2 for Text-Driven Video Segmentation"

Python 246 10 Updated Jun 5, 2025

Robustness in Both Domains: CLIP Needs a Robust Text Encoder

Python 2 Updated Jun 5, 2025

Interactive PyTorch forward pass visualization in notebooks

Python 217 11 Updated Jun 10, 2025

Turn any computer or edge device into a command center for your computer vision projects.

Python 1,737 188 Updated Jun 13, 2025

Get started with building Fullstack Agents using Gemini 2.5 and LangGraph

Jupyter Notebook 13,124 2,024 Updated Jun 10, 2025

Run Kokoro TTS locally on device using Expo & ONNX Runtime

TypeScript 15 2 Updated May 8, 2025

react-native-mlkit - The definitive MLKit wrapper for React Native and Expo

TypeScript 286 19 Updated May 21, 2025

High-efficiency floating-point neural network inference operators for mobile, server, and Web

C 2,046 424 Updated Jun 14, 2025

[CVPR 2024] Rewrite the Stars

Python 390 19 Updated May 7, 2024

🤗 Optimum ExecuTorch

Python 50 11 Updated Jun 10, 2025

Swift Package to implement a transformers-like API in Swift

Swift 989 129 Updated Jun 5, 2025

On-device AI across mobile, embedded and edge for PyTorch

Python 2,951 594 Updated Jun 14, 2025

Declarative way to run AI models in React Native on device, powered by ExecuTorch.

C++ 776 37 Updated Jun 13, 2025

The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.

Python 1,437 87 Updated Jun 13, 2025

A Comprehensive Evaluation Benchmark for Open-Vocabulary Detection (AAAI 2024)

Python 50 3 Updated May 7, 2024

OpenVINO™ is an open source toolkit for optimizing and deploying AI inference

C++ 8,427 2,627 Updated Jun 14, 2025

📚 Jupyter notebook tutorials for OpenVINO™

Jupyter Notebook 2,826 898 Updated Jun 13, 2025

Embedding Atlas is a tool that provides interactive visualizations for large embeddings. It allows you to visualize, cross-filter, and search embeddings and metadata.

TypeScript 157 5 Updated Jun 11, 2025

LiteRT is the new name for TensorFlow Lite (TFLite). While the name is new, it's still the same trusted, high-performance runtime for on-device AI, now with an expanded vision.

C++ 576 74 Updated Jun 14, 2025

A gallery that showcases on-device ML/GenAI use cases and allows people to try and use models locally.

Kotlin 10,957 738 Updated Jun 13, 2025

(CVPR 2025 highlight✨) Official repository of paper "LLMDet: Learning Strong Open-Vocabulary Object Detectors under the Supervision of Large Language Models"

Python 226 14 Updated Jun 6, 2025

Python 1,175 44 Updated Jun 10, 2025

MM'21 Main-Track paper

Python 115 36 Updated Jan 17, 2024

Real-time webcam demo with SmolVLM and llama.cpp server

HTML 3,923 555 Updated May 12, 2025

[CVPR 2025 Best Paper Award Candidate] VGGT: Visual Geometry Grounded Transformer

Python 7,688 786 Updated Jun 11, 2025

Includes the code for training and testing the CountGD model from the paper CountGD: Multi-Modal Open-World Counting.

Python 243 24 Updated May 29, 2025

Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning, achieving state-of-the-art performance on 38 out of 60 public benchmarks.

Jupyter Notebook 1,207 44 Updated May 21, 2025