8000 sayedmohamedscu (elsayed mohamed) / Starred · GitHub

More Web Proxy on the site http://driver.im/

sayedmohamedscu

Follow

elsayed mohamed sayedmohamedscu

Follow

Passionate computer vision engineer. Building ML solutions.

46 followers · 99 following

Achievements

Achievements

Lists (1)

Sort

🚀 My stack

Starred repositories

Yuliang-Liu / MonkeyOCR

A lightweight LMM-based Document Parsing Model

Python 1,544 95 Updated Jun 12, 2025

playht / PlayDiffusion

Python 455 37 Updated Jun 10, 2025

datalayer / jupyter-mcp-server

🪐 ✨ Model Context Protocol (MCP) Server for Jupyter.

Python 389 69 Updated Jun 4, 2025

OpenGVLab / SAM-Med2D

Official implementation of SAM-Med2D

Jupyter Notebook 992 97 Updated Jun 18, 2024

google-ai-edge / gallery

A gallery that showcases on-device ML/GenAI use cases and allows people to try and use models locally.

Kotlin 10,781 724 Updated Jun 11, 2025

ZZZHANG-jx / DocRes

[CVPR 2024] DocRes: A Generalist Model Toward Unifying Document Image Restoration Tasks

Python 449 53 Updated Jan 28, 2025

vllm-project / llm-compressor

Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

Python 1,477 145 Updated Jun 12, 2025

Google-Health / medgemma

Jupyter Notebook 403 48 Updated May 28, 2025

sisig-ai / doctor

Doctor is a tool for discovering, crawl, and indexing web sites to be exposed as an MCP server for LLM agents.

Python 435 56 Updated May 24, 2025

microsoft / playwright-python

Python version of the Playwright testing and automation library.

Python 13,195 1,006 Updated Jun 11, 2025

apple / ml-fastvlm

This repository contains the official implementation of "FastVLM: Efficient Vision Encoding for Vision Language Models" - CVPR 2025

Python 4,159 219 Updated May 5, 2025

huggingface / nanoVLM

The simplest, fastest repository for training/finetuning small-sized VLMs.

Python 3,216 267 Updated Jun 12, 2025

n8n-io / n8n

Fair-code workflow automation platform with native AI capabilities. Combine visual building with custom code, self-host or cloud, 400+ integrations.

TypeScript 106,737 30,249 Updated Jun 12, 2025

juanmc2005 / diart

A python package to build AI-powered real-time audio applications

Python 1,324 103 Updated Feb 12, 2025

QuentinFuxa / WhisperLiveKit

Python package for Real-time, Local Speech-to-Text and Speaker Diarization. FastAPI Server & Web Interface

Python 310 81 Updated May 28, 2025

caviri / BetterWhisperX

Forked from m-bain/whisperX

Better WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 15 2 Updated Oct 29, 2024

mikel-brostrom / boxmot

BoxMOT: pluggable SOTA tracking modules for segmentation, object detection and pose estimation models

Python 7,402 1,805 Updated Jun 12, 2025

ConsistentlyInconsistentYT / Pixeltovoxelprojector

Projects motion of pixels to a voxel

C++ 708 172 Updated Mar 31, 2025

Dan-wanna-M / formatron

Formatron empowers everyone to control the format of language models' output with minimal overhead.

Python 202 6 Updated Jun 7, 2025

x1xhlol / system-prompts-and-models-of-ai-tools

FULL v0, Cursor, Manus, Same.dev, Lovable, Devin, Replit Agent, Windsurf Agent, VSCode Agent, Dia Browser & Trae AI (And other Open Sourced) System Prompts, Tools & AI Models.

56,977 17,354 Updated Jun 8, 2025

EvolvingLMMs-Lab / Otter

🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.

Python 3,257 212 Updated Mar 5, 2024

NVlabs / PS3

Scaling Vision Pre-Training to 4K Resolution

Python 179 9 Updated Jun 5, 2025

bytedance / InfiniteYou

🔥 InfiniteYou: Flexible Photo Recrafting While Preserving Your Identity

Python 2,360 246 Updated Apr 16, 2025

roboflow / rf-detr

RF-DETR is a real-time object detection model architecture developed by Roboflow, SOTA on COCO & designed for fine-tuning.

Python 2,231 234 Updated Jun 10, 2025

ml-explore / mlx

MLX: An array framework for Apple silicon

C++ 20,979 1,230 Updated Jun 12, 2025

sgl-project / sglang

SGLang is a fast serving framework for large language models and vision language models.

Python 15,082 1,983 Updated Jun 12, 2025

canopyai / Orpheus-TTS

Towards Human-Sounding Speech

Python 5,003 410 Updated May 6, 2025

predibase / lorax

Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs

Python 3,008 215 Updated May 21, 2025

AbdooMohamedd / satellite-imagery-app

This repository analyzes satellite imagery to track the impact of the war on Gaza, using Sentinel Hub and Planet.com APIs for image retrieval and visualization.

Python 20 1 Updated Mar 15, 2025

artemcher / myotracker

Pruned CoTracker architecture for tracking the myocardium in 2D echo images.

Python 12 2 Updated May 6, 2025

Starred topics

Database

udacity

0