-
American University of Beirut
- Beirut, Lebanon
- https://www.linkedin.com/in/mohamadmansourx
Lists (1)
Sort Name ascending (A-Z)
Starred repositories
Official code repo for the O'Reilly Book - "Hands-On Large Language Models"
🐮📢 The first AI voice assistant that interrupts *you*
Pretrain, finetune ANY AI model of ANY size on multiple GPUs, TPUs with zero code changes.
Open source multi-modal RAG for building AI apps over private knowledge.
A TTS model capable of generating ultra-realistic dialogue in one pass.
MiMo: Unlocking the Reasoning Potential of Language Model – From Pretraining to Posttraining
Pipelines: Versatile, UI-Agnostic OpenAI-Compatible Plugin Framework
DISC-FinLLM,中文金融大语言模型(LLM),旨在为用户提供金融场景下专业、智能、全面的金融咨询服务。DISC-FinLLM, a Chinese financial large language model (LLM) designed to provide users with professional, intelligent, and comprehensive financ…
A boilerplate project designed to enhance LLM usability directly within your CLI.
A simple screen parsing tool towards pure vision based GUI agent
A toolkit for reproducible evaluation, diagnostic, and error analysis of speaker diarization systems
Companion repo for the paper "PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker Recordings" published at Odyssey 2024
This is the code for Deformable Neural Radiance Fields, a.k.a. Nerfies.
We write your reusable computer vision tools. 💜
Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.
This repo contains the code for 1D tokenizer and generator
VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.
[CVPR2023] The official repo for OC-SORT: Observation-Centric SORT on video Multi-Object Tracking. OC-SORT is simple, online and robust to occlusion/non-linear motion.
PointOBB-v3: Expanding Performance Boundaries of Single Point-Supervised Oriented Object Detection
[RA-L] DreamCar: Leveraging Car-specific Prior for in-the-wild 3D Car Reconstruction
⚡ TabPFN: Foundation Model for Tabular Data ⚡
An E2E solution for Arabic Handwritten Text OCR, with an application to extract text, enhance camera-scanned documents, and grade handwritten exams based on a model answer.
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
⏩ Create, share, and use custom AI code assistants with our open-source IDE extensions and hub of models, rules, prompts, docs, and other building blocks
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
Learn online intrinsic rewards from LLM feedback