-
Columbia University
Starred repositories
victorb / ollama-swarm
Forked from openai/swarmEducational framework exploring ergonomic, lightweight multi-agent orchestration. Modified to use local Ollama endpoint
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.
The open source Firebase alternative. Supabase gives you a dedicated Postgres database to build your web, mobile, and AI applications.
A golang ebook intro how to build a web with golang
Large Language Model based Multi-Agents: A Survey of Progress and Challenges
This is the official code release for our work, Denoising Vision Transformers.
High-speed Large Language Model Serving for Local Deployment
[CVPR 2024 Highlight] SceneTex: High-Quality Texture Synthesis for Indoor Scenes via Diffusion Priors
Curated list of project-based tutorials
[CVPR2024 Highlight] Editable Scene Simulation for Autonomous Driving via LLM-Agent Collaboration
Official code for CVPR 2022 (Oral) paper "Deep Visual Geo-localization Benchmark"
cjl09 / MiniGPT-4-local
Forked from Vision-CAIR/MiniGPT-4MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models.
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
A library for generating and evaluating synthetic tabular data for privacy, fairness and data augmentation.
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Official repository for R2Former: Unified Retrieval and Reranking Transformer for Place Recognition
Official repository for R2Former: Unified Retrieval and Reranking Transformer for Place Recognition
[ECCV 2024 Oral] DriveLM: Driving with Graph Visual Question Answering
LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath
A Gradio web UI for Large Language Models with support for multiple inference backends.
Official Repository of ChatCaptioner
Code for 3D-LLM: Injecting the 3D World into Large Language Models
Simple Chainlit app to have interaction with your documents using different vectorstores.
Automatic download VPR datasets in a standard format
AnyLoc: Universal Visual Place Recognition (RA-L 2023)
👋 Hey there new grad🎉! We've put together a collection of full-time job openings for SWE, Quant, PM and tech roles in 2024! 🚀
Build and run Docker containers leveraging NVIDIA GPUs