8000 VarunBelagali98 (Varun Belagali) / Starred Β· GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View VarunBelagali98's full-sized avatar

Organizations

@cvlab-stonybrook

Block or report VarunBelagali98

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[WIP] Code for LangToMo

Python 7 Updated May 13, 2025

Code and weights for the paper "Cluster and Predict Latents Patches for Improved Masked Image Modeling"

Python 106 2 Updated Apr 10, 2025

Official Implementation of DINO-Foresight: Looking into the Future with DINO

Python 51 1 Updated Feb 25, 2025

Fast Vision Mamba : Pool your Spatial Dimensions for Accelerated Processing

Python 12 1 Updated Jan 28, 2025

Advanced Privacy-Preserving Federated Learning framework

Python 131 24 Updated May 20, 2025

Adaptive Caching for Faster Video Generation with Diffusion Transformers

Python 147 7 Updated Nov 5, 2024
Python 3 Updated Dec 28, 2023

Official code of "EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model"

Python 407 19 Updated Mar 17, 2025

Theia: Distilling Diverse Vision Foundation Models for Robot Learning

Python 231 8 Updated Apr 3, 2025

The n-gram Language Model

C 1,421 100 Updated Aug 5, 2024

Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.

Jupyter Notebook 2,890 188 Updated May 19, 2025

[ICLR'25] LLaRA: Supercharging Robot Learning Data for Vision-Language Policy

Python 214 5 Updated Mar 29, 2025

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Python 1,906 132 Updated Oct 30, 2024

CVPR 2024: Learned representation-guided diffusion models for large-image generation

Jupyter Notebook 47 11 Updated Oct 8, 2024

πŸ“š A collection of papers about Referring Image Segmentation.

719 57 Updated Apr 14, 2025

[ICRA'24] Crossway Diffusion: Improving Diffusion-based Visuomotor Policy via Self-supervised Learning

Python 67 2 Updated Aug 4, 2024

Language Repository for Long Video Understanding

Python 31 3 Updated Jun 17, 2024

πŸ€– [ICLR'25] Multimodal Video Understanding Framework (MVU)

Python 40 3 Updated Jan 31, 2025

Inference code for Llama models

Python 58,267 9,776 Updated Jan 26, 2025

Official Code for PathLDM: Text conditioned Latent Diffusion Model for Histopathology (WACV 2024)

Jupyter Notebook 42 4 Updated Jul 7, 2024

Implementation of popular ML algorithms from scratch

Python 843 257 Updated Jan 9, 2024

Perceptual Grouping in Contrastive Vision-Language Models (ICCV'23)

Jupyter Notebook 36 5 Updated Jan 1, 2024

[ECCV 2024] Code for Betrayed by Attention: A Simple yet Effective Approach for Self-supervised Video Object Segmentation

Python 33 2 Updated Mar 7, 2025

Generative Models by Stability AI

Python 25,903 2,874 Updated May 20, 2025

GPT4 based personalized ArXiv paper assistant bot

Python 520 132 Updated Mar 26, 2024

[ICCV 2023] MeViS: A Large-scale Benchmark for Video Segmentation with Motion Expressions

Python 523 21 Updated Jun 24, 2024
Python 8,622 505 Updated Oct 9, 2024

Peekaboo: Text to Image Diffusion Models are Zero-Shot Segmentors

Jupyter Notebook 27 1 Updated Jun 2, 2024

πŸ‘‹ Hey there new gradπŸŽ‰! We've put together a collection of full-time job openings for SWE, Quant, PM and tech roles in 2024! πŸš€

Python 6,348 566 Updated Nov 26, 2024

Implementation for "DualCoOp: Fast Adaptation to Multi-Label Recognition with Limited Annotations" (NeurIPS 2022))

Python 61 7 Updated Oct 24, 2023
Next
0