8000 srama2512 (Santhosh Kumar Ramakrishnan) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View srama2512's full-sized avatar

Highlights

  • Pro

Block or report srama2512

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[ICLR 2025] "Temporal Reasoning Transfer from Text to Video", Lei Li, Yuanxin Liu, Linli Yao, Peiyuan Zhang, Chenxin An, Lean Wang, Xu Sun, Lingpeng Kong, Qi Liu

Python 7 Updated Apr 10, 2025

Code and data for "Does Spatial Cognition Emerge in Frontier Models?"

Python 18 1 Updated Apr 18, 2025
Python 474 40 Updated Jun 24, 2025
Python 3 Updated Feb 10, 2025

A python module to repair invalid JSON from LLMs

Python 2,258 102 Updated Jun 24, 2025

A Python library for creating and solving mazes.

Python 255 57 Updated Mar 22, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 50,642 8,291 Updated Jun 24, 2025

Tile primitives for speedy kernels

Cuda 2,475 157 Updated Jun 22, 2025

VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.

Python 3,348 268 Updated Jun 19, 2025

cuML - RAPIDS Machine Learning Library

C++ 4,793 577 Updated Jun 24, 2025

Habitat-Web is a web application to collect human demonstrations for embodied tasks on Amazon Mechanical Turk (AMT) using the Habitat simulator.

JavaScript 57 2 Updated Jun 16, 2022

Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.

Python 1,602 451 Updated May 26, 2025

Simple Python interface for Graphviz

Python 1,718 219 Updated Jun 15, 2025

Python library for loading and using triangular meshes.

Python 3,273 614 Updated Jun 21, 2025

Official implementation of the paper "LangSplat: 3D Language Gaussian Splatting" [CVPR2024 Highlight]

Python 826 92 Updated May 17, 2024

[CVPR 2023] Code and datasets for 'Chat2Map Efficient Scene Mapping from Multi-Ego Conversations'

Python 6 1 Updated Dec 8, 2023

[ICCV 2023] PEANUT: Predicting and Navigating to Unseen Targets

Python 49 5 Updated Mar 5, 2024

Masked Diffusion Transformer is the SOTA for image synthesis. (ICCV 2023)

Python 570 41 Updated Apr 23, 2024

Spot Sim2Real Infrastructure

Python 97 7 Updated May 27, 2025

A comprehensive list of papers using large language/multi-modal models for Robotics/RL, including papers, codes, and related websites

3,785 295 Updated May 27, 2025

Hackable and optimized Transformers building blocks, supporting a composable construction.

Python 9,629 683 Updated Jun 24, 2025

Code and models for the paper "One Transformer Fits All Distributions in Multi-Modal Diffusion"

Python 1,424 89 Updated May 31, 2023

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything

Jupyter Notebook 16,507 1,509 Updated Sep 5, 2024

Code for CVPR 2023 paper "Procedure-Aware Pretraining for Instructional Video Understanding"

Python 49 4 Updated Jan 27, 2025

PyTorch code and models for the DINOv2 self-supervised learning method.

Jupyter Notebook 10,908 1,004 Updated Jun 24, 2025

JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf

Python 24,192 2,031 Updated Sep 26, 2024

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 50,576 5,948 Updated Sep 18, 2024

The repository for the largest and most comprehensive empirical study of visual foundation models for Embodied AI (EAI).

Python 488 46 Updated May 1, 2024

NaQ: Leveraging Narrations as Queries to Supervise Episodic Memory. CVPR 2023.

Python 15 1 Updated Jan 26, 2024

[CVPR 2023] vMAP: Vectorised Object Mapping for Neural Field SLAM

Python 358 21 Updated Jun 16, 2023
Next
0