Stars
[ICLR 2025] "Temporal Reasoning Transfer from Text to Video", Lei Li, Yuanxin Liu, Linli Yao, Peiyuan Zhang, Chenxin An, Lean Wang, Xu Sun, Lingpeng Kong, Qi Liu
Code and data for "Does Spatial Cognition Emerge in Frontier Models?"
A Python module to repair invalid JSON from LLMs
A Python library for creating and solving mazes.
A high-throughput and memory-efficient inference and serving engine for LLMs
Tile primitives for speedy kernels
VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.
Habitat-Web is a web application to collect human demonstrations for embodied tasks on Amazon Mechanical Turk (AMT) using the Habitat simulator.
Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.
Python library for loading and using triangular meshes.
Official implementation of the paper "LangSplat: 3D Language Gaussian Splatting" [CVPR2024 Highlight]
[CVPR 2023] Code and datasets for 'Chat2Map: Efficient Scene Mapping from Multi-Ego Conversations'
[ICCV 2023] PEANUT: Predicting and Navigating to Unseen Targets
Masked Diffusion Transformer is the SOTA for image synthesis (ICCV 2023).
A comprehensive collection of papers using large language/multi-modal models for Robotics/RL, including papers, code, and related websites
Hackable and optimized Transformer building blocks, supporting composable construction.
Code and models for the paper "One Transformer Fits All Distributions in Multi-Modal Diffusion"
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything
Code for CVPR 2023 paper "Procedure-Aware Pretraining for Instructional Video Understanding"
PyTorch code and models for the DINOv2 self-supervised learning method.
JARVIS, a system to connect LLMs with the ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
The repository for the largest and most comprehensive empirical study of visual foundation models for Embodied AI (EAI).
NaQ: Leveraging Narrations as Queries to Supervise Episodic Memory. CVPR 2023.
[CVPR 2023] vMAP: Vectorised Object Mapping for Neural Field SLAM