Lists (22)
awesome
backbones
captions
clustering
contrastive learning
diffusion_models
ego4d
few-shot
germany
learn
LLMs
long-tail
NCD
nlp
openset
resources
tech
time_transformers
transformers
video memory efficient
videos
work-in-progress
Stars
Official implementation of "Unbiasing through Textual Descriptions: Mitigating Representation Bias in Video Benchmarks." CVPR 2025
A presentation/notebook that lays out my view on what makes PyTorch efficient, aimed at researchers in AI and other domains.
A presentation explaining how Einsum can be understood and implemented (a minimal sketch follows at the end of this list).
Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.
ConceptAttention: A method for interpreting multi-modal diffusion transformers.
[ECCV 2024] Isomorphic Pruning for Vision Models
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
MEt3R: Measuring Multi-View Consistency in Generated Images
Official PyTorch implementation for "Space-Time Diffusion Features for Zero-Shot Text-Driven Motion Transfer"
Easily compute CLIP embeddings and build a CLIP retrieval system with them
Open source platform for the machine learning lifecycle
New repo collection for NVIDIA Cosmos: https://github.com/nvidia-cosmos
A method to increase the speed and lower the memory footprint of existing vision transformers.
An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)
Easily create large video datasets from video URLs
The official repository for ICLR2024 paper "FROSTER: Frozen CLIP is a Strong Teacher for Open-Vocabulary Action Recognition"
[arXiv:2309.16669] Code release for "Training a Large Video Model on a Single Machine in a Day"
Fast and memory-efficient exact attention
OrCo: Towards Better Generalization via Orthogonality and Contrast for Few-Shot Class-Incremental Learning
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
mPLUG-2: A Modularized Multi-modal Foundation Model Across Text, Image and Video (ICML 2023)
Friends don't let friends make certain types of data visualization: what they are and why they are bad.
Welcome to the Llama Cookbook! This is your go-to guide for building with Llama: getting started with inference, fine-tuning, and RAG. We also show you how to solve end-to-end problems using Llama mode…
Code and dataset for photorealistic Codec Avatars driven from audio
An open-source NLP research library, built on PyTorch.
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Implementation of paper 'Helping Hands: An Object-Aware Ego-Centric Video Recognition Model'
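For reference, a minimal sketch of the operation the Einsum presentation above is about. This is an illustrative NumPy example, not code taken from that repository: einsum expresses tensor contractions in index notation, summing over every index that does not appear in the output.

# Minimal einsum sketch (illustration only, not from the linked presentation).
import numpy as np

A = np.random.rand(2, 3, 4)   # batch of 2 matrices, each of shape (3, 4)
B = np.random.rand(2, 4, 5)   # batch of 2 matrices, each of shape (4, 5)

# Batched matrix multiply: contract over the shared index k.
C = np.einsum("bik,bkj->bij", A, B)

# Equivalent explicit loops, to show what the index notation means.
C_loop = np.zeros((2, 3, 5))
for b in range(2):
    for i in range(3):
        for j in range(5):
            for k in range(4):
                C_loop[b, i, j] += A[b, i, k] * B[b, k, j]

assert np.allclose(C, C_loop)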