Stars
An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models
"what, how, where, and how well? a survey on test-time scaling in large language models" repository
A collection of awesome video generation studies.
[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
An official implementation for "X-CLIP: End-to-End Multi-grained Contrastive Learning for Video-Text Retrieval"
✨✨Latest Advances on Multimodal Large Language Models
Official implementation of HawkEye: Training Video-Text LLMs for Grounding Text in Videos
Recent LLM-based CV and related works. Welcome to comment/contribute!
[ACL 2023 Findings] FACTUAL dataset, the textual scene graph parser trained on FACTUAL.
深度学习500问,以问答形式对常用的概率知识、线性代数、机器学习、深度学习、计算机视觉等热点问题进行阐述,以帮助自己及有需要的读者。 全书分为18个章节,50余万字。由于水平有限,书中不妥之处恳请广大读者批评指正。 未完待续............ 如有意合作,联系scutjy2015@163.com 版权所有,违权必究 Tan 2018.06
(ACL'2023) MultiCapCLIP: Auto-Encoding Prompts for Zero-Shot Multilingual Visual Captioning
[AAAI2023] Token Mixing: Parameter-Efficient Transfer Learning from Image-Language to Video-Language
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…
Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages
2023年最新总结,阿里,腾讯,百度,美团,头条等技术面试题目,以及答案,专家出题人分析汇总。
🌍 针对小白的算法训练 | 包括四部分:①.大厂面经 ②.力扣图解 ③.千本开源电子书 ④.百张技术思维导图(项目花了上百小时,希望可以点 star 支持,🌹感谢~)推荐免费ChatGPT使用网站
Download resources from online storage with ONLY ONE command line!!
[ECCV2022] A pytorch implementation for TS2-Net: Token Shift and Selection Transformer for Text-Video Retrieval
ruotianluo / coco-caption
Forked from tylin/coco-captionThe code of IJCAI22 paper "GL-RG: Global-Local Representation Granularity for Video Captioning".
A list of Human-Object Interaction Learning.
DeepLab v3+ model in PyTorch. Support different backbones.
OpenMMLab Semantic Segmentation Toolbox and Benchmark.
阿里云天池大赛2019——肺部CT多病种智能诊断,参赛代码