Stars
[ECCV 2024] Official PyTorch implementation of RoPE-ViT "Rotary Position Embedding for Vision Transformer"
Get your documents ready for gen AI
Production-ready platform for agentic workflow development.
One for All Modalities Evaluation Toolkit - including text, image, video, audio tasks.
Famous Vision Language Models and Their Architectures
The Universe of Data. All about data, data science, and data engineering
PyTorch implementation of MAE https//arxiv.org/abs/2111.06377
braintwist: Mersenne twister as brainfuck code generator
TRACE: Table Reconstruction Aligned to Corner and Edges (ICDAR 2023)
1st Solution For Conversational Multi-Doc QA Workshop & International Challenge @ WSDM'24 - Xiaohongshu.Inc
Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in pytorch
Official implementation of project Honeybee (CVPR 2024)
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
Official PyTorch implementation of "Large-scale Bilingual Language-Image Contrastive Learning" (ICLRW 2022)
Polyglot: Large Language Models of Well-balanced Competence in Multi-languages
KoAlpaca: 한국어 명령어를 이해하는 오픈소스 언어모델 (KoAlpaca: An open-source language model to understand Korean instructions)
Implementation of Nougat Neural Optical Understanding for Academic Documents
OCR Annotations from Amazon Textract for Industry Documents Library
The official implementation of SPTS v2: Single-Point Text Spotting
The 3rd Place Solution of the Meta AI Video Similarity Challenge : Descriptor Track and Matching Track.