- Algomatic
- Japan
- (UTC +09:00)
- https://sites.google.com/view/yusukemikami
- in/yusukemikami
Lists (20)
🔥 Agent project
🧠 awesome Agent
✏️ Awesome-LLM
Compression
Conference
Diffusion
🪙 Finance
Hallucination
👨‍🔬 LLM survey
LLM Tool
🔄 LLM Training
LLMOps
OpenAI
🗺️ Planner
📝 Prompt Engineering/cot
RAG
🤖 robot
Robot-Benchmark
⭐ VLM
World model
Starred repositories
RepText: Rendering Visual Text via Replicating 🔥
Open-source vector similarity search for Postgres
Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of …
[CVPR 2024 Oral] Official repository for RALF: Retrieval-Augmented Layout Transformer for Content-Aware Layout Generation
End-to-End Object Detection with Transformers
The official PyTorch implementation for arXiv'23 paper 'LayoutDETR: Detection Transformer Is a Good Multimodal Layout Designer'
Anthropic's Interactive Prompt Engineering Tutorial
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.
The official Python library for the OpenAI API
Adding guardrails to large language models.
🤗 smolagents: a barebones library for agents that think in code.
A collection of examples that show how to use CrewAI framework to automate workflows.
A lightweight, powerful framework for multi-agent workflows
This repository provides tutorials and implementations for various Generative AI Agent techniques, from basic to advanced. It serves as a comprehensive guide for building intelligent, interactive A…
Agent Framework / shim to use Pydantic with LLMs
[CVPR 2024] Official implementation of the paper "Visual In-context Learning"
(TPAMI 2024) A Survey on Open Vocabulary Learning
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
A SOTA open-source image editing model, which aims to provide comparable performance against the closed-source models like GPT-4o and Gemini 2 Flash.
A collection of LogitsProcessors to customize and enhance LLM behavior for specific tasks.
Detecting Hallucinations in Large Language Model Generation: A Token Probability Approach. This repository includes the implementation of our method for training and evaluation of a Logistic Regres…
The TypeScript AI agent framework. ⚡ Assistants, RAG, observability. Supports any LLM: GPT-4, Claude, Gemini, Llama.
This repository is for the paper entitled: From News to Forecast: Integrating Event Analysis in LLM-based Time Series Forecasting with Reflection (NeurIPS 2024)
11 Lessons to Get Started Building AI Agents
🤖 The next generation of Multi-Modal Multi-Agent platform. 👾 🦄 🔮
An open-source, code-first Python toolkit for building, evaluating, and deploying sophisticated AI agents with flexibility and control.
Unified Backend Framework for APIs, Events, and AI Agents
A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.