-
IIITMK
- Thiruvananthapuram
- https://www.linkedin.com/in/rahulrajpvr7d/
- https://rahulrajpvr7d.medium.com
- rahulrajpvr7d
- rahulrajpvr7d
- @rahulrajpv_r7d
Lists (3)
Sort Name ascending (A-Z)
Starred repositories
Generative image model with learned similarity measures
Generate video summary report at scale using generative AI and serverless on AWS
Neural Networks: Zero to Hero
😈Awful AI is a curated list to track current scary usages of AI - hoping to raise awareness
Code for data analysis and visualization for the data descriptor "Multimodal brain responses during movie watching"
TransNet V2: Shot Boundary Detection Neural Network
21 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/
The official code repository for the second edition of the O'Reilly book Generative Deep Learning: Teaching Machines to Paint, Write, Compose and Play.
Open-Sora: Democratizing Efficient Video Production for All
Books related to Artificial Intelligence, Machine Learning, Deep Learning and Neural Networks
GeoJson Data of Indian States with boundaries
Record Audio from the User's Microphone in Apps that are Deployed to the Web. (via Browser Media-API, REACT-based, Streamlit Custom Component)
LangChain & Prompt Engineering tutorials on Large Language Models (LLMs) such as ChatGPT with custom data. Jupyter notebooks on loading and indexing data, creating prompt templates, CSV agents, and…
Data extraction with LLM on CPU
Pytorch implementation of Paper by Google Research - Representation Learning for Information Extraction from Form-like Documents.
This SDK is now deprecated, use the new unified Google GenAI SDK.
Facial Expression Recognition Using CNN and Haar-Cascade
GPT 3.5/4 with a Chat Web UI. No API key required.
【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
GPT-4 Vision Chatbot examples
Large Language-and-Vision Assistant for Biomedicine, built towards multimodal GPT-4 level capabilities.
GPT-4 Vision Chrom 4208 e Extension
Lightweight GPT-4 Vision processing over the Webcam