mharrisonbaker (Matthew Baker) · GitHub
mharrisonbaker/README.md

👋 Hi, I'm Matthew Baker

I'm an AI/ML Product Owner with a proven track record of building scalable machine learning systems in regulated environments, currently at the U.S. Patent and Trademark Office (USPTO). My work focuses on transforming real-world workflows into ML pipelines and AI-powered services.

🧠 What I Do

  • 🚀 Lead end-to-end ML product development, from prototype to production
  • 🔎 Build retrieval-augmented generation (RAG) systems for professional workflows
  • 📊 Translate stakeholder needs into clean data pipelines running 24/7 in production
  • ☁️ Manage deployment of containerized inference services on AWS, Azure, and on-prem infrastructure

📌 Some Featured Projects (outside of my work duties)

  • RAG-RAG Starter Kit
    A reality-rooted RAG framework for federal compliance. No Digital DMT 🌀 included. This architecture keeps AI systems grounded in verifiable sources and compliant with federal requirements.

  • CODE @USPTO Newsletter Project
    Born out of laziness, a Python + React system to streamline newsletter creation and distribution for the Club for Open Data Enthusiasts (C.O.D.E.); not an official USPTO project :)

  • Chat-MPEP
    Indexed, embedded, and runs locally. Chat-MPEP transforms the USPTO's Manual of Patent Examining Procedure from thorny HTML format into structured JSON 🧼 and powers an interactive chatbot using LlamaIndex and Microsoft Phi. Demoed on an air-gapped laptop at USPTO Community Day 2024.

  • CPC Definition Expansion Tool
    Sample code from a much larger project I am working on (to be released). The goal is to create human-readable definitions for 250,000 CPC symbols using LLMs and a deep respect for taxonomy. Finally, a way through the classification rabbit hole without losing your head 👑

📫 Find Me

Pinned

  1. USPTOCode/uspto-newsletter Public

    Python

  2. USPTOCode/expandedCPCdefinitions Public

    provides expanded CPC definitions

    HTML

  3. USPTOCode/MPEP-Chatbot Public

    RDMS MPEP to JSON MPEP

    HTML 1
