- Input types
  - Voice - speech recognition
  - Eyes - eye tracker
  - Hands - keyboard + mouse (default)
- Output types
  - Audio
  - Visual (default)
- Settings commands
  - Change Language - change the language
  - Help - open the help menu
  - Exit Program - exit the program
  - Input - change the input type
  - Output - change the output type
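A minimal sketch of how this state could be held in code; the names (InputType, OutputType, Settings) are illustrative assumptions, not an existing API:
"""
from dataclasses import dataclass
from enum import Enum

class InputType(Enum):
    VOICE = "voice"    # speech recognition
    EYES = "eyes"      # eye tracker
    HANDS = "hands"    # keyboard + mouse

class OutputType(Enum):
    AUDIO = "audio"
    VISUAL = "visual"

@dataclass
class Settings:
    input_type: InputType = InputType.HANDS      # default, per the menu above
    output_type: OutputType = OutputType.VISUAL  # default, per the menu above
    language: str = "en"
"""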
What do you want to do?
- User states their main objective; the LLM may
  - issue instructions to NavBot -> describe web contents and options for the user
  - ask the user to clarify
- A thread waits to receive input from any source (dispatch sketched below)
  - if the input is audio, transcribe it to English and send the instructions to the crawler
  - if the input is video,
- Perform the action and output the results
  - if the action could not be performed, append that to the response
- Output
  - if the output is audio, describe the contents of the website and the possible actions
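A minimal sketch of that receive-and-dispatch loop, with stub helpers standing in for the real transcription, crawler, and output modules (none of these names are final):
"""
import queue
import threading

input_queue = queue.Queue()   # every input source posts (kind, data) events here

def transcribe(audio_bytes):
    return "stub transcription"               # real module: speech-to-text

def send_to_crawler(instruction):
    return f"Crawler handled: {instruction}"  # real module: performs the action

def send_output(message):
    print(message)                            # real module: audio or visual output

def input_worker():
    # Wait on the queue for input from any source and route it by modality.
    while True:
        kind, data = input_queue.get()        # blocks until an input source posts
        if kind == "audio":
            data = transcribe(data)           # audio -> English text first
        result = send_to_crawler(data)
        send_output(result)                   # describe the page and possible actions

threading.Thread(target=input_worker, daemon=True).start()
input_queue.put(("text", "open example.com"))  # in the real program, main keeps running
"""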
"NavBot Settings" is a keyword to access the settings
- Change langua 8000 ge
- Help - lists all settings and what it can do
- Change input type
- Change output type
- Disable/Enable keyword speaking.
- Transcription recognize keyword (Hey NavBot)
- Use a VectorDB to match an instruction to the corresponding setting (see the sketch below)
- Use an LLM to parse the results of the instruction
- Use the VectorDB to verify the results of the instruction
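One way to do the instruction-to-setting matching is nearest-neighbor search over embeddings. The sketch below assumes OpenAI's embeddings endpoint as the backend and cosine similarity in NumPy; any embedding model would do:
"""
import numpy as np
from openai import OpenAI

client = OpenAI()   # assumes OPENAI_API_KEY is set in the environment
SETTINGS = ["change language", "help", "change input type",
            "change output type", "toggle keyword speaking"]

def embed(texts):
    resp = client.embeddings.create(model="text-embedding-3-small", input=texts)
    return np.array([d.embedding for d in resp.data])

setting_vecs = embed(SETTINGS)   # precompute once at startup

def match_setting(instruction):
    # Return the setting whose embedding is closest to the instruction's.
    v = embed([instruction])[0]
    sims = setting_vecs @ v / (np.linalg.norm(setting_vecs, axis=1) * np.linalg.norm(v))
    return SETTINGS[int(np.argmax(sims))]

# e.g. match_setting("switch me over to eye tracking") -> "change input type"
"""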
- Settings implementation
- Implement mic input and transcription module (sketched after this list) - https://platform.openai.com/docs/guides/speech-to-text/prompting
- Implement eye tracker input module
- Implement keyboard input module
- Implement audio output module
- Implement simpler visual output module
- Connect inputs to crawler
- Handle reading output for the different modules
- Push to activate input
- Pause button to stop input
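A sketch of the mic-input and transcription module referenced above. It assumes the `sounddevice` and `soundfile` packages for capture, and uses the OpenAI translations endpoint (which returns English text whatever language was spoken, matching the transcribe-to-English note); fixed-length clips stand in for push-to-record:
"""
import queue
import sounddevice as sd
import soundfile as sf
from openai import OpenAI

client = OpenAI()
SAMPLE_RATE = 16_000

def record_seconds(seconds):
    audio = sd.rec(int(seconds * SAMPLE_RATE), samplerate=SAMPLE_RATE, channels=1)
    sd.wait()                                 # block until the recording finishes
    return audio

def transcribe_to_english(audio):
    sf.write("clip.wav", audio, SAMPLE_RATE)
    with open("clip.wav", "rb") as f:
        # The translations endpoint outputs English regardless of input language.
        result = client.audio.translations.create(model="whisper-1", file=f)
    return result.text

def mic_worker(transcripts):
    # Capture a clip, transcribe it, and queue the text for the main loop.
    while True:
        text = transcribe_to_english(record_seconds(5.0))
        transcripts.put(("text", text))
"""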
- Colorblindness handling
- Large text
- High contrast
- Remove pop ups
- Salient information only (page restyling sketched below)
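One way to apply the large-text, high-contrast, and pop-up-removal items above, assuming the crawler drives the page through Playwright (an assumption; the overlay selectors are guesses and vary per site):
"""
# `page` is a Playwright page object supplied by the crawler.
ACCESSIBILITY_CSS = (
    "* { font-size: 150% !important; }"   # large text
    " body { background: #000 !important; color: #fff !important; }"  # high contrast
)

def apply_accessibility(page):
    page.add_style_tag(content=ACCESSIBILITY_CSS)
    # Remove common pop-up/overlay elements; selectors are illustrative only.
    page.evaluate(
        "document.querySelectorAll("
        "'[class*=modal], [class*=popup], [id*=overlay]'"
        ").forEach(el => el.remove())"
    )
"""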
- Answer questions about images
- Ask user for input
- Navigate to web page
- Read the web page, summarize its contents, get links and key elements (sketched below)
- Should I continue to issue commands and navigate or should I ask the user to clarify?
- If I ask the user to clarify, I will describe the web page to them and the options they have.
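A sketch of the page-reading step, using requests plus BeautifulSoup as one possible stack; the 20-link cap is an arbitrary illustration of keeping only key elements:
"""
import requests
from bs4 import BeautifulSoup

def read_page(url):
    html = requests.get(url, timeout=10).text
    soup = BeautifulSoup(html, "html.parser")
    return {
        "title": soup.title.string if soup.title else url,
        "headings": [h.get_text(strip=True) for h in soup.find_all(["h1", "h2"])],
        "links": [(a.get_text(strip=True), a["href"])
                  for a in soup.find_all("a", href=True)][:20],  # key elements only
    }

# The returned dict is what the LLM summarizes and turns into options for the user.
"""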
Start a thread to get input data, passing in the queue that transcriptions are put into.
Main will get the instruction and decide whether to issue a command or to ask the user to clarify (loop sketched below).
While it is issuing commands:
  If it issues a command:
  - it says what it is doing
  - and performs that command
  If it asks a question:
  - it sends that question
Options:
- inputs: user, new page
- if the user says something: generate a new objective from the chat history
- generate a command from the objective and the web content; if the command is "finish", break the loop and wait for user input
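A sketch of that loop; the LLM and crawler calls are stubs, and the "ask:" prefix for clarifying questions is an invented convention for illustration:
"""
import queue

def llm_objective(chat_history):
    return "stub objective"    # real call: LLM generates an objective from chat history

def llm_command(objective, page):
    return "finish"            # real call: LLM generates the next command, or "finish"

def perform(command, page):
    return page                # real call: crawler performs the command

def send_output(message):
    print(message)             # real module: audio or visual output

def main_loop(transcripts, page, chat_history):
    while True:
        user_text = transcripts.get()       # blocks until the input thread posts
        chat_history.append(("user", user_text))
        objective = llm_objective(chat_history)
        while True:                         # keep issuing commands...
            command = llm_command(objective, page)
            if command == "finish":         # ...until the model says it is done,
                break                       # then wait for the next user input
            if command.startswith("ask:"):  # clarifying question instead of a command
                send_output(command[len("ask:"):])
                break
            send_output(f"I am going to {command}.")  # says what it is doing
            page = perform(command, page)             # performs that command
"""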
Audio is asynchronous so..
""" User:
- brand preferences:
- products wanted:
sephora maybelline mac nordstrom ulta nyx cosmetics beauty sense """
""" except requests.exceptions.Timeout: send_output( "I'm sorry, I'm having trouble connecting to the large language model. Please try again later.", chat_history, output_thread) break """