Note
Use the latest macOS and Windows versions; older versions have limited support.
Note
During testing, it won't respond if you ask it a question directly. You need to simulate an interviewer asking the question, which it will then answer.
A real-time AI assistant that provides contextual help during video calls, interviews, presentations, and meetings using screen capture and audio analysis.
- Multiple AI Models: Choose between Google Gemini 2.0 Flash Live and OpenAI GPT-4 Real-time API
- Live AI Assistance: Real-time help powered by your preferred AI model
- Screen & Audio Capture: Analyzes what you see and hear for contextual responses
- Multiple Profiles: Interview, Sales Call, Business Meeting, Presentation, Negotiation
- Transparent Overlay: Always-on-top window that can be positioned anywhere
- Click-through Mode: Make window transparent to clicks when needed
- Cross-platform: Works on macOS and Windows; Linux support is experimental and currently intended for testing only
- Choose Your AI Model: Select between Gemini or OpenAI in the model dropdown
- Get an API Key:
  - Gemini: Visit Google AI Studio
  - OpenAI: Visit OpenAI Platform
- Install Dependencies:
  npm install
- Run the App:
  npm start
- Select your preferred AI model (Gemini or OpenAI) in the model dropdown
- Enter your API key for the selected model
- Choose your profile and language in settings
- Click "Start Session" to begin
- Position the window using keyboard shortcuts
- The AI will provide real-time assistance based on your screen and what the interviewer asks
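The steps above amount to collecting a few settings before a session starts. As a rough sketch only, a pre-flight check could look like the following. The names `MODELS`, `PROFILES`, and `validateSettings`, and the lowercase identifiers for models and profiles, are hypothetical and not taken from the app's source.

```typescript
// Hypothetical identifiers for the models and profiles this README lists.
const MODELS: readonly string[] = ["gemini", "openai"];
const PROFILES: readonly string[] = [
  "interview",
  "sales-call",
  "business-meeting",
  "presentation",
  "negotiation",
];

interface SettingsInput {
  model: string;
  apiKey: string;
  profile: string;
  language: string;
}

// Returns a list of human-readable problems; an empty list means
// the settings look complete enough to start a session.
function validateSettings(input: SettingsInput): string[] {
  const problems: string[] = [];
  if (MODELS.indexOf(input.model) === -1) {
    problems.push(`Unknown model: ${input.model}`);
  }
  if (input.apiKey.trim().length === 0) {
    problems.push("API key is empty");
  }
  if (PROFILES.indexOf(input.profile) === -1) {
    problems.push(`Unknown profile: ${input.profile}`);
  }
  if (input.language.trim().length === 0) {
    problems.push("Language is not set");
  }
  return problems;
}
```

A check like this only confirms that the fields are filled in; it cannot tell whether the API key is actually valid for the selected provider.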
- Window Movement: Ctrl/Cmd + Arrow Keys - Move window
- Click-through: Ctrl/Cmd + M - Toggle mouse events
- Close/Back: Ctrl/Cmd + \ - Close window or go back
- Send Message: Enter - Send text to AI
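To illustrate how the arrow-key movement might be computed, here is a minimal sketch of a pure function that shifts the window by a fixed step and keeps it on screen. The step size and the names `MoveDirection`, `WindowBounds`, `MOVE_STEP_PX`, and `moveWindow` are assumptions for illustration, not the app's actual implementation.

```typescript
// Hypothetical names throughout; not taken from the app's source.
type MoveDirection = "up" | "down" | "left" | "right";

interface WindowBounds {
  x: number;
  y: number;
  width: number;
  height: number;
}

// Assumed distance moved per key press, in pixels.
const MOVE_STEP_PX = 50;

function clamp(value: number, min: number, max: number): number {
  return Math.min(Math.max(value, min), max);
}

// Returns the window bounds after one arrow-key press, keeping the
// window fully inside the given display area.
function moveWindow(
  win: WindowBounds,
  area: WindowBounds,
  dir: MoveDirection
): WindowBounds {
  const dx = dir === "left" ? -MOVE_STEP_PX : dir === "right" ? MOVE_STEP_PX : 0;
  const dy = dir === "up" ? -MOVE_STEP_PX : dir === "down" ? MOVE_STEP_PX : 0;
  return {
    x: clamp(win.x + dx, area.x, area.x + area.width - win.width),
    y: clamp(win.y + dy, area.y, area.y + area.height - win.height),
    width: win.width,
    height: win.height,
  };
}
```

In an Electron main process, the result could plausibly be applied with `BrowserWindow.setBounds`, with the shortcut registered through `globalShortcut.register` using an accelerator such as `CommandOrControl+Up`. Whether the app wires it this way is not stated here.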
- macOS: SystemAudioDump for system audio
- Windows: Loopback audio capture
- Linux: Microphone input
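The per-platform audio capture listed above can be pictured as a simple lookup keyed on the operating system. The sketch below assumes the platform string follows Node.js `process.platform` values (`darwin`, `win32`, `linux`); the names `CaptureMethod` and `captureMethodFor` are hypothetical.

```typescript
// Hypothetical labels for the three capture methods this README lists.
type CaptureMethod = "SystemAudioDump" | "loopback" | "microphone";

// Maps a Node.js-style platform string to the capture method
// described for that operating system.
function captureMethodFor(platform: string): CaptureMethod {
  switch (platform) {
    case "darwin":
      return "SystemAudioDump"; // macOS: system audio
    case "win32":
      return "loopback"; // Windows: loopback audio capture
    case "linux":
      return "microphone"; // Linux: microphone input only
    default:
      throw new Error(`Unsupported platform: ${platform}`);
  }
}
```

Note that on Linux only the microphone is used, so audio played by the system (for example, the other side of a call heard through headphones) may not be captured.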
- Electron-compatible OS (macOS, Windows, Linux)
- API key for your chosen model (Gemini or OpenAI)
- Screen recording permissions
- Microphone/audio permissions