An AI-powered stealth overlay application with screen capture protection, system audio recording, and multimodal AI integration using Google Gemini.
- Stealth Overlay: Content protection prevents screen capture in recordings/streams
- System Audio Recording: Capture and analyze audio playing on your computer
- Screenshot Analysis: AI-powered screen capture and analysis
- Chat Interface: Clean chat UI for AI interactions
- Global Hotkeys: Quick access from anywhere
- Multimodal AI: Text, image, and audio processing with Google Gemini
Node.js:
- Download and install from nodejs.org

FFmpeg:
Windows:
- Download from ffmpeg.org
- Extract to C:\ffmpeg
- Add C:\ffmpeg\bin to your PATH environment variable
macOS:
brew install ffmpeg
Linux:
sudo apt install ffmpeg

SoX:
Windows:
- Download from sox.sourceforge.net
- Install to the default location
- Add the SoX installation directory to PATH
macOS:
brew install sox
Linux:
sudo apt install sox

VB-Audio Virtual Cable (Windows):
- Download from vb-audio.com
- Install the Virtual Cable driver
- Restart your computer after installation
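
Once FFmpeg and SoX are installed, it can help to confirm that both are reachable on PATH before starting the app. The following is a minimal sketch, not part of the project: the helper names `commandAvailable` and `missingPrerequisites` are hypothetical, and it only relies on Node's built-in `child_process` module and each tool's version flag.

```javascript
const { spawnSync } = require('child_process');

// Returns true when `command` can be launched and exits with status 0.
function commandAvailable(command, args) {
  const result = spawnSync(command, args, { stdio: 'ignore' });
  // result.error is set (for example ENOENT) when the executable is not found.
  return !result.error && result.status === 0;
}

// FFmpeg takes a single-dash flag; SoX takes a double-dash flag.
const PREREQUISITES = [
  { name: 'FFmpeg', command: 'ffmpeg', args: ['-version'] },
  { name: 'SoX', command: 'sox', args: ['--version'] },
];

// Returns the names of the tools that could not be run.
function missingPrerequisites(list = PREREQUISITES) {
  return list
    .filter((tool) => !commandAvailable(tool.command, tool.args))
    .map((tool) => tool.name);
}
```

Calling `missingPrerequisites()` returns an empty array when both tools respond; any name it returns points to a PATH problem for that tool.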
Gemini API key:
- Visit Google AI Studio
- Create a new API key
- Copy the key for later use
git clone <repository-url>
cd stealth-ai-overlay
npm install
Create a .env file in the root directory:
GEMINI_API_KEY=your_gemini_api_key_here
GEMINI_MODEL=gemini-1.5-pro-latest
OVERLAY_OPACITY=0.9
AUDIO_SAMPLE_RATE=16000
SCREENSHOT_FORMAT=jpeg
SCREENSHOT_QUALITY=0.8
LOG_LEVEL=info
- Right-click the sound icon in the system tray → "Open Sound settings" → "Sound Control Panel"
- In the Playback tab:
  - Right-click "CABLE Input (VB-Audio Virtual Cable)"
  - Select "Set as Default Device"
  - Select "Set as Default Communications Device"
- Right-click "CABLE Input" → "Properties"
  - Go to the "Listen" tab
  - Check "Listen to this device"
  - Select your Headphones/Speakers in the dropdown
  - Click "Apply" and "OK"
- In the Recording tab:
  - Right-click "CABLE Output (VB-Audio Virtual Cable)"
  - Select "Set as Default Device"
  - Select "Set as Default Communications Device"
📢 PLAYBACK (Default): CABLE Input VB-Audio Virtual Cable
🎤 RECORDING (Default): CABLE Output VB-Audio Virtual Cable
🎧 LISTEN THROUGH: Your Headphones/Speakers
System Audio → CABLE Input → CABLE Output → App Recording
                    ↓
          Your Headphones (for monitoring)
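
With CABLE Output set as the default recording device, a SoX recording from the default input picks up system audio. The sketch below shows one way such a command line could be assembled. It is an assumption, not the app's actual recorder: `buildSoxRecordArgs` is a hypothetical helper, and the default sample rate mirrors `AUDIO_SAMPLE_RATE=16000` from the `.env` example.

```javascript
// Builds an argument list for the `sox` CLI that records from the default
// audio device into a file. All names here are illustrative.
function buildSoxRecordArgs({ outFile, sampleRate = 16000, channels = 1, seconds }) {
  const args = [
    '-d',                      // read from the default audio device
    '-r', String(sampleRate),  // output sample rate in Hz
    '-c', String(channels),    // output channel count
    '-b', '16',                // 16-bit samples
    outFile,
  ];
  if (seconds !== undefined) {
    // The `trim` effect stops the recording after the given duration.
    args.push('trim', '0', String(seconds));
  }
  return args;
}
```

The resulting array can be passed to `child_process.spawn('sox', args)`. For example, `buildSoxRecordArgs({ outFile: 'temp/recording.wav', seconds: 10 })` describes a 10-second mono recording at 16 kHz.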
Run the app:
npm start
Build and package for distribution:
npm run build
npm run dist
- Ctrl+Shift+H: Hide/Show overlay
- Ctrl+Shift+A: Open/Close AI chat
- Ctrl+Shift+S: Take screenshot + AI analysis
- Ctrl+Shift+Space: Start/Stop system audio recording
- Ctrl+\: Hide/Show ALL windows
- 🎵 Audio Button: Record system audio
- 📷 Screenshot Button: Capture and analyze screen
- Text Input: Ask AI questions
- 🗑️ Clear: Clear chat history
- 📤 Export: Export chat as JSON
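
The global hotkeys listed above map naturally onto Electron's `globalShortcut` module. The following is a sketch under assumptions: the action names and the `registerHotkeys` helper are hypothetical, and the accelerator strings simply mirror the key combinations in this README. The `globalShortcut` object is passed in as a parameter so the function can be exercised without Electron.

```javascript
// Accelerator strings follow the key combinations documented above.
// The action names on the right are illustrative, not the app's real handlers.
const HOTKEYS = {
  'Ctrl+Shift+H': 'toggleOverlay',
  'Ctrl+Shift+A': 'toggleChat',
  'Ctrl+Shift+S': 'screenshotAndAnalyze',
  'Ctrl+Shift+Space': 'toggleAudioRecording',
  'Ctrl+\\': 'toggleAllWindows',
};

// Registers every hotkey and returns the accelerators that could not be
// registered (for example because another application already holds them).
function registerHotkeys(globalShortcut, handlers) {
  const failed = [];
  for (const [accelerator, action] of Object.entries(HOTKEYS)) {
    const handler = handlers[action];
    const ok =
      typeof handler === 'function' && globalShortcut.register(accelerator, handler);
    if (!ok) failed.push(accelerator);
  }
  return failed;
}

// In an Electron main process this would typically be wired up as:
//   const { app, globalShortcut } = require('electron');
//   app.whenReady().then(() => registerHotkeys(globalShortcut, handlers));
//   app.on('will-quit', () => globalShortcut.unregisterAll());
```

Returning the failed accelerators rather than throwing lets the app keep running and report which shortcuts are unavailable.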
When using voice chat applications:
- Discord: Settings → Voice & Video
  - Input Device: "External Microphone" (your real mic)
  - Output Device: "CABLE Input (VB-Audio Virtual Cable)"
- Google Meet: Click the Settings gear
  - Microphone: "External Microphone"
  - Speakers: "CABLE Input (VB-Audio Virtual Cable)"
This allows you to:
- Talk using your real microphone
- Capture Discord/Meet audio with the app
- Hear everything through your headphones
- Restart the application
- Check if all dependencies are installed
- Verify VB Cable installation:
  - Check that "CABLE Input" and "CABLE Output" appear in Sound settings
  - Restart your computer if you just installed it
- Check SoX installation:
  sox --version
- Audio configuration:
  - Ensure CABLE Input is the default playback device
  - Ensure CABLE Output is the default recording device
  - Test by playing music; you should hear it in your headphones
- Verify the API key in the .env file
- Check your internet connection
- Ensure the API key has proper permissions
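
To verify the `.env` file, a small standalone check can catch the most common mistakes, such as leaving the placeholder key in place. This is a sketch only: `parseEnv` and `findEnvProblems` are hypothetical helpers, the parser handles plain `KEY=value` lines without quoting, and the 0 to 1 range for `OVERLAY_OPACITY` and `SCREENSHOT_QUALITY` is an assumption based on the example values.

```javascript
// Parses KEY=value lines. Blank lines and lines starting with # are skipped.
// Quoted values and inline comments are not handled in this sketch.
function parseEnv(text) {
  const env = {};
  for (const rawLine of text.split(/\r?\n/)) {
    const line = rawLine.trim();
    if (!line || line.startsWith('#')) continue;
    const eq = line.indexOf('=');
    if (eq === -1) continue;
    env[line.slice(0, eq).trim()] = line.slice(eq + 1).trim();
  }
  return env;
}

// Returns a list of human-readable problems; an empty list means no issue found.
function findEnvProblems(env) {
  const problems = [];
  const key = env.GEMINI_API_KEY;
  if (!key || key === 'your_gemini_api_key_here') {
    problems.push('GEMINI_API_KEY is missing or still set to the placeholder');
  }
  // Assumed range: both settings look like fractions between 0 and 1.
  for (const name of ['OVERLAY_OPACITY', 'SCREENSHOT_QUALITY']) {
    if (env[name] === undefined) continue;
    const value = Number(env[name]);
    if (env[name] === '' || Number.isNaN(value) || value < 0 || value > 1) {
      problems.push(`${name} should be a number between 0 and 1`);
    }
  }
  return problems;
}
```

In practice the text would come from `fs.readFileSync('.env', 'utf8')`, and each returned problem can be printed before launching the app.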
- Run as administrator (Windows)
- Check if other screen capture software is running
- Verify FFmpeg installation
stealth-ai-overlay/
├── main.js                 # Main Electron process
├── overlay.html            # Overlay window UI
├── test-modal.html         # Chat interface UI
├── package.json            # Dependencies and scripts
├── .env                    # Environment variables
├── src/
│   ├── ai/
│   │   └── gemini.js       # Gemini AI client
│   ├── audio/
│   │   └── recorder.js     # Audio recording module
│   ├── screen/
│   │   └── capturer.js     # Screen capture module
│   └── utils/
│       └── logger.js       # Logging utility
├── temp/                   # Temporary files (auto-created)
└── logs/                   # Application logs (auto-created)
- Content Protection: Prevents screen capture of overlay windows
- Stealth Mode: No persistent chat history
- Temporary Files: Audio recordings automatically deleted
- No Other External Services: All processing is done locally, apart from requests to the Gemini API
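
Content protection of this kind is typically built on Electron's `BrowserWindow.setContentProtection`, which asks the operating system to exclude a window from screen capture on Windows and macOS. The sketch below is an assumption about how the overlay might be configured, not the project's code: `applyStealthSettings` is a hypothetical helper, and the always-on-top setting is an illustrative extra.

```javascript
// `win` is expected to be an Electron BrowserWindow, or any object that
// exposes the same methods (which keeps this sketch testable without Electron).
function applyStealthSettings(win) {
  // Ask the OS to leave this window out of screenshots and recordings.
  win.setContentProtection(true);
  // Assumed extra: keep the overlay above other windows.
  win.setAlwaysOnTop(true, 'screen-saver');
}

// In an Electron main process:
//   const { BrowserWindow } = require('electron');
//   const overlay = new BrowserWindow({ transparent: true, frame: false });
//   applyStealthSettings(overlay);
```

How well the window stays hidden depends on the operating system and on the capture tool in use, so it is worth testing against the recording software you care about.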
- "Recording already in progress":
  - Wait for the current recording to finish
  - Restart the application if it is stuck
- "No audio captured":
  - Check the VB Cable configuration
  - Ensure system audio is playing
  - Verify the default audio devices
- "API key error":
  - Check the .env file format
  - Regenerate the API key if needed
- Audio Test:
  - Play music on your computer
  - You should hear it in your headphones
  - Press Ctrl+Shift+Space to record
  - AI should describe the music
- Screenshot Test:
  - Press Ctrl+Shift+S
  - AI should describe what's on your screen
- Chat Test:
  - Press Ctrl+Shift+A
  - Type a message and press Enter
  - AI should respond
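
If the screenshot or chat test fails, it can be useful to see what a multimodal request to the Gemini REST API looks like outside the app. The following is a minimal sketch under assumptions: the README does not say whether the app uses the REST endpoint or an SDK, `buildGenerateContentRequest` is a hypothetical helper, and the default MIME type mirrors `SCREENSHOT_FORMAT=jpeg`.

```javascript
// Builds the URL and fetch options for a generateContent call that sends a
// text prompt and, optionally, a base64-encoded image.
function buildGenerateContentRequest({ apiKey, model, prompt, imageBase64, mimeType = 'image/jpeg' }) {
  const parts = [{ text: prompt }];
  if (imageBase64) {
    parts.push({ inline_data: { mime_type: mimeType, data: imageBase64 } });
  }
  return {
    url: `https://generativelanguage.googleapis.com/v1beta/models/${model}:generateContent`,
    options: {
      method: 'POST',
      headers: { 'Content-Type': 'application/json', 'x-goog-api-key': apiKey },
      body: JSON.stringify({ contents: [{ parts }] }),
    },
  };
}
```

With Node 18 or later, the result can be sent with `const { url, options } = buildGenerateContentRequest({...}); const res = await fetch(url, options);`. The generated text is normally found under `candidates[0].content.parts[0].text` in the JSON response.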
MIT License - See LICENSE file for details
- Fork the repository
- Create a feature branch
- Commit your changes
- Push to the branch
- Create a Pull Request