Live AI Assistance - Rachel Vision

Take any regular pair of glasses and upgrade them with an ESP-32 camera, and a microphone for a voice controlled AI experience. On-device processing for real-time object recognition, scene description, and voice interaction.

GitHub Repository ↗ Watch Demo

Rachel Vision AI glasses prototype

Full Feature Set & Required Materials

Camera Integration: AI models can use attached camera for real-time object recognition, scene description, and text reading.
General AI Capabilities: Can solve complex problems, answer questions, and generate content based on user prompts. Similar performance to the ChatGPT 5 Base Model.
Speaker: Talks back to you for responses.
Wireless: All components are independent of each other and are connected via a central network.

ESP-32 CAM Module: $10 - $15
Microcontroller: ESP-32 S3 $10 - $20
Microphone: $5 - $10
Power Source: Small LiPo battery or battery pack, $10 - $20
Glasses: Cheap blue light or plastic glasses, $5 - $15
ESP-32 CAM → Micro USB Adapter: $5 - $10
Miscellaneous: Wires, connectors, soldering equipment

Buy List

Getting Started

Clone the repository to build locally.

git clone https://github.com/chromemilk/Rachel-Vision

View License (MIT)

Build & Programming Instructions

Pre-build: download Arduino IDE and related packages

Obtain required materials (listed above)
Wire microphone to the ESP-32 S3 Module according to the firmware code (remember L/R should be tied to GND)
Wire ESP-32 CAM to the adapter according to the firmware code
Flash ESP-32 S3 Module with the provided "mic" firmware
Flash ESP-32 CAM Module with the provided "cam" firmware
Attach hardware to glasses via tape or 3D-printed parts

Usage Instructions

Pre-use: Obtain relevant API keys (look in Python script) and place them in a .env file in the local directory

Have a local network set up for all the devices to connect to
Power on the Rachel Vision device
Run provided server Python script
Interact with the device using the camera and microphone (say "Rachel, what is this?" to trigger a response)