Real-time multi-modal AI perception system for Windows that fuses screen activity, camera vision, and voice input to generate contextual awareness.
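
As a rough illustration of what fusing the three inputs could look like, the sketch below keeps the most recent observation from each modality and merges the ones that are still fresh into a single context snapshot. Every name in it (`Observation`, `ContextFuser`, the `max_age_s` freshness window) is hypothetical and not taken from this project; treat it as a minimal sketch under those assumptions, not a description of the system's actual pipeline.

```python
from dataclasses import dataclass
from typing import Dict, Optional

# The three inputs named in the description above.
MODALITIES = ("screen", "camera", "voice")


@dataclass(frozen=True)
class Observation:
    """One timestamped reading from a single modality (hypothetical type)."""
    modality: str
    timestamp: float  # seconds on any shared clock
    summary: str


class ContextFuser:
    """Keeps the latest observation per modality and merges the fresh ones."""

    def __init__(self, max_age_s: float = 5.0) -> None:
        # Assumed freshness window; observations older than this are dropped.
        self.max_age_s = max_age_s
        self._latest: Dict[str, Observation] = {}

    def update(self, obs: Observation) -> None:
        if obs.modality not in MODALITIES:
            raise ValueError(f"unknown modality: {obs.modality!r}")
        current = self._latest.get(obs.modality)
        # Ignore out-of-order observations older than the one already held.
        if current is None or obs.timestamp >= current.timestamp:
            self._latest[obs.modality] = obs

    def snapshot(self, now: float) -> Dict[str, Optional[str]]:
        """Return one summary per modality, or None if missing or stale."""
        result: Dict[str, Optional[str]] = {}
        for modality in MODALITIES:
            obs = self._latest.get(modality)
            fresh = obs is not None and (now - obs.timestamp) <= self.max_age_s
            result[modality] = obs.summary if fresh else None
        return result


fuser = ContextFuser(max_age_s=5.0)
fuser.update(Observation("screen", 10.0, "editor window focused"))
fuser.update(Observation("voice", 12.0, "user asked about a build error"))
context = fuser.snapshot(now=13.0)
```

Here `context` holds the screen and voice summaries and `None` for the camera, since no camera observation was supplied. A timestamp-based freshness window is only one simple way to combine streams that arrive at different rates; a real system might weight or align the modalities differently.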