jmerelnyc/Photo-agents
A Python package providing a perceive-reason-act agent loop enabling vision-grounded autonomous agents to operate computers with layered memory and self-authored skills.

Photo Agents is a runtime for building vision-grounded autonomous agents that operate computers like humans do. It implements a streaming agent loop that drives tool-calling LLMs through a perceive, reason, act cycle with layered observation storage for memory. The framework includes a multi-provider LLM router and enables agents to write their own skills from successful experiences. It runs locally to preserve user privacy and data ownership.