microsoft/maro
Microsoft's Multi-Agent Resource Optimization (MARO) platform applies reinforcement learning to real-world resource optimization problems like logistics, inventory management, and transportation.

MARO is a Reinforcement Learning as a Service platform designed for resource optimization across domains including logistics, supply chain, and transportation. It provides multi-agent simulation environments and RL algorithms to train agents that make optimized decisions for complex operational problems such as bike rebalancing, container inventory management, and fleet scheduling.