Scade Model Based System Testing

CTI-REALM: A new benchmark for end-to-end detection rule generation with AI agents

CTI-REALM is Microsoft’s open-source benchmark that evaluates AI agents on real-world detection engineering. It measures whether an agent can take cyber threat intelligence (CTI) and produce validated ...

United States Army

Solving The Wrong Problem: Lessons From The ATEC AI Challenge

In 2025, my team within the Soldier Evaluation Directorate won the U.S. Army Test and Evaluation Command (ATEC)’s AI Challenge with a tool that could ...

The Aviationist on MSN

Talon IQ Testbed Performs Simulated Combat Maneuvers Controlled by Hivemind and Prism AIs

The Talon IQ testbed conducted combat air patrol and target engagement maneuvers controlled by ShieldAI's Hivemind AI, before switching back to Northrop Grumman's Prism AI. Northrop Grumman and Shield ...

Design News

The Hidden Risk in Simulation-Driven Development

Teams must also be able to review and interpret these results much faster to effectively guide engineering decisions within ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results