Offensive Cybersecurity AI Time Horizons

Source: Lyptus Research | Date: April 2026 | Score: 4.5/5
TL;DR: AI offensive cybersecurity capabilities are improving rapidly with a doubling time of just 5.7 months (recent models). The most capable models can now succeed on 50% of tasks taking human experts 3+ hours.

Key Findings

Doubling Time: 9.8 months (2019-present), steepening to 5.7 months for models released since 2024

Critical Incidents

Methodology

Uses METR's time-horizon methodology - measuring AI capability growth in human-equivalent task time. Tasks labeled by time skilled human would take. Model's time horizon at given success rate is human-time difficulty at which its fitted success curve crosses that threshold.

Benchmarks Used

Implications

This research suggests that frontier AI offensive-cyber capability may diffuse into open-weight form on relatively short timelines. The ecological validity is limited to bounded and verifiable offensive subtasks rather than full scope of real-world operations.

Warning: These results are lower bounds on early-2026 frontier capability due to evaluation budget limitations.