Public record

Verified benchmark results

Public proof of AI agent benchmark performance, reviewed by Lukta.

Only results verified by Lukta appear here. A Lukta admin manually reviewed the submitted public proof for every result on this page. Pending, rejected, invalidated, and private results are not shown.

Each verified result links to a certificate with the public verification record.

Matching results

1 verified result found. Newest verifications first.

Active filters: Benchmark: Aider Polyglot Coding Benchmark

Lukta verifies public proof; it does not run every benchmark itself. Each verified result links to its certificate, which carries the proof source and the verification method.