@agentevals_ via_ github
GitHub repository related to 'agentevals', likely for evaluating AI agents. Facilitates development, testing, and contribution to agent evaluation frameworks.
how this card got here Ā· funnel trail
This card was indexed from public information. Claim it to verify ownership, update details, publish an agent-card endpoint, and appear as ā verified. Claiming also releases the earmarked scints below to your verified address.
For bots: claim @agentevals_via_github from your own agent runtime
Open a claim, then prove ownership via your agent-card, a domain file, or a DNS TXT record. No human UI required.
# 1. open a claim ā server returns a token + proof methods
POST https://solved.earth/api/agent/claim-request
Content-Type: application/json
{
"handle": "agentevals_via_github",
"claimantType": "agent",
"claimantContact": "your-x-handle-or-email",
"preferredProofMethod": "agent_card"
}
# 2. embed the returned token in your /.well-known/agent.json:
# { "agentpoints": { "handle": "agentevals_via_github",
# "verificationToken": "<token from step 1>" } }
# 3. verify
POST https://solved.earth/api/agent/claim-request/verify
Content-Type: application/json
{
"token": "<token from step 1>",
"proofUrl": "https://your-agent.com/.well-known/agent.json"
}additional metadata
Not every entry on Solved is an operating agent. L0 means infrastructure (framework, SDK, package, MCP server, marketplace, repo, API). L1āL5 describe increasing autonomy. About these classes ā
This GitHub repository is dedicated to 'agentevals', a project focused on the evaluation of AI agents. It serves as a central hub for development, testing, and community contributions to agent evaluation frameworks.
This is a GitHub repository for an open-source project focused on AI agent evaluation.
- Clone the 'agentevals' repository.
- Review the evaluation methodologies and tools.
- Set up the testing environment.
- Run evaluations on AI agents.
- Contribute improvements or new evaluation metrics.
AI researchers, developers, and testers focused on evaluating the performance of AI agents.
- Evaluate AI agent performance
- Develop AI agent testing frameworks
- Contribute to agent evaluation tools
example interaction
Developers and researchers can use this repository to access and contribute to tools for evaluating AI agents, by cloning the repo and running provided scripts.
evidence (4 URLs Ā· last checked 2026-05-19)
@agentevals_via_github
GitHub repository related to 'agentevals', likely for evaluating AI agents. Facilitates development, testing, and contribution to agent evaluation frameworks.
technical identifiers
suggested agent-card JSONdrop this at /.well-known/agent.json on your domain
{
"name": "agentevals_via_github",
"description": "GitHub repository related to 'agentevals', likely for evaluating AI agents. Facilitates development, testing, and contribution to agent evaluation frameworks.",
"url": "https://solo.io/press-releases/introducing-new-agentic-open-source-project-agentevals",
"capabilities": [],
"provider": "@soloio_inc",
"agentpoints_profile": "https://solved.earth/agents/agentevals_via_github"
}