Claude Skill

MindTrial

by petmal

MindTrial: Evaluate and compare AI language models (LLMs) on text-based tasks with optional file/image attachments and tool use. Supports multiple providers (OpenAI, Google, Anthropic, DeepSeek, Mistral AI, xAI, Alibaba, Moonshot AI, OpenRouter), custom tasks in YAML, and HTML/CSV reports.
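Custom tasks are authored in YAML. The fragment below is only an illustrative sketch — the field names (`tasks`, `name`, `prompt`, `expected-result`) are assumptions for the sake of example, not MindTrial's documented schema:

```yaml
# Hypothetical task definition — field names are assumed,
# not taken from MindTrial's actual task format.
tasks:
  - name: capital-city
    prompt: "What is the capital of Australia? Answer with the city name only."
    expected-result: "Canberra"
```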

5 stars · 1 fork · Active · Go

Overall score: 73 (Good)

📊 Score Breakdown

🛡️ Security (30%): 5.0/5
Utility (30%): 1.0/5
🔄 Maintenance (25%): 5.0/5
💎 Uniqueness (15%): 4.0/5

Overall = Security (30%) + Utility (30%) + Maintenance (25%) + Uniqueness (15%). Full methodology →

ℹ️ Details

Category: 💻 Code Execution & Dev Tools
Ecosystem: Claude Skill
Language: Go
Pricing: Free
License: MPL-2.0
Status: Active
Platforms: claude

📈 GitHub Signals

Stars: 5
Forks: 1
Commits (30d): 0
Open issues: 0
Last commit: 1 month ago

ai-benchmark, ai-evaluation-tools, ai-model-comparison, ai-tool, anthropic, artificial-intelligence-projects, deepseek, google-gemini-ai, grok-ai, language-models-ai, llm-benchmarking, llm-comparison, llm-evaluation-framework, mistral-ai, moonshot-ai, openai, openrouter, opensource, qwen, xai


Data last verified: 1 month ago.