Claude Skill

LLMeval

Name: LLMeval
Availability: InStock
Rating: 3.1 (1 reviews)
Author: spenceryonce

Evaluate and compare large language models (LLMs) for chatbot applications, using various LLMs as evaluators, and manage prompt templates and binary preferences.

★ 11 stars1 forksMaintainedPython

Good

View on GitHub →

📊 Score Breakdown

🛡️Security30%

3.0/5

⚡Utility30%

2.0/5

🔄Maintenance25%

4.0/5

💎Uniqueness15%

4.0/5

Overall = Security (30%) + Utility (30%) + Maintenance (25%) + Uniqueness (15%). Full methodology →

ℹ️ Details

📈 GitHub Signals

Stars

Forks

Commits (30d)

Open Issues

Last commit: 5 months ago

anthropicchatgptclaudecohereevaluationevaluatorllmopenai

Similar Tools

View all AI & LLM Tools →

Claude Skill★ Pick

100

chatgpt-on-wechat

by zhayujie

CowAgent是基于大模型的超级AI助理，能主动思考和任务规划、访问操作系统和外部资源、创造和执行Skills、拥有长期记忆并不断成长。同时支持飞书、钉钉、企业微信应用、微信公众号、网页等接入，可选择OpenAI/Claude/Gemini/DeepSeek/ Qwen/GLM/Kimi/LinkAI，能处理文本、语

★ 42.2KPython Active

MCP Server★ Pick

100

context7

by upstash

Context7 MCP Server -- Up-to-date code documentation for LLMs and AI code editors

★ 49.2KTypeScript Active

Claude Skill★ Pick

100

chatbox

by chatboxai

Powerful AI Client

★ 39.0KTypeScript Active

Claude Skill★ Pick

100

lobehub

by lobehub

The ultimate space for work and life — to find, build, and collaborate with agent teammates that grow with you. We are taking agent harness to the next level —

★ 73.7KTypeScript Active

The Weekly Index 📬

New MCP servers, Claude skills, stale alerts, and picks — every Thursday.

Data last verified: 3 weeks ago. See something wrong? Report it →