prompt-atlas
A curated, versioned, searchable library of production-grade prompts for LLM trainers, AI product managers, prompt engineers, RLHF / SFT data teams, model evaluation teams, and AI application builders.
一个精选、带版本、可检索的生产级 Prompt 库,面向 LLM trainer、AI 产品 经理、Prompt 工程师、RLHF / SFT 数据团队、模型评估团队、AI 应用开发者。
🌐 Live site / 在线站点: huck012428-lab.github.io/prompt-atlas — searchable, sidebar navigation, copy-button on every prompt block. 网页版含全站搜索、侧边栏导航、prompt 一键复制。
This is not a "awesome prompts" snippet collection. Every entry is a Prompt Card: a reusable work asset with metadata, variables, examples, documented failure modes, and tuning notes.
这不是 awesome-prompts 式的素材合集。每一个条目都是一张 Prompt Card:带元数据、变量、示例、失败模式、调优笔记的可复用工作资产。
Why this exists / 为什么做这个
Production prompt work has the same problems as any other engineering discipline: people rewrite the same prompts from scratch, lose track of what works on which model, and discover failure modes the third time they ship them. Treating prompts as cards — with schema, examples, and documented failure modes — makes them reusable across teams and over time.
生产环境的 Prompt 工作和任何工程学科一样会踩同样的坑:每次都从零写、 记不清哪条 prompt 在哪个模型上稳、上线第三次才发现固定的失败模式。把 prompt 当作"卡片"——有 schema、有示例、有失败模式记录——它们才能在 团队之间和时间线上被真正复用。
What's inside / 库里有什么
Cards are organised by direction:
卡片按技术方向组织:
| Direction / 方向 | Examples / 内容举例 |
|---|---|
| RAG | Retrieval scoring, multi-hop eval synthesis, query rewriting, HyDE, citation faithfulness, answer grounding 检索打分、多跳评测题合成、query 改写、HyDE、citation 忠实度、答案扎根性 |
| Agent | ReAct planners with strict tool-call schemas 带严格 tool-call schema 的 ReAct planner |
| RLHF | Pairwise preference labelers across HHH dimensions HHH 三维度的 pairwise 偏好标注器 |
| SFT | Instruction-set augmentation from seed examples 从种子样本扩展 SFT 指令集 |
| Multimodal | VLM caption verification against actual images VLM caption 与图像内容核对 |
| CoT | Structured reasoning with rationale summaries 结构化推理 + rationale 摘要 |
| Eval | LLM-as-judge rubrics for open-ended outputs 开放式输出的 LLM-as-judge rubric |
| Code | Code review checklist, test generation, code explanation, refactor suggestions, code-eval judge 结构化 code review、测试生成、代码解释、重构建议、代码评估 |
The complete catalog lives in INDEX.md (auto-generated).
完整目录见 INDEX.md(自动生成)。
I want to... / 我想做...
Maps a goal to the card to use. New here? See docs/QUICKSTART.md for a 5-minute walkthrough.
第一次用?看 docs/QUICKSTART.md — 5 分钟从零到能用一张卡。
Evaluate / score AI outputs · 评估和打分
| Goal · 我想做 | Card · 用这张卡 |
|---|---|
| Score one AI output on factuality / coherence / completeness · 给单个 AI 输出按多维度打分 | eval/llm-judge-rubric-open-ended |
| Compare a model output against a gold answer · 用 gold 答案对照打分 | eval/reference-based-judge |
| Decompose an output into atomic claims and fact-check each · 把答案拆成原子事实逐条核查 | eval/per-claim-factuality-judge |
| Score one output on custom dimensions with confidence · 自定义维度打分 + 置信度 | eval/pointwise-quality-scorer |
| Classify an AI output for safety harms · 输出安全分类(allow/review/block) | eval/safety-output-classifier |
| Pick the best of N AI responses · 从 N 个回答里选最好的 | rlhf/best-of-n-selector |
| Label A vs B preference (HHH) · 给 A/B 两个回答打偏好标签 | rlhf/pairwise-preference-labeler |
| Pairwise judge with position-bias detection (two-call protocol) · 带位置偏置检测的 pairwise judge(双向调用) | eval/pairwise-judge-with-position-bias-probe |
| Judge a multi-turn dialogue (per-turn + conversation-level) · 多轮对话评估 | eval/multi-turn-dialogue-judge |
| Generate a domain-specific rubric with level anchors · 给具体任务自动生成定制化评分 rubric | eval/rubric-generator |
| Compare baseline vs candidate outputs and detect regressions · 检测候选版本是否退步 | eval/regression-detector |
| Diagnose LLM judge biases (length / position / format) · 诊断 LLM judge 自身偏见 | eval/judge-bias-probe |
| Check confidence calibration (predicted vs actual accuracy) · 检查置信度是否校准 | eval/calibration-checker |
| Bootstrap a small human eval study (rubric + calibration + analysis) · 设计小规模 human eval 研究 | eval/human-eval-bootstrap |
| Build a multi-benchmark leaderboard with weighting · 多 benchmark 加权 leaderboard | eval/leaderboard-builder |
| Diagnose refusal calibration (over / under / correct) · 诊断模型拒绝是否校准 | rlhf/refusal-calibration-probe |
| Generate iterative DPO pairs targeting a specific principle · 按原则生成 DPO 偏好对 | rlhf/iterative-dpo-pair-generator |
| Score whether a response matches a defined persona / brand voice · 评估回答是否符合人设 | rlhf/persona-consistency-judge |
| Detect over-cautious vs unsafe-helpful (HHH tradeoff scoring) · 诊断 helpful 和 harmless 之间的失衡 | rlhf/helpfulness-vs-harmlessness-tradeoff |
| Pairwise preference for long-form (long input + long output) · 长输入长输出的 pairwise 偏好 | rlhf/long-context-preference-labeler |
| Analyze SFT dataset coverage by topic / skill · 分析 SFT 数据集覆盖度,找 gap | sft/data-coverage-analyzer |
| Classify instruction difficulty for a target model class · 按目标模型类别给指令打难度 | sft/instruction-difficulty-classifier |
| Generate response in a defined persona with strictness control · 按人设生成回答(带严格度控制) | sft/persona-controlled-response |
| Rewrite text in a target style (formal / casual / specific voice) · 文本改写为目标风格 | sft/style-transfer |
RAG · 检索增强
| Goal · 我想做 | Card · 用这张卡 |
|---|---|
| Score whether a retrieved passage is relevant to a query · 评估 passage 与 query 的相关性 | rag/retrieval-relevance-evaluator |
| Build multi-hop QA eval questions · 合成多跳评测题 | rag/multihop-eval-synthesizer |
| Decompose / rewrite a query for retrieval · query 改写或拆解 | rag/query-rewriting-decomposition |
| Generate hypothetical answer for HyDE retrieval · HyDE 假答生成 | rag/hyde-hypothetical-answer-generator |
| Audit whether a citation actually supports a claim · 审计 citation 是否真的支持 claim | rag/citation-faithfulness-scorer |
| Detect hallucinations in a RAG answer · 检测 RAG 答案的幻觉 | rag/answer-grounding-checker |
| Summarize a long document chunk for retrieval indexing · 给长文档块产 search-friendly summary | rag/chunk-summarizer-for-retrieval |
| Compress retrieved passages into a smaller question-tailored context · 把检索结果压缩成针对问题的小上下文 | rag/context-compression |
| Resolve a chat follow-up into a standalone retrieval query · 多轮 RAG 的代词消解器 | rag/conversational-query-resolver |
| Synthesize an answer from multiple sources, surfacing conflicts · 多源综合答案 + 冲突识别 | rag/multi-source-aggregator |
| Build structured output (table / list / record) from RAG sources · RAG 结构化输出(表/列表/字段记录) | rag/structured-rag-output-builder |
| Fuse multiple sub-query retrieval results into one ranked set · 多子查询检索结果融合 | rag/query-fusion |
| Resolve time-relative phrases into concrete time bounds · 时间相对短语解析为具体时间范围 | rag/time-aware-retrieval-rewriter |
Build / debug an agent · 搭建和调试 Agent
| Goal · 我想做 | Card · 用这张卡 |
|---|---|
| Run a ReAct loop with strict tool calls · 跑 ReAct loop,严格 tool call | agent/react-planner-with-tool-schema |
| Produce a complete plan upfront · 一次性给出完整计划 | agent/plan-and-execute-planner |
| Fix a malformed tool call from a validation error · 修复格式错误的 tool call | agent/tool-call-repair |
| Reflect on whether the trajectory is on track · 反思 agent 是否在正轨 | agent/self-critique-reflection |
| Compress a long agent trajectory into memory · 把长 trajectory 压缩成 memory | agent/long-context-memory-summarizer |
| Split a complex task across multiple specialized workers · 把复杂任务派给多个专精 agent | agent/sub-task-delegator |
| Decide whether a goal needs clarification, ask one good question · 判断是否要问澄清问题,问一个好问题 | agent/clarification-asker |
| Convert OpenAPI / Swagger spec into agent tool catalog · OpenAPI 自动转 tool catalog | agent/api-spec-to-tool-catalog |
| Decide retry / abort / escalate on operation failure · 操作失败时决定重试/放弃/升级 | agent/error-recovery-strategy |
| Plan agent execution within token / dollar budget · 在预算约束下规划 agent 执行 | agent/budget-aware-planner |
| Compress verbose tool output before adding to context · 把 tool 输出压缩后再进 context | agent/tool-output-summarizer |
| Reconcile conflicting outputs from multiple sub-agents · 多 agent 冲突调解 | agent/multi-agent-conflict-resolver |
| Translate raw API response into a user-readable answer · API 响应翻译给用户 | agent/api-result-translator |
Generate / filter training data · 训练数据生成与过滤
| Goal · 我想做 | Card · 用这张卡 |
|---|---|
| Rewrite ONE instruction into N variants · 把 1 条指令改写成 N 个变体 | sft/instruction-variant-expander |
| Generate NEW instructions from seed examples · 从种子生成新指令 | sft/self-instruct-from-seed |
| Generate a high-quality response for an instruction · 给指令生成回答 | sft/response-generator |
| Filter SFT pairs by quality (keep / review / drop) · 按质量过滤 SFT 数据 | sft/data-quality-filter |
| Produce scalar reward for one response · 给单回答打 reward 分 | rlhf/pointwise-reward-scorer |
| Critique a response against a constitution and revise · 按 constitution 批评 + 重写 | rlhf/constitutional-critique-revise |
| Generate adversarial probes for safety evaluation (defensive) · 生成防御性安全评估探针 | rlhf/red-team-prompt-generator |
| Generate multi-turn conversation SFT data · 生成多轮对话 SFT 数据 | sft/conversation-sft-pair-generator |
| Pick best K few-shot demonstrations from a candidate pool · 从样本池为目标 query 选最好的 K 个示例 | sft/few-shot-example-selector |
| Detect reward hacking patterns in RLHF responses · 检测 RLHF 训练后 reward gaming 失败模式 | rlhf/reward-hacking-detector |
| Audit whether a preference label's rationale justifies the pick · 审计偏好标签的理由是否站得住 | rlhf/preference-rationalization-judge |
| Generate code-specific SFT pairs · 生成 code SFT 训练对 | sft/code-sft-pair-generator |
| Find semantic near-duplicates in instruction set · 找语义相似指令做去重 | sft/instruction-deduplicator |
Work with images · 处理图像
| Goal · 我想做 | Card · 用这张卡 |
|---|---|
| Generate a structured caption for an image · 给图片生成结构化 caption | multimodal/structured-caption-generator |
| Verify a caption against the actual image · 核对 caption 与图像 | multimodal/vlm-image-description-verifier |
| Answer a question about an image · 视觉问答 + grounding + 置信度 | multimodal/vqa-with-confidence |
| Extract typed fields from a document image · 从文档图片抽取结构化字段 | multimodal/ocr-structured-extraction |
| Extract data from a chart / plot / table image · 从图表或表格图片抽数据 | multimodal/chart-table-extractor |
| Analyze a document page's layout (title / body / tables / figures) · 分析文档页面版式结构 | multimodal/document-layout-analyzer |
| Extract graph structure from a diagram / flowchart / architecture · 流程图/架构图转结构化数据 | multimodal/diagram-to-structured-data |
| Convert a UI screenshot into a component spec · UI 截图转组件树 spec | multimodal/screenshot-to-spec |
| Classify image into custom user-defined categories · 自定义类别图像分类 | multimodal/image-classification |
| Transcribe handwriting with per-word confidence · 手写文字转录 + 字级置信度 | multimodal/handwriting-transcriber |
| Reverse-engineer edit instruction from before/after pair · 前后图反推编辑指令 | multimodal/image-edit-instruction-generator |
| Compare two images and explain similarities / differences · 双图对比解释 | multimodal/image-comparison-explainer |
Improve reasoning quality · 提升推理质量
| Goal · 我想做 | Card · 用这张卡 |
|---|---|
| Single-pass structured reasoning with rationale · 单次结构化推理 + rationale | cot/structured-reasoning-with-rationale-summary |
| Decompose a complex problem into easier sub-problems · 把复杂问题拆成更简单的子问题 | cot/least-to-most-decomposition |
| Aggregate N sampled paths into a consensus answer · 把 N 条采样路径聚合成共识答案 | cot/self-consistency-aggregator |
| Draft + verify before committing to a final answer · 先 draft 再 verify 再交答案 | cot/verify-then-finalize |
| Explore multiple branches in parallel and prune (tree-of-thoughts) · 多分支并行探索 + 剪枝 | cot/tree-of-thoughts |
| Abstract the question into a principle first, then apply (step-back) · 先抽象到原理再代入具体题 | cot/step-back-prompting |
| Critique and revise a candidate plan before execution · 执行前对推理 plan critique + 修订 | cot/plan-critique-and-revise |
| Reasoning with explicit per-step uncertainty · 明示每步不确定度的推理 | cot/uncertainty-quantification |
| Citation-grounded reasoning (every claim must cite source) · 每条事实必须引用 source 的推理 | cot/citation-grounded-reasoning |
| Contrast against intentionally-wrong reasoning · 对照错误推理路径的反向自洽 | cot/contrastive-self-consistency |
| Process external criticism (accept / correct / reject) · 处理外部批评的 self-correction 协议 | cot/self-correction-protocol |
| Generate a meta-prompt for a class of tasks · 给一类任务生成可复用的 meta-prompt | cot/meta-prompt-generator |
Work with code · 处理代码
| Goal · 我想做 | Card · 用这张卡 |
|---|---|
| Structured code review with per-dimension findings · 按维度做结构化 code review | code/code-review-checklist |
| Generate test cases for a function · 给函数生成测试用例 | code/test-case-generator |
| Explain code at a specific audience level · 按受众层级解释代码 | code/code-explanation-generator |
| Judge whether candidate code fulfills a task · 评估候选代码是否完成任务 | code/code-eval-judge |
| Suggest concrete refactors with rationale · 提结构化重构建议 | code/refactor-suggestion |
| Translate code from one language to another · 跨语言代码翻译(含 idiom 控制) | code/code-translation |
| Focused security review with CWE-style findings · 按 threat model 做代码安全评审 | code/security-review |
| Summarize git diff into structured PR description · git diff 转结构化 PR description | code/code-summary-for-pr |
| Plan major version migration grounded in actual code · 大版本迁移阶段化规划 | code/migration-plan-generator |
| Analyze impact of changing a function / API signature · 评估函数 / API 签名改动的影响范围 | code/dependency-impact-analyzer |
| Explain a stack trace / error to a target audience · 按受众解释错误信息 | code/error-message-explainer |
| Generate a commit message from a git diff · 从 diff 生成 commit message | code/commit-message-generator |
| Review API design (REST / GraphQL / gRPC) for ergonomics · API 设计评审 | code/api-design-reviewer |
As a GitHub repository / 作为 GitHub 仓库
- Browse
INDEX.mdorprompts/<direction>/. - Open the card you want; copy the Prompt section.
- Read the Failure Modes and Tuning Notes sections — that is where the experience lives.
- Substitute
{{variable}}placeholders with your inputs.
中文流程:
- 浏览
INDEX.md或prompts/<方向>/目录。 - 打开目标卡片,复制
## Prompt段落。 - 务必读
## Failure Modes和## Tuning Notes两段——那是真正的 经验所在。 - 用你自己的输入替换
{{variable}}占位符。
As a Claude Code skill / 作为 Claude Code Skill
Install this repository as a skill so Claude Code can route user intents to the right card directly:
把本仓库当作 skill 安装,Claude Code 就能根据用户描述自动定位到对应 卡片:
git clone https://github.com/huck012428-lab/prompt-atlas ~/.claude/skills/prompt-atlasThen in Claude Code:
之后在 Claude Code 中:
You: I need a prompt to score whether a retrieved passage is relevant.
Claude: [reads SKILL.md routing tree, picks rag/retrieval-relevance-evaluator,
and adapts it to your inputs]你: 帮我写个判断 retrieved passage 相关性的 prompt。
Claude:[读取 SKILL.md 的路由树,选中 rag/retrieval-relevance-evaluator,
按你的输入做适配]The skill entry is SKILL.md.
Skill 入口在 SKILL.md。
Anatomy of a Prompt Card / 一张卡片的结构
prompts/rag/retrieval-relevance-evaluator.md
├── frontmatter / 元信息块
│ ├── id, title, version, status (identity / 身份)
│ ├── direction, tags, audience, models (discovery / 发现)
│ ├── language, input/output_schema (integration / 集成)
│ └── variables (slots / 变量槽)
└── body / 正文
├── ## Purpose 适用场景与目标
├── ## Prompt 带 {{variable}} 占位符的 prompt 主体
├── ## Example 具体的输入 → 期望输出
├── ## Failure Modes 常见失败模式与检测方法
├── ## Tuning Notes 模型差异、温度、相邻用法的调优笔记
└── ## Changelog 版本历史Full schema and controlled vocabulary: docs/SCHEMA.md.
完整 schema 与受控词汇表:docs/SCHEMA.md。
Safety / 安全立场
This repository does not accept jailbreaks, safety-bypass prompts, hidden chain-of-thought extraction techniques, harm-enabling content, or proprietary leaks. See docs/SAFETY.md. Defensive and evaluation-oriented prompts (red-team rubrics, harmlessness labelers, factuality judges) are explicitly welcome.
本仓库拒收 jailbreak、绕过安全的 prompt、套取闭源模型隐藏推理链 的 prompt、有害内容生成 prompt、私有/泄露 prompt。详见 docs/SAFETY.md。明确欢迎评估类、防御类 prompt ——红队评分、有害性标注、事实性判官等。
Contributing / 贡献流程
See CONTRIBUTING.md. Short version:
详见 CONTRIBUTING.md。简要流程:
- Copy
templates/prompt-card.mdintoprompts/<direction>/<your-slug>.md.
复制templates/prompt-card.md到prompts/<方向>/<你的-slug>.md。 - Run
python scripts/validate.pyuntil it returnsOK.
跑python scripts/validate.py直到输出OK。 - Run
python scripts/build_index.pyto refreshINDEX.md.
跑python scripts/build_index.py刷新INDEX.md。 - Open a PR using the prompt-card issue template.
用 prompt-card issue 模板开 PR。
CI runs the same validation; PRs that don't pass won't be merged.
CI 跑同一套校验;不通过的 PR 不会被合入。
License / 许可证
Dual-licensed. See LICENSE.
双许可证。详见 LICENSE。
Code (
scripts/, CI configs): MITPrompt content (
prompts/,templates/,docs/): CC-BY-4.0代码(
scripts/、CI 配置):MITPrompt 内容(
prompts/、templates/、docs/):CC-BY-4.0
Each Prompt Card carries license: CC-BY-4.0 in its frontmatter for clarity.
每张 Prompt Card 的 frontmatter 中都标注 license: CC-BY-4.0,避免混淆。
Status / 当前状态
v0.1.0 — first public release with 32 Prompt Cards. Library has since grown to 100 Prompt Cards across all 7 directions (post-v0.1 additions tracked in CHANGELOG.md). See ROADMAP.md for what's planned next. Pull requests welcome.
v0.1.0 —— 首个公开版本,32 张 Prompt Card。后续已扩到 100 张,覆盖 8 个方向(v0.1 之后的新卡见 CHANGELOG.md)。后续计划见 ROADMAP.md,欢迎 PR。