clean-web-markdown-skill

Markdown-first web retrieval skill for AI agents (Cloudflare negotiation → Jina → Firecrawl fallback)

✨ Why

Agents often waste tokens on noisy HTML. This skill prioritizes clean markdown responses so downstream summarization/RAG is cheaper and more reliable.

🧠 Strategy Chain

Cloudflare Markdown Negotiation (Accept: text/markdown)
Jina Reader (https://r.jina.ai/<url>)
Firecrawl (/v1/scrape, markdown format)

🚀 Quick Start

python3 scripts/fetch_markdown.py "https://example.com/blog/post"

中文使用示例

# 抓网页正文并输出干净 Markdown
python3 scripts/fetch_markdown.py "https://example.com/文章"

# 强制走 Jina
python3 scripts/fetch_markdown.py "https://example.com" --strategy jina

Force provider:

python3 scripts/fetch_markdown.py "https://example.com" --strategy jina
python3 scripts/fetch_markdown.py "https://example.com" --strategy firecrawl --firecrawl-api-key "$FIRECRAWL_API_KEY"

📦 Output

{
  "ok": true,
  "strategy": "jina",
  "url": "https://example.com",
  "markdown": "# Title ..."
}

🎯 Trigger Phrases (EN + 中文)

Use this skill when user requests look like:

fetch/read/summarize this page as markdown
clean this URL before summarizing
抓网页正文
提取网页 Markdown
网页转 Markdown
读取网页并总结
这个链接帮我清洗一下

🧪 Tests

python3 -m unittest tests/test_fetch_markdown.py

🧩 OpenClaw Skill

Skill entry: SKILL.md
Script: scripts/fetch_markdown.py
Strategy reference: references/strategy-matrix.md

📄 License

MIT

Changelog

2026-03-11: Skill audit upgrade — normalized SKILL.md frontmatter and revalidated trigger wording/lint compatibility with OpenClaw.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
references		references
scripts		scripts
tests		tests
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
SKILL.md		SKILL.md
SPEC.yaml		SPEC.yaml
findings.md		findings.md
progress.md		progress.md
task_plan.md		task_plan.md
triadev-project.json		triadev-project.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

clean-web-markdown-skill

✨ Why

🧠 Strategy Chain

🚀 Quick Start

中文使用示例

📦 Output

🎯 Trigger Phrases (EN + 中文)

🧪 Tests

🧩 OpenClaw Skill

📄 License

Changelog

About

Uh oh!

Releases 3

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

clean-web-markdown-skill

✨ Why

🧠 Strategy Chain

🚀 Quick Start

中文使用示例

📦 Output

🎯 Trigger Phrases (EN + 中文)

🧪 Tests

🧩 OpenClaw Skill

📄 License

Changelog

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 3

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages