claudekit / tools / graphify
[ Skill · Data Engineering ]

Graphify

Turn any folder of code, SQL schemas, R scripts, shell scripts, docs, papers, images, or videos into a queryable knowledge graph. App code + database schema + infrastructure in one graph.

safishamsi/graphify ·updated
$ uv tool install graphifyy && graphify install copy

What it does

Pointing an agent at a large codebase or document pile means re-reading the same files for every question — which burns through tokens fast. Graphify scans the folder once, extracts concepts and relationships into a knowledge graph, and routes future queries through the graph. The repo claims 71.5× fewer tokens per query versus reading raw files on large corpora.

Usage examples

After install, the /graphify family of slash commands builds the graph, queries it, and pulls in external sources.

# 1. Build a graph — current folder or any specific path
/graphify .
/graphify ./raw                                       # run on a specific folder

# 2. Ask what connects two concepts
/graphify query "what connects attention to the optimizer?"

# 3. Trace the path between two nodes
/graphify path "DigestAuth" "Response"

# 4. Explain a concept in graph context
/graphify explain "SwinTransformer"

# 5. Add an external source (paper, webpage) to the graph
/graphify add https://arxiv.org/abs/1706.03762        # fetch a paper, save, update graph

The headline workflow is the Andrej Karpathy /raw folder pattern that the README cites — a single folder of papers, tweets, screenshots, and notes that graphify merges into one queryable graph. Once it’s built, “what connects attention to the optimizer?” or “trace DigestAuth to Response” run against the graph instead of re-reading raw files. That’s where the token savings come from.

Key features

  • Multimodal input

    Code, SQL schemas, R/shell scripts, PDFs, markdown, screenshots, diagrams, images, and videos — all from a single folder. Claude’s vision capabilities pull concepts and relationships out of visual material too.

  • App + DB + infrastructure in one graph

    Application code, database schemas, and infrastructure definitions land in the same graph. You can trace which functions read which tables and run on which containers in a single view.

  • Persistent graph across sessions

    Results are saved to graph.json so you can query across sessions without rebuilding. SHA256 hashing reprocesses only the files that changed.

  • Multiple export formats

    HTML interactive visualization, Obsidian vault, Wikipedia-style articles, GraphML, Neo4j Cypher, and SVG — formats that drop into other tools without conversion.

  • Auto-sync

    Watch mode and git hook integration keep the graph current as code or docs evolve, no manual rebuild step required.

  • Edge provenance

    Every edge is tagged EXTRACTED, INFERRED, or AMBIGUOUS, making the line between explicit fact and model inference visible for review.

  • MCP server mode

    Pass --mcp to expose the graph as a tool to other agents or clients — pushes the skill from solo use into multi-agent setups.

Output files

FileDescription
graph.htmlInteractive visualization — click nodes and edges to explore
GRAPH_REPORT.mdHighlights and key findings as markdown
graph.jsonFull queryable graph data

Notes

  • PyPI package is temporarily graphifyy — the proper name is being reclaimed, so the package has an extra y. The slash command stays /graphify.
  • Python 3.10+ requiredgraphify install provisions the skill on your system.
  • Edge provenance tags — areas with many INFERRED or AMBIGUOUS edges are worth reviewing manually before relying on them.
  • MIT licensed.
§ 6

See also

same category · curated
[01]
[MCP] Hugging Face · Hugging Face's official remote MCP server, hosted by Hugging Face. Search models, datasets, Spaces, papers, and docs in natural language and call Gradio Space tools from Claude Code.
tool · claudekit.io / tools / huggingface
[02]
[Skill] Last30Days · An AI agent skill that researches any topic across Reddit, X, YouTube, HN, Polymarket, and the web — then synthesizes a grounded summary. The engine scores results by actual engagement metrics (upvotes, likes, view counts, prediction market odds) rather than editorial authority, resolves entities first (handles, subreddits, GitHub repos, hashtags), runs parallel multi-source queries, merges duplicate stories across platforms, and produces cited briefs. Reddit, HN, Polymarket, and GitHub work immediately with zero configuration.
tool · claudekit.io / tools / last30days
[03]
[Tool] Firecrawl · 🔥 The API to search, scrape, and interact with the web for AI. Three integrated capabilities — Search, Scrape, Interact — exposed through one API. Open source under AGPL-3.0 and self-hostable via docker-compose, the engine also powers the firecrawl.dev cloud SaaS run by the same team.
tool · claudekit.io / tools / firecrawl
§ 7

Frequently Asked Questions

frequently asked
§ 7.1
What is Graphify?
An AI coding assistant skill that turns any folder of code, SQL schemas, R scripts, shell scripts, docs, papers, images, or videos into a queryable knowledge graph. The headline use case is putting app code, database schemas, and infrastructure in one graph.
§ 7.2
Where does it run?
Claude Code, Codex, OpenCode, Cursor, Gemini CLI, and more. Requires Python 3.10+.
§ 7.3
How do I install it?
`uv tool install graphifyy && graphify install` — pipx and pip work the same way. Note: the PyPI package is temporarily named `graphifyy` while the proper name is being reclaimed.
§ 7.4
How do I invoke it?
Run `/graphify .` to build a graph for the current folder. It outputs `graph.html` (interactive visualization), `GRAPH_REPORT.md` (highlights), and `graph.json` (full queryable graph data).
§ 7.5
Can it run as an MCP server?
Yes — pass the `--mcp` flag to expose the graph to other agents or clients as a tool, beyond the standalone skill workflow.
§ 7.6
What's the license?
MIT licensed.