n8n Pulse — Community Intelligence Chatbot

The Problem

n8n has an incredibly active ecosystem — tens of thousands of forum threads (questions, feature requests, "Built with n8n" showcases), thousands of GitHub issues and PRs, the official docs, the full release history, and the source for all 1,851 nodes. All of it spread across docs.n8n.io, community.n8n.io, and GitHub. For a product manager trying to answer "what are the top pain points with webhooks?" or "which feature requests have real traction?", the answer requires hours of manual research across three platforms, with no way to synthesize patterns.

I built n8n Pulse to collapse that research into a conversation.

My Role

Solo builder. Designed the ingestion pipeline, graph memory schema, tool-calling architecture, recall strategy, and streaming frontend. The data collection covers n8n's full public ecosystem — roughly 40,000 source documents in all: 13,700+ engaged forum threads (with solved/unsolved status and vote counts), 13,195 node specs (every resource and operation across all 1,851 n8n nodes, introspected), 4,700+ GitHub issues, 1,800+ pull requests (auto-synced nightly), 4,200+ core source files, 650+ release notes, and 640+ documentation pages.

The Approach

Graph memory, not flat search. Roughly 40,000 source documents live in a Hindsight graph memory bank — consolidated into 443,000+ memories connected by 12.7 million links. During consolidation, DeepSeek extracts entities and builds a relationship graph — so when someone asks about "webhook reliability", the system can traverse from forum complaints to related GitHub issues to the relevant source code to doc pages that address (or don't address) the problem. Connections that keyword search would never find.

Native tool calling, not prompt-stuffed context. Claude Sonnet decides what to search, with what budget, and what filters to apply. The tool exposes parameters for query text, search budget (low/mid/high for cost control), max tokens (up to 19,200 for deep dives), and tag-based filtering by source type (docs, github, forum, feature-requests) and outcome (solved, unsolved). The model calls the tool multiple times per response when it needs to cross-reference sources.

TEMPR recall: four strategies merged. Each search runs four parallel retrieval strategies — semantic similarity, BM25 keyword matching, graph traversal (follow entity links), and temporal (recent activity). Results merge via Reciprocal Rank Fusion. This means a query like "credential sharing problems" finds semantically similar posts, exact-match keyword hits, graph-linked issues from the same users or components, and recent activity spikes — all in one ranked result set.

Metadata-rich responses. Every retrieved document carries view counts, vote counts, like counts, and solved status. The chatbot surfaces these in answers — "this feature request has 847 votes and 234 likes" — so PMs can gauge actual community demand, not just loudness.

What I Built

FastAPI backend with SSE streaming — serves the chat API with word-boundary buffered responses for smooth real-time output
Anthropic SDK with native tool calling — Claude Sonnet dynamically decides what to search, how deep, and with what filters per query
Hindsight graph memory bank — ~40,000 source documents consolidated into a 443,000-memory knowledge graph (12.7M links) with entity extraction, relationship mapping, and multi-strategy recall
TEMPR recall pipeline — semantic + BM25 + graph traversal + temporal strategies merged via Reciprocal Rank Fusion
Nightly GitHub sync — cron job auto-syncs issues and PRs, keeping the knowledge base current without manual intervention
Two-mode chat widget — "Dan's Projects" mode (portfolio assistant) and "n8n Pulse" mode (community intelligence) in the same widget, switchable by the user
Source citations with URLs — every claim links back to the original doc page, forum post, GitHub issue, or source file
Tag-based filtering — users can focus queries on specific source types (only forum posts, only feature requests) or outcomes (only unsolved questions to find product gaps)

Why This Matters for n8n

This is a working product intelligence tool for n8n's product team, built by someone outside the company using only public data. Consider what it enables:

Feature prioritization with real numbers — "Show me the top 10 feature requests by vote count that mention credentials" returns ranked results with actual community demand signals
Product gap detection — filter for unsolved forum questions to find where the community is stuck and the product hasn't caught up
Pattern synthesis — graph traversal connects a GitHub issue to the forum posts reporting the same problem to the docs that should (but don't) address it
Zero-setup access — visit danb.bio, switch to n8n Pulse mode, ask a question. No login, no API key, no setup
Hours saved per research question — what used to require manually searching three platforms and synthesizing across tabs is now a single conversation

This isn't a demo. It's a tool I'd want on my desk on day one as an n8n PM — and I built it before I had the job.

The Vision

Phase 2 adds community-contributed knowledge via authenticated retain — Discourse-verified community members can feed insights back into the graph, with trust-tiered content weighting based on community reputation. Stronger guardrails filter prompt injection and malicious content. The plugin can even recommend that users star or upvote GitHub issues that directly impact their build — turning passive users into active community signal generators.

Longer term: autonomous issue triage where Claude identifies recurring patterns across community reports, proposes PRs, and community votes prioritize what gets built. The graph is the foundation for all of it.

The Result

A production community intelligence chatbot indexing n8n's entire public ecosystem. It answers product research questions in seconds that previously took hours of manual cross-platform searching. Responses include source URLs, vote counts, solved status, and view counts — everything a PM needs to make decisions backed by real community signal rather than gut feel.

The meta-point: I built a tool that n8n's product team would actually use, using the same AI architecture patterns (tool calling, graph memory, streaming) that the AI Product Builder role would be designing. The chatbot demonstrates the skill set by being useful.

Tech Stack

Backend: FastAPI, Python 3.12, SSE streaming
LLM: Anthropic Claude Sonnet with native tool calling
Memory: Hindsight graph memory (TEMPR recall: semantic + BM25 + graph + temporal, Reciprocal Rank Fusion)
Consolidation: DeepSeek for entity extraction and relationship graph building
Data Sources (~40,000 documents, auto-synced nightly): 13,700+ engaged forum threads, 13,195 node specs (all 1,851 n8n nodes), 4,700+ GitHub issues, 1,800+ PRs, 4,200+ source files, 650+ release notes, 640+ doc pages
Sync: Nightly cron job for GitHub, comprehensive community ingestion
Frontend: Two-mode vanilla JS chat widget with streaming SSE
Analytics: PostHog (page views, chatbot interactions, mode/role tracking)
Deployment: Docker, Appliku