Core Methodology
Source Curation
Each source is added through an admin workflow and carries category, strategy, interval, and trust metadata.
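As a rough illustration, a source record with these four metadata fields might look like the sketch below. The field names, the `Strategy` values, the minutes-based interval, and the 0.0 to 1.0 trust scale are all assumptions made for the example, not Web3Seeker's actual schema.

```python
from dataclasses import dataclass
from enum import Enum


class Strategy(Enum):
    # Hypothetical strategy names, mirroring the ingestion options on this page.
    RSS = "rss"
    SITEMAP = "sitemap"
    HTML = "html"


@dataclass(frozen=True)
class Source:
    url: str
    category: str          # one of the taxonomy categories
    strategy: Strategy     # how the source is ingested
    interval_minutes: int  # assumed unit: minutes between crawls
    trust: float           # assumed scale: 0.0 (low) to 1.0 (high)

    def __post_init__(self) -> None:
        # Reject obviously invalid metadata at creation time.
        if self.interval_minutes <= 0:
            raise ValueError("interval_minutes must be positive")
        if not 0.0 <= self.trust <= 1.0:
            raise ValueError("trust must be between 0.0 and 1.0")


example = Source(
    url="https://example.com/feed.xml",
    category="Security",
    strategy=Strategy.RSS,
    interval_minutes=30,
    trust=0.8,
)
```

Validating at construction keeps malformed sources out of the crawl queue instead of failing later during ingestion.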
Ingestion Policy
Web3Seeker prefers RSS feeds, then sitemaps, and falls back to HTML crawling, with crawl guardrails and rate control.
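The priority order can be sketched as a simple fallback chain: try each strategy in turn and stop at the first one that yields items. The `ingest` function and the stub fetchers below are hypothetical stand-ins, and crawl guardrails and rate control are left out to keep the example short.

```python
from typing import Callable, List, Optional, Tuple

# A fetcher takes a source URL and returns item URLs, or None if unavailable.
Fetcher = Callable[[str], Optional[List[str]]]


def ingest(url: str, strategies: List[Tuple[str, Fetcher]]) -> Tuple[Optional[str], List[str]]:
    """Return (strategy_name, items) for the first strategy that yields items."""
    for name, fetch in strategies:
        try:
            items = fetch(url)
        except Exception:
            # A failing strategy is treated like an empty one.
            items = None
        if items:
            return name, items
    return None, []


# Stub fetchers standing in for real RSS, sitemap, and HTML clients.
def fetch_rss(url: str) -> Optional[List[str]]:
    return None  # pretend this source has no feed


def fetch_sitemap(url: str) -> Optional[List[str]]:
    return [url + "/post-1", url + "/post-2"]


def fetch_html(url: str) -> Optional[List[str]]:
    return [url + "/from-html"]


PRIORITY = [("rss", fetch_rss), ("sitemap", fetch_sitemap), ("html", fetch_html)]
used, items = ingest("https://example.com", PRIORITY)
print(used, len(items))  # prints "sitemap 2"
```

Because the stub RSS fetcher returns nothing, the chain moves on to the sitemap and never reaches the HTML fallback.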
Search Scope
Search runs only over indexed content collected from curated sources, not over the global internet.
Attribution
Every item links back to the original publisher URL. Web3Seeker is a discovery layer.
Category Taxonomy
V1 taxonomy is intentionally stable and compact. Emerging topics are represented as trends inside existing categories instead of becoming new top-level categories.
News, Chains, Protocols, Technical, Research, Security, Governance, Markets
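One way to picture "trends inside existing categories" is a fixed category set plus free-form trend labels on each item. The sketch below assumes the eight labels above are separate categories; the `Item` class, its fields, and the trend label are hypothetical.

```python
from dataclasses import dataclass, field
from typing import List

# Assumed reading of the V1 taxonomy as eight top-level categories.
CATEGORIES = frozenset({
    "News", "Chains", "Protocols", "Technical",
    "Research", "Security", "Governance", "Markets",
})


@dataclass
class Item:
    title: str
    category: str                                    # must be a fixed category
    trends: List[str] = field(default_factory=list)  # free-form trend labels

    def __post_init__(self) -> None:
        # New topics go into `trends`; the category set itself stays closed.
        if self.category not in CATEGORIES:
            raise ValueError(f"unknown category: {self.category!r}")


item = Item(title="Example headline", category="Protocols", trends=["example-trend"])
```

Keeping the category set closed means a new topic never requires a taxonomy change, only a new trend label.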
Operational Notes
- Admins can pause, update, or re-crawl sources through protected endpoints.
- Crawl health and stale-source detection are tracked operationally.
- Search index updates run as scheduled jobs and can be manually requeued.
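Stale-source detection could be as simple as comparing the time since the last successful crawl with the source's configured interval. The `is_stale` helper and its `tolerance` multiplier below are assumptions for illustration, not the rule Web3Seeker actually applies.

```python
from datetime import datetime, timedelta, timezone


def is_stale(last_success: datetime, interval: timedelta,
             now: datetime, tolerance: float = 3.0) -> bool:
    """True if no successful crawl has happened within `tolerance` intervals."""
    return now - last_success > interval * tolerance


now = datetime(2024, 1, 1, 12, 0, tzinfo=timezone.utc)
interval = timedelta(minutes=30)

# 45 minutes since last success is within 3 x 30 minutes: not stale.
fresh = is_stale(now - timedelta(minutes=45), interval, now)
# 2 hours since last success exceeds 90 minutes: stale.
stale = is_stale(now - timedelta(hours=2), interval, now)
print(fresh, stale)  # prints "False True"
```

Scaling the threshold by the source's own interval avoids flagging slow sources that are crawled infrequently by design.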