Summary
Add support for indexing and searching .org files (org-mode format) alongside markdown files.
Motivation
Currently datacortex only indexes markdown files (zettel, page, journal, etc.). This means GTD tasks in org/inbox.org and org/next_actions.org are not searchable via /search or the knowledge graph.
Being able to search across both knowledge (markdown) and tasks (org) would enable queries like:
- "What tasks mention the Dubai pilot?"
- "Show me all TODO items related to fundraising"
- "Find blocked tasks from last month"
Current State
-- File types currently indexed:
SELECT DISTINCT type FROM files;
-- active, agent, clipping, journal, page, readme, research, zettel
-- Org files indexed:
SELECT COUNT(*) FROM files WHERE path LIKE '%.org';
-- 0
Proposed Solution
- Add an org parser to datacortex/indexer/ that extracts:
  - Headlines (as searchable titles)
  - TODO state (TODO, NEXT, WAITING, DONE)
  - Tags (:tag1:tag2:)
  - Properties (:PROPERTIES: drawer)
  - Body content
  - Timestamps
- New node types: task, project, heading
- Link extraction: Parse [[wiki-links]] and file: links to build graph connections
- Embedding: Embed task headlines + context for semantic search
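To make the extraction steps above concrete, here is a minimal sketch of what such a parser might look like. It is not the existing datacortex or datacore code: the names `OrgNode`, `parse_org`, and `node_type` are hypothetical, the TODO keywords are taken from the list above, and the rule "a headline with a TODO state is a task, otherwise a heading" is an assumption. Timestamps and project detection are left out.

```python
from __future__ import annotations

import re
from dataclasses import dataclass, field

# TODO keywords from the proposal; a real parser might read them from config.
TODO_STATES = ("TODO", "NEXT", "WAITING", "DONE")

# "* TODO Title :tag1:tag2:" -> stars, optional state, title, optional tags.
HEADLINE_RE = re.compile(
    r"^(?P<stars>\*+)\s+"
    r"(?:(?P<state>" + "|".join(TODO_STATES) + r")\s+)?"
    r"(?P<title>.*?)"
    r"(?:\s+(?P<tags>:[\w@#%:]+:))?\s*$"
)
# ":KEY: value" lines inside a :PROPERTIES: drawer.
PROPERTY_RE = re.compile(r"^\s*:(?P<key>[\w-]+):\s*(?P<value>.*)$")
# "[[target]]" or "[[target][description]]"; captures the target only.
LINK_RE = re.compile(r"\[\[([^\]]+)\](?:\[[^\]]*\])?\]")


@dataclass
class OrgNode:
    """One org headline plus the content that follows it (hypothetical type)."""
    level: int
    title: str
    state: str | None = None
    tags: list[str] = field(default_factory=list)
    properties: dict[str, str] = field(default_factory=dict)
    body: list[str] = field(default_factory=list)
    links: list[str] = field(default_factory=list)


def node_type(node: OrgNode) -> str:
    # Assumption: any headline carrying a TODO state is indexed as a task.
    return "task" if node.state else "heading"


def parse_org(text: str) -> list[OrgNode]:
    """Split org text into a flat list of headline nodes."""
    nodes: list[OrgNode] = []
    current: OrgNode | None = None
    in_drawer = False
    for line in text.splitlines():
        m = HEADLINE_RE.match(line)
        if m:
            tags = [t for t in (m.group("tags") or "").split(":") if t]
            current = OrgNode(
                level=len(m.group("stars")),
                title=m.group("title").strip(),
                state=m.group("state"),
                tags=tags,
            )
            current.links.extend(LINK_RE.findall(current.title))
            nodes.append(current)
            in_drawer = False
            continue
        if current is None:
            continue  # text before the first headline is ignored in this sketch
        stripped = line.strip()
        if stripped == ":PROPERTIES:":
            in_drawer = True
        elif stripped == ":END:":
            in_drawer = False
        elif in_drawer:
            p = PROPERTY_RE.match(line)
            if p:
                current.properties[p.group("key")] = p.group("value").strip()
        else:
            current.body.append(line)
            current.links.extend(LINK_RE.findall(line))
    return nodes
```

Each returned node carries the fields the indexer would need: `title` for the searchable headline, `state` and `tags` for filtering, and `links` as raw targets (wiki-link names or `file:` paths) that a later step could resolve into graph edges.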
Example Use Cases
# Find tasks about a topic
datacortex search "dubai pilot" --type task
# Search across everything
datacortex search "fundraising deadline"
References
- ~/.datacore/lib/org_parser.py
🤖 Generated with Claude Code