Skip to content
@soliplex

Soliplex

Pydantic-AI, RAG, Ingestion, AG-UI, and Opinions

Soliplex

Soliplex is a self-hosted, AI-powered Retrieval-Augmented Generation (RAG) platform. It combines semantic document retrieval with generative AI to deliver accurate, cited responses grounded in your document collections. The ecosystem spans ingestion, indexing, retrieval, and user interface -- providing a complete pipeline from raw documents to conversational AI.

Full documentation: soliplex.github.io

Architecture

graph TB
    
    subgraph "User Interfaces"
        FE["Frontend<br/>(Flutter / Dart)"]
        TUI["Terminal UI<br/>(Textual)"]
    end


    subgraph "Backend"
        SX["Soliplex<br/>(FastAPI)"]
        MCP["MCP Server<br/>(FastMCP)"]
    end

    subgraph "AI & Retrieval"
        PAI["Pydantic AI<br/>Agents"]
        HR["Haiku RAG<br/>(LanceDB)"]
        HS["Haiku Skills"]
    end

    
    subgraph "Storage"
        LDB["LanceDB<br/>(Vectors)"]
        PG["PostgreSQL / SQLite<br/>(Metadata)"]
    end
    subgraph LLM["LLM Providers"]
        OL["Ollama"]
        OA["OpenAI"]
        GE["Gemini"]
        AN["Anthropic"]
    end
        subgraph "Ingestion Pipeline"
        IA["Ingester Agents"]
        SI["Soliplex Ingester"]
        PS["PDF Splitter"]
        DL["Docling"]
    end
    subgraph "Document Sources"
        FS["Filesystem"]
        SCM["GitHub / Gitea"]
        WD["WebDAV"]
        WEB["Web Pages"]
        S3["S3"]
    end


    FE -->|"AG-UI Protocol"| SX
    TUI --> SX
    SX --> PAI
    SX --> MCP
    PAI --> HR
    PAI --> HS
    LDB --> HR

    FS --> IA
    SCM --> IA
    WD --> IA
    WEB --> IA
    S3 --> IA
    IA -->|"Discovers & fetches"| SI
    PS -->|"Splits large PDFs"| SI
    SI -->|"Parse, chunk, embed"| DL

    PAI <-->|"Inference"| LLM

    SI --> LDB
    SI --> PG
Loading

Components

Soliplex (Backend)

The core backend service built with FastAPI and Python. Soliplex orchestrates AI agents, RAG retrieval, and real-time chat across isolated room environments.

  • Multi-room architecture with independent configurations and knowledge bases
  • Multi-provider LLM support (OpenAI, Ollama, Gemini, Anthropic)
  • OIDC/JWT authentication with Keycloak integration
  • MCP server and client for extended AI tool integration
  • AG-UI protocol for structured frontend communication
  • Quiz system with LLM-based evaluation
  • 100% unit test branch coverage

A cross-platform Flutter application that serves as the primary user interface. Runs on Android, iOS, macOS, Web, Linux, and Windows.

  • Multi-server OIDC authentication with secure credential storage
  • Room-based chat with streamed AI responses and execution step visibility
  • RAG document filtering with source citations and PDF chunk visualization
  • File uploads, interactive quizzes, and network diagnostics
  • Responsive Material Design 3 layouts for mobile through desktop
  • Modular shell architecture with composable feature modules

A multi-source document discovery and delivery system. Ingester Agents connect to diverse origins and feed documents into the ingestion pipeline.

  • Filesystem, WebDAV, GitHub, Gitea, web page, and S3 source support
  • Declarative YAML manifests with cron scheduling
  • SHA-256 content hashing for incremental sync (only changed files processed)
  • Async-first architecture with retry logic and exponential backoff
  • FastAPI REST API and Typer CLI

The document processing engine that transforms raw files into indexed, searchable vector data.

  • Five-stage pipeline: validate, parse, chunk, embed, store
  • Web UI and REST API for monitoring and managing ingestion runs
  • Async worker-based processing tested at scale (1000+ page PDFs)
  • Parameter set management for tuning chunking and embedding
  • PostgreSQL metadata storage with LanceDB vector storage

A split-process-merge pipeline for converting large PDFs into structured Docling documents through intelligent chunking and parallel processing.

  • Bookmark-aware segmentation with multiple splitting strategies
  • Parallel processing via ProcessPoolExecutor with strict memory isolation
  • Document reassembly preserving DOM structure, provenance, and page offsets
  • CLI tools for analysis, chunking, conversion, and validation

Key Libraries

Library Role
Haiku RAG RAG engine with hybrid vector + full-text search, agentic QA, and multi-provider embeddings
Haiku Skills Modular skill system for Pydantic AI agents with sub-agent execution and AG-UI streaming
Docling Document conversion and structural extraction for PDFs and other formats
Pydantic AI Agent framework with function calling, tool integration, and multi-provider LLM support
LanceDB Embedded vector database for semantic search and retrieval
FastMCP Model Context Protocol server/client for AI tool interoperability

Documentation

Full configuration guides, tutorials, and API reference are available at soliplex.github.io.

Popular repositories Loading

  1. soliplex soliplex Public

    An AI-powered Retrieval-Augmented Generation (RAG) system with a modern web interface.

    Python 12 5

  2. flutter flutter Public archive

    Flutter frontend for Soliplex

    Dart 2 3

  3. ingester ingester Public

    content ingestion pipeline that works with soliplex

    Python 1

  4. ingester-agents ingester-agents Public

    tools to support loading data into soliplex ingester

    Python 1

  5. markdown_widget markdown_widget Public

    Forked from asjqkkkk/markdown_widget

    📖Rendering markdown by flutter!Welcome for pr and issue.

    Dart 1

  6. frontend frontend Public

    Frontend for Soliplex

    Dart 1 4

Repositories

Showing 10 of 14 repositories

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…

Most used topics

Loading…