Database Schema Spec Generator

A Python package for generating unified JSON documentation files for database schemas by resolving JSON Schema references and handling oneOf variants. This tool processes modular database schema specifications and generates consolidated documentation for different database engines and versions.

User Project Structure

The generated schemas are designed to validate user projects with this structure:

my-project/
├── .bfloo/                                    # Hidden config directory (like .git)
│   ├── config.yml                            # All schemas configuration
│   ├── orders/                               # Schema: "orders"
│   │   ├── manifest.yml                      # Snapshot registry
│   │   └── 2024-01-15_v1.0.0.yml             # Snapshot files
│   ├── users/                                # Schema: "users"
│   │   └── manifest.yml
│   └── analytics/                            # Schema: "analytics"
│       └── manifest.yml
├── schemas/                                  # Custom directory (via dir: "schemas")
│   ├── orders.yml                            # Working schema for "orders"
│   └── users.yml                             # Working schema for "users"
└── db-schemas/
    └── analytics.yml                         # Working schema at root (dir omitted)

Key concepts:

Schema names are user-defined - orders, users, analytics, etc.
Flat structure - Each schema is a top-level entry (no nested hierarchy)
One manifest per schema - Each schema has its own snapshot history in .bfloo/<schema>/
Configurable working directory - Use dir to specify where <schema>.yml is stored (default: .db-schemas/)
Per-schema API keys - Each schema has its own API key for sync

🚀 Quick Start

Prerequisites

uv package manager

Installation

Clone the repository:

git clone https://github.com/Bfloo-App/database-schema-spec.git
cd database-schema-spec

Install dependencies using uv:

uv sync --frozen

Set up environment variables by creating a .env file:

cp .env.example .env
# Edit .env and set BASE_URL to your desired URL

Note: The BASE_URL environment variable is required. The application will fail to start if it's not set.

🏃‍♂️ Running the Application

Using uv (Recommended)

# Run the schema generator
uv run main.py

Using Python directly

# Activate the virtual environment first
source .venv/bin/activate  # On Unix/macOS
# or
.venv\Scripts\activate     # On Windows

# Then run
python main.py

📁 Project Structure

database-schema-spec/
├── main.py                           # Entry point
├── .env                             # Environment configuration
├── pyproject.toml                   # Project dependencies
├── database_schema_spec/            # Main package
│   ├── cli/                        # Command-line interface
│   ├── core/                       # Core functionality
│   │   ├── config.py               # Configuration management
│   │   ├── exceptions.py           # Custom exceptions
│   │   └── schemas.py              # Data models
│   ├── io/                         # Input/output handling
│   ├── logger/                     # Logging configuration
│   ├── resolution/                 # Schema resolution logic
│   └── validation/                 # Schema validation
├── docs/                           # Input schema files
│   └── schemas/
│       ├── _registry_.json         # Engine/version registry
│       ├── project/
│       │   ├── manifest.json       # Snapshot manifest schema
│       │   └── config/
│       │       ├── base.json       # Common config schema (with $defs)
│       │       └── engines/
│       │           └── postgresql.json  # PostgreSQL-specific config (references base.json)
│       └── engines/
│           └── postgresql/
│               └── v15.0/          # Version-specific schemas
│                   ├── tables.json     # Tables array schema (AI-focused)
│                   ├── snapshot/
│                   │   ├── stored.json   # Stored snapshot schema
│                   │   └── working.json  # Working snapshot schema
│                   └── components/
└── output/                         # Generated output files
    ├── smap.json                   # Schema map (discovery file)
    ├── manifest.json               # Manifest schema with $id
    ├── config/
    │   └── postgresql.json         # Fully-resolved PostgreSQL config (self-contained)
    └── postgresql/
        └── v15.0/
            ├── tables.json         # Tables array schema (AI-focused)
            └── snapshot/
                ├── stored.json     # Stored snapshot schema (CLI)
                └── working.json    # Working snapshot schema (CLI)

🧪 Development

Running Tests

# Run all tests: NOTE some default flags are already set on puproject.toml
uv run pytest

# Run specific test file example
uv run pytest tests/test_integration.py

Code Quality

# Lint code
uv run ruff check

# Format code
uv run ruff format

# Type checking
uv run pyright

Pre-commit Hooks

# Install pre-commit hooks
uv run pre-commit install

# Run pre-commit *manually* on all files
# Once pre-commit is installed it should run everytime you attempt to commit changes on the changed files
uv run pre-commit run --all-files

📝 Environment Variables

Variable	Required	Description	Example
`BASE_URL`	✅ Yes	Base URL for generated schema files	`https://api.example.com/schemas`

🔧 Configuration

The application can be configured through:

Environment Variables: Set in .env file or system environment
Configuration Constants: Defined in database_schema_spec/core/config.py

Default Paths

Input Directory: docs/schemas/ (contains source schema files)
Output Directory: output/ (generated files are written here)
Registry File: docs/schemas/_registry_.json (engine/version registry)

📤 Output

The generator creates:

Schema Map (smap.json): Discovery file mapping all available schemas
Project Schemas: Config and manifest schemas with injected $id fields
Engine Specs: Fully resolved database-specific schemas organized by engine and version

Example output structure:

output/
├── smap.json                   # Schema map for discovery
├── manifest.json               # Manifest schema
├── config/
│   └── postgresql.json         # Fully-resolved PostgreSQL config (self-contained)
└── postgresql/
    └── v15.0/
        ├── tables.json         # Tables array schema (AI-focused)
        └── snapshot/
            ├── stored.json     # Stored snapshot schema (CLI)
            └── working.json    # Working snapshot schema (CLI)

Note: Each engine config file (e.g., postgresql.json) is fully resolved with all $ref references inlined, making it completely self-contained. This eliminates the need for separate base.json and engine-specific files in the output.

Schema Map (smap.json)

The schema map provides a structured index of all generated schemas:

{
	"project": {
		"manifest": "https://example.com/schemas/manifest.json",
		"config": {
			"postgresql": "https://example.com/schemas/config/postgresql.json"
		}
	},
	"engines": {
		"postgresql": {
			"v15.0": {
				"tables": "https://example.com/schemas/postgresql/v15.0/tables.json",
				"snapshot": {
					"stored": "https://example.com/schemas/postgresql/v15.0/snapshot/stored.json",
					"working": "https://example.com/schemas/postgresql/v15.0/snapshot/working.json"
				}
			}
		}
	}
}

The config section maps engine names directly to their fully-resolved schema URLs, making it easy to fetch the appropriate config schema for any supported database engine.

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
.github		.github
database_schema_spec		database_schema_spec
docs		docs
tests		tests
.dockerignore		.dockerignore
.env.example		.env.example
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
.python-version		.python-version
Dockerfile		Dockerfile
README.md		README.md
__init__.py		__init__.py
main.py		main.py
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Database Schema Spec Generator

User Project Structure

🚀 Quick Start

Prerequisites

Installation

🏃‍♂️ Running the Application

Using uv (Recommended)

Using Python directly

📁 Project Structure

🧪 Development

Running Tests

Code Quality

Pre-commit Hooks

📝 Environment Variables

🔧 Configuration

Default Paths

📤 Output

Schema Map (smap.json)

About

Uh oh!

Releases 3

Packages

Uh oh!

Contributors 2

Uh oh!

Languages

Bfloo-App/database-schema-spec

Folders and files

Latest commit

History

Repository files navigation

Database Schema Spec Generator

User Project Structure

🚀 Quick Start

Prerequisites

Installation

🏃‍♂️ Running the Application

Using uv (Recommended)

Using Python directly

📁 Project Structure

🧪 Development

Running Tests

Code Quality

Pre-commit Hooks

📝 Environment Variables

🔧 Configuration

Default Paths

📤 Output

Schema Map (smap.json)

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases 3

Packages 0

Uh oh!

Contributors 2

Uh oh!

Languages

Packages