📝 Walkthrough

Introduced a comprehensive Bulldozer table storage and query system for the backend, including a PostgreSQL-backed hierarchical storage engine; multiple table declaration factories (stored, groupBy, map, flatMap, filter, limit, concat, sort, lFold, leftJoin); SQL execution utilities; a web-based Bulldozer Studio UI; integration, fuzz, and performance tests; and supporting documentation.
Sequence Diagram(s)

```mermaid
sequenceDiagram
    participant UI as Bulldozer Studio UI
    participant API as Backend API
    participant PG as PostgreSQL
    participant BSE as BulldozerStorageEngine<br/>Table
    UI->>API: GET /api/schema
    API->>API: Build table registry from<br/>exampleFungibleLedgerSchema
    API->>PG: Query BSE for all table metadata<br/>& dependencies
    PG->>PG: Retrieve keyPath hierarchy
    PG-->>API: Table snapshot data
    API->>API: Compute ELK graph layout
    API-->>UI: JSON schema + graph layout
    UI->>UI: Render interactive dependency graph
    rect rgba(100, 150, 255, 0.5)
        UI->>API: POST /api/table/:tableId/set-row
        API->>PG: BEGIN transaction
        API->>PG: Execute row-change triggers<br/>(registerRowChangeTrigger callbacks)
        PG->>PG: Update BulldozerStorageEngine<br/>keyPath entries
        PG->>PG: Propagate changes through<br/>downstream tables (stored→group→map→...)
        PG->>PG: COMMIT with advisory lock
        PG-->>API: Success
        API-->>UI: Updated row
    end
    UI->>API: GET /api/raw/node?path=...
    API->>PG: Query BSE for keyPath node<br/>& children enumeration
    PG-->>API: { value, children }
    API-->>UI: Raw storage tree node
```
Estimated code review effort: 🎯 5 (Critical) | ⏱️ ~120 minutes
🚥 Pre-merge checks: ✅ 1 | ❌ 2

❌ Failed checks (1 warning, 1 inconclusive)
✅ Passed checks (1 passed)
Pull request overview
Introduces “Bulldozer DB” as a PostgreSQL-backed, materialized-operator system (stored/group/map/filter/limit/concat/sort/lfold/left-join), including schema/migration support and local developer tooling (Bulldozer Studio).
Changes:
- Add Bulldozer DB SQL-builder utilities plus multiple table-operator implementations and executors.
- Add Prisma schema + migration for `BulldozerStorageEngine`, plus real-Postgres performance testing and an example schema.
- Wire up local dev UX: Launchpad entry, backend dev script to run Bulldozer Studio, and the new `elkjs` dependency.
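As intuition for how these operators chain (an in-memory TypeScript analogue with simplified stand-in names; the PR's real tables are materialized in PostgreSQL and updated incrementally via triggers, so this is only a sketch of the composition shape):

```typescript
// In-memory stand-ins for the declare* factories; real Bulldozer tables
// are materialized in PostgreSQL and updated incrementally via triggers.
type Row = { id: string, data: Record<string, unknown> };
type MiniTable = { rows: () => Row[] };

const declareStored = (rows: Row[]): MiniTable => ({ rows: () => rows });
const declareFilter = (from: MiniTable, pred: (r: Row) => boolean): MiniTable => ({
  rows: () => from.rows().filter(pred),
});
const declareMap = (from: MiniTable, fn: (r: Row) => Row): MiniTable => ({
  rows: () => from.rows().map(fn),
});

// stored → filter → map, mirroring a chain of declared tables
const ledger = declareStored([
  { id: "1", data: { amount: 10 } },
  { id: "2", data: { amount: -5 } },
]);
const positiveEntries = declareFilter(ledger, r => (r.data.amount as number) > 0);
const doubledEntries = declareMap(positiveEntries, r => ({
  ...r,
  data: { amount: (r.data.amount as number) * 2 },
}));
```

Each derived "table" lazily re-reads its input, whereas the PR's operators persist their output rows and update them via change triggers.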
Reviewed changes
Copilot reviewed 28 out of 31 changed files in this pull request and generated 4 comments.
Show a summary per file
| File | Description |
|---|---|
| pnpm-lock.yaml | Adds elkjs and bumps Supabase packages; lockfile updates. |
| claude/CLAUDE-KNOWLEDGE.md | Documents Bulldozer DB operational/perf findings (JIT + execution mode semantics). |
| apps/dev-launchpad/public/index.html | Adds a “Bulldozer Studio” service card/port. |
| apps/backend/src/prisma-client.tsx | Switches getStackServerApp to a lazy dynamic import for Neon connection lookup. |
| apps/backend/src/lib/bulldozer/db/utilities.ts | Adds SQL templating helpers, quoting helpers, and path builders for Bulldozer storage. |
| apps/backend/src/lib/bulldozer/db/tables/stored-table.ts | Implements base stored table with set/delete and trigger fanout. |
| apps/backend/src/lib/bulldozer/db/tables/sort-table.ts | Implements treap-backed sort table with temp PL/pgSQL helpers. |
| apps/backend/src/lib/bulldozer/db/tables/map-table.ts | Implements map via nested flatMap. |
| apps/backend/src/lib/bulldozer/db/tables/limit-table.ts | Implements per-group limit materialization + incremental recomputation. |
| apps/backend/src/lib/bulldozer/db/tables/left-join-table.ts | Implements left join materialization with per-side triggers. |
| apps/backend/src/lib/bulldozer/db/tables/l-fold-table.ts | Implements materialized left-fold with incremental suffix recompute. |
| apps/backend/src/lib/bulldozer/db/tables/group-by-table.ts | Implements grouping by computed groupKey. |
| apps/backend/src/lib/bulldozer/db/tables/flat-map-table.ts | Implements flatMap expansion into multiple output rows per source row. |
| apps/backend/src/lib/bulldozer/db/tables/filter-table.ts | Implements filter via nested flatMap. |
| apps/backend/src/lib/bulldozer/db/tables/concat-table.ts | Implements virtual concatenation of multiple input tables. |
| apps/backend/src/lib/bulldozer/db/table-type.ts | Defines the core Table type used by operator implementations. |
| apps/backend/src/lib/bulldozer/db/index.ts | Exposes operator constructors and SQL execution helpers (CTE vs sequential executor). |
| apps/backend/src/lib/bulldozer/db/index.perf.test.ts | Adds real-Postgres performance regression/load tests. |
| apps/backend/src/lib/bulldozer/db/example-schema.ts | Adds a composed example schema demonstrating operators in combination. |
| apps/backend/src/lib/bulldozer/db/bulldozer-sort-helpers-sql.ts | Adds temp SQL/PLpgSQL helpers used by sort table operations. |
| apps/backend/scripts/run-cron-jobs.ts | Adds an initial startup delay before beginning cron loops. |
| apps/backend/prisma/schema.prisma | Adds BulldozerStorageEngine model and minor formatting adjustments. |
| apps/backend/prisma/migrations/20260323120000_add_bulldozer_data/tests/ltree-queries.ts | Adds post-migration validation for keyPath/keyPathParent behavior and indexes/FK. |
| apps/backend/prisma/migrations/20260323120000_add_bulldozer_data/migration.sql | Creates BulldozerStorageEngine, seeds root rows, and adds index. |
| apps/backend/package.json | Adds elkjs, adds run-bulldozer-studio script, and wires it into pnpm dev. |
| AGENTS.md | Updates lint guidance (including pnpm -C <package> lint) and adds an implementation tradeoffs note. |
| .vscode/settings.json | Adjusts cSpell word list entries. |
Files not reviewed (1)
- pnpm-lock.yaml: Language not supported
```sql
PERFORM pg_temp.bulldozer_sort_ensure_group(groups_path, current_group_key);
EXECUTE format(
  'SELECT array_agg(jsonb_build_object(''rowIdentifier'', "rowIdentifier", ''rowSortKey'', "rowSortKey", ''rowData'', "rowData") ORDER BY "rowSortKey" ASC, "rowIdentifier" ASC) FROM %I WHERE "groupKey" IS NOT DISTINCT FROM $1',
  source_table_name
```
Bulk init ordering inconsistent with custom sort comparison
Medium Severity
bulldozer_sort_bulk_init_from_table hardcodes ORDER BY "rowSortKey" ASC, "rowIdentifier" ASC using default JSONB comparison to build the initial balanced tree. However, the incremental treap operations (find_predecessor, find_successor, split, delete_recursive) all use the custom compare_sort_keys_sql parameter. If a sort table's compareSortKeys function defines an ordering that differs from default JSONB comparison (e.g., descending sort), the bulk-initialized tree will have incorrect BST ordering, causing subsequent inserts, deletes, and traversals to produce wrong results or corrupt the structure.
Additional Locations (1)
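The failure mode can be reproduced in miniature: the order a default ascending bulk load produces diverges from the order a custom descending comparator expects, so a tree built from the former violates the invariants the incremental operations assume (TypeScript analogue of the two orderings, not the PL/pgSQL itself):

```typescript
// The bulk loader sorts with the default ascending comparison, while the
// incremental treap operations consult a custom comparator. With a
// descending comparator the two orderings disagree, so the bulk-built
// tree violates the BST invariant the incremental code assumes.
const rows = [3, 1, 2];

const bulkOrder = [...rows].sort((a, b) => a - b); // default ascending

const compareSortKeys = (a: number, b: number): number => b - a; // custom: descending
const incrementalOrder = [...rows].sort(compareSortKeys);

const orderingsAgree = bulkOrder.every((v, i) => v === incrementalOrder[i]);
```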
Greptile Summary

This PR introduces "Bulldozer DB" — a query engine that materializes incremental, view-like computed tables as persistent rows in a single `BulldozerStorageEngine` table.
Confidence Score: 5/5

Safe to merge — all remaining findings are P2 style/maintenance issues in a dev-only studio script plus a type-duplication concern; no production logic is broken. The core engine logic (CTE executor, incremental trigger chains, path-hierarchy FK, advisory lock, JIT disable) is well reasoned and thoroughly tested. The only actionable issues are a duplicate `Table` type definition that could drift over time, use of `Record` instead of `Map` for the ELK positions object in the studio script (a rule violation), and unsafe `innerHTML` interpolation in the studio detail panel — all P2 and confined to a developer-only tool. No data-integrity, security, or correctness bugs were found in the production code paths.

Files needing attention: `apps/backend/scripts/run-bulldozer-studio.ts` (`Record` with dynamic keys, `innerHTML` interpolation) and `apps/backend/src/lib/bulldozer/db/index.ts` (duplicate `Table` type definition).
Flowchart

```mermaid
%%{init: {'theme': 'neutral'}}%%
flowchart TD
    A["declareStoredTable\n(setRow / deleteRow)"] -->|"registerRowChangeTrigger\n(changesTable)"| B["declareGroupByTable\n/ declareFilterTable\n/ declareMapTable\n/ declareFlatMapTable\n/ declareLimitTable\n/ declareConcatTable"]
    B -->|"registerRowChangeTrigger"| C["declareSortTable\n(treap via pg_temp helpers)"]
    C -->|"registerRowChangeTrigger"| D["declareLFoldTable\n(suffix recompute)"]
    B -->|"registerRowChangeTrigger"| E["declareLeftJoinTable\n(recompute affected groups)"]
    subgraph Execution ["SQL Execution"]
        F{"requiresSortHelpers?\n(pg_temp.bulldozer_sort_)"}
        F -->|"No"| G["Giant CTE executor\nWITH cte1 AS (...), cte2 AS (...)\nSELECT 1;"]
        F -->|"Yes"| H["Sequential executor\nCREATE TEMP TABLE ... ON COMMIT DROP\nper statement with outputName"]
    end
    subgraph Storage ["BulldozerStorageEngine (single table)"]
        I["keyPath: jsonb[] — tree node address\nkeyPathParent: jsonb[] GENERATED (cascade FK)\nvalue: jsonb"]
    end
    G --> Storage
    H --> Storage
    subgraph Transaction ["toExecutableSqlTransaction"]
        J["BEGIN\nSET LOCAL jit = off\nSELECT pg_advisory_xact_lock(7857391)\n... statements ...\nCOMMIT"]
    end
```
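The storage subgraph implies each node's parent address is derivable from its own. A sketch of that derivation (assuming, as the GENERATED column suggests, that the parent path is the key path minus its last element; this mirrors the diagram, not the actual SQL):

```typescript
// Assumed derivation of the GENERATED keyPathParent column: the parent's
// address is the node's keyPath without its final element; the root has
// no parent. This mirrors the storage subgraph, not the actual SQL.
type Json = string | number | boolean | null | Json[] | { [k: string]: Json };

function keyPathParent(keyPath: Json[]): Json[] | null {
  if (keyPath.length === 0) return null; // root node
  return keyPath.slice(0, -1);
}
```

Under this scheme, a foreign key from `keyPathParent` to `keyPath` lets a parent deletion cascade to the whole subtree.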
Reviews (1): Last reviewed commit: "Merge branch 'dev' into bulldozer-db"
Actionable comments posted: 13
🧹 Nitpick comments (7)
apps/backend/prisma/schema.prisma (1)
1250-1258: Consider adding `createdAt`/`updatedAt` timestamps for observability.

Most models in this schema include `createdAt` and `updatedAt` fields. If omitted intentionally for performance reasons (high-frequency writes), that's acceptable—just flagging for consistency with other models.

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `apps/backend/prisma/schema.prisma` around lines 1250-1258: the BulldozerStorageEngine model is missing the createdAt/updatedAt timestamps used elsewhere for observability; add `createdAt DateTime @default(now())` and `updatedAt DateTime @updatedAt` fields to the BulldozerStorageEngine model so new records record their creation time and updates auto-update, or, if the omission is intentional for performance, add a comment in the model noting that rationale instead of leaving them out.
apps/backend/src/lib/bulldozer/db/index.ts (1)

11-36: Keep `Table` in one place.

`apps/backend/src/lib/bulldozer/db/table-type.ts` already defines this public interface. Duplicating it here gives the module two sources of truth, which makes it easy for the copies to drift the next time the contract changes. Re-export the canonical type instead.

♻️ Suggested cleanup

```diff
-import type { Json, RowData, RowIdentifier, SqlExpression, SqlQuery, SqlStatement, TableId } from "./utilities";
+import type { SqlQuery, SqlStatement } from "./utilities";
 import { quoteSqlIdentifier } from "./utilities";
+export type { Table } from "./table-type";

-// ====== Table implementations ======
-// IMPORTANT NOTE: For every new table implementation, we should also add tests (unit, fuzzing, & perf; including an entry in the "hundreds of thousands" perf test), an example in the example schema, and support in Bulldozer Studio.
-
-export type Table<GK extends Json, SK extends Json, RD extends RowData> = {
-  tableId: TableId,
-  inputTables: Table<any, any, any>[],
-  debugArgs: Record<string, unknown>,
-  ...
-};
+// ====== Table implementations ======
+// IMPORTANT NOTE: For every new table implementation, we should also add tests (unit, fuzzing, & perf; including an entry in the "hundreds of thousands" perf test), an example in the example schema, and support in Bulldozer Studio.
```

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `apps/backend/src/lib/bulldozer/db/index.ts` around lines 11-36: the `Table` type is duplicated here; remove the local definition and re-export the canonical interface from `table-type.ts` instead, so there is a single source of truth. Update any local references in this file to use the re-exported `Table` type (keeping function names like listGroups, listRowsInGroup, compareGroupKeys, compareSortKeys, init, delete, isInitialized, and registerRowChangeTrigger unchanged), and ensure the exports keep the same shape so other modules importing `Table` from this file keep working.
apps/backend/src/lib/bulldozer/db/tables/concat-table.ts (1)

32: `rawExpression` bypasses SQL template safety - use with caution.

This helper creates an `SqlExpression` from a raw string without the template-literal safety checks. While necessary for programmatic SQL building, it shifts injection-safety responsibility to callers.

Consider adding a brief comment noting that callers must ensure the input is safe, or rename it to `unsafeRawExpression` to make the contract explicit.

Suggested documentation

```diff
-  const rawExpression = <T>(sql: string): SqlExpression<T> => ({ type: "expression", sql });
+  // SAFETY: Callers must ensure `sql` is sanitized or derived from trusted sources
+  const rawExpression = <T>(sql: string): SqlExpression<T> => ({ type: "expression", sql });
```

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `apps/backend/src/lib/bulldozer/db/tables/concat-table.ts` at line 32: `rawExpression` constructs an `SqlExpression<T>` from an arbitrary string and thus bypasses template-literal SQL safety; either rename it to `unsafeRawExpression` to make the unsafe contract explicit, or add a clear inline comment stating that callers are responsible for sanitizing inputs to avoid SQL injection, and audit the existing callers of `rawExpression` for safety.
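The contrast the reviewer asks to make explicit can be sketched as follows (assumed semantics for the template tag, with the generic parameter elided; this is not the PR's actual implementation):

```typescript
// Sketch only: a template tag that quotes interpolated strings, next to
// the suggested `unsafeRawExpression` escape hatch, whose name warns
// callers that no quoting happens.
type SqlExpression = { type: "expression", sql: string };

// Tagged template: interpolated strings are quoted with single-quote doubling.
const sqlExpression = (parts: TemplateStringsArray, ...values: string[]): SqlExpression => ({
  type: "expression",
  sql: parts.reduce(
    (acc, part, i) => acc + part + (i < values.length ? `'${values[i].replace(/'/g, "''")}'` : ""),
    "",
  ),
});

// Escape hatch: no quoting at all; callers must sanitize the input.
const unsafeRawExpression = (sql: string): SqlExpression => ({ type: "expression", sql });

const safe = sqlExpression`SELECT ${"O'Brien"}`;
const raw = unsafeRawExpression(`SELECT 'O''Brien'`);
```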
apps/backend/src/lib/bulldozer/db/tables/flat-map-table.ts (2)

213-214: Minor: extra whitespace in the compareSortKeys expression.

The expression sqlExpression` 0 ` has leading and trailing spaces. This is functionally harmless in SQL but inconsistent with concat-table.ts line 165, which uses sqlExpression`0`.

Suggested fix for consistency

```diff
-    compareSortKeys: (a, b) => sqlExpression` 0 `,
+    compareSortKeys: (a, b) => sqlExpression`0`,
```

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `apps/backend/src/lib/bulldozer/db/tables/flat-map-table.ts` around lines 213-214: the compareSortKeys lambda uses sqlExpression` 0 ` with extra whitespace; change it to sqlExpression`0` so it matches the style used in concat-table.ts.
29-30: Position-based row identifiers may cause unnecessary churn.

The expanded row identifier "sourceId:flatIndex" uses the ordinal position from WITH ORDINALITY. If the mapper output shifts (e.g., [A,B,C] → [A,X,B,C]), rows at indices 2+ get new identifiers even when their content is unchanged, which triggers downstream change propagation for stable content.

This may be intentional for simplicity, but if stable content tracking matters, consider content-based deduplication or stable keys emitted by the mapper.

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `apps/backend/src/lib/bulldozer/db/tables/flat-map-table.ts` around lines 29-30: createExpandedRowIdentifier builds identifiers by concatenating sourceRowIdentifier with the ordinality-based flatIndex (from WITH ORDINALITY), which churns identifiers whenever the mapper output shifts. Derive stable identifiers from content instead: compute a deterministic hash of the mapped element (or use a stable key field provided by the mapper), combine it with sourceRowIdentifier, and update createExpandedRowIdentifier and its callers to accept that content-based SqlExpression rather than the ordinal.
apps/backend/src/lib/bulldozer/db/utilities.ts (2)

40-42: Consider additional character escaping for edge cases.

The single-quote escaping is correct for standard PostgreSQL string literals. However, null bytes (\0) and certain Unicode sequences could cause issues in some configurations. For a storage engine that may handle arbitrary user data, consider whether additional sanitization is warranted.

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `apps/backend/src/lib/bulldozer/db/utilities.ts` around lines 40-42: quoteSqlStringLiteral only escapes single quotes, which misses edge cases like null bytes and problematic Unicode. Update it to reject or encode null bytes (e.g., throw on '\0'), and optionally normalize input (e.g., NFC) and validate noncharacter code points and surrogate pairs, while keeping the existing single-quote doubling logic and the unchanged signature returning SqlExpression<string>.
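A sketch of the suggested hardening, keeping the single-quote doubling and adding null-byte rejection (assumed shape; the PR's helper returns an SqlExpression wrapper rather than a bare string):

```typescript
// Single-quote doubling as described in the finding, extended with the
// suggested null-byte rejection. Simplified to return a bare string.
function quoteSqlStringLiteral(value: string): string {
  if (value.includes("\0")) {
    throw new Error("null bytes cannot appear in a PostgreSQL string literal");
  }
  return `'${value.replace(/'/g, "''")}'`;
}
```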
78-87: Add a docstring clarifying the null sort key semantics.

This function lacks documentation explaining its behavior. When either bound is exclusive and not unbounded, it returns false (no matches), which is mathematically correct (null cannot satisfy `> x` or `< x`) but may surprise callers. A docstring explaining this edge case would clarify the intentional design.

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `apps/backend/src/lib/bulldozer/db/utilities.ts` around lines 78-87: add a docstring to singleNullSortKeyRangePredicate explaining that it computes whether a NULL sort key can satisfy the provided range. Document the options parameters (start, end, startInclusive, endInclusive), state that the literal bounds "start" and "end" mean unbounded, explicitly call out that any exclusive, non-unbounded bound makes the function return false (NULL cannot satisfy strict > or < comparisons), and mention that the returned SqlExpression<boolean> is an always-true or always-false SQL literal.
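The documented edge case reduces to a small boolean rule. A simplified TypeScript analogue capturing just the exclusivity behavior described above (the real function also considers the bound values themselves, so this is an approximation):

```typescript
// Whether a NULL sort key can satisfy a range: each bound must be either
// unbounded ("start"/"end") or inclusive; any strict bound excludes NULL,
// since NULL cannot satisfy > x or < x.
type Bound = "start" | "end" | { inclusive: boolean };

function nullKeyMatchesRange(startBound: Bound, endBound: Bound): boolean {
  const boundAllowsNull = (b: Bound, unbounded: "start" | "end"): boolean =>
    b === unbounded || (typeof b === "object" && b.inclusive);
  return boundAllowsNull(startBound, "start") && boundAllowsNull(endBound, "end");
}
```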
🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.
Inline comments:
In `@apps/backend/scripts/run-bulldozer-studio.ts`:
- Around line 2038-2046: Reject raw delete requests that target the reserved
root paths by validating pathSegments before running retryTransaction: if
pathSegments is an empty array or exactly ["table"] then return a client error
(400) / throw an appropriate API error and do not call retryTransaction or
tx.$executeRawUnsafe. Add this check immediately after pathSegments is extracted
(the variable named pathSegments in the POST "/api/raw/delete" handler) and
include a clear error message; leave all existing DB deletion logic
(keyPathSqlLiteral, retryTransaction, globalPrismaClient, tx.$executeRawUnsafe)
unchanged otherwise.
- Around line 2022-2046: Both /api/raw/upsert and /api/raw/delete perform direct
mutations on "BulldozerStorageEngine" without acquiring the same advisory lock
used by executeStatements(), allowing interleaved tree edits; update both
request handlers to acquire the same Bulldozer advisory lock before running the
retryTransaction/globalPrismaClient transaction (i.e., call the same lock
acquisition used by executeStatements() at the start of the transaction scope),
then perform the existing tx.$executeRawUnsafe(...) logic (which uses
keyPathSqlLiteral and quoteSqlJsonbLiteral) and release automatically at
transaction end so raw upserts/deletes are serialized with other tree mutations.
- Around line 136-141: The call in executeStatements currently passes the whole
multi-statement script from toExecutableSqlStatements() into
tx.$executeRawUnsafe which fails because Prisma accepts a single statement per
call; change executeStatements to split the script returned by
toExecutableSqlStatements() into individual SQL statements (e.g., by
semicolon-aware parsing or the existing splitter utility) and then run each
statement sequentially inside the retryTransaction using tx.$executeRawUnsafe
for each one (keeping the SET LOCAL jit and SELECT pg_advisory_xact_lock calls
as individual executions); alternatively, route the full script through a driver
that supports multi-statement scripts if present, but prefer per-statement
execution to fix failures when pg_temp.bulldozer_sort_* is involved.
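The per-statement execution this prompt asks for needs a splitter that respects string literals and dollar-quoted PL/pgSQL bodies. A naive TypeScript sketch (illustrative only; a production splitter must also handle tagged dollar quotes like $fn$, SQL comments, and other lexical cases):

```typescript
// Naive semicolon-aware splitter of the kind the fix describes. Real
// Bulldozer scripts contain PL/pgSQL with $$-quoted bodies and quoted
// strings, so the splitter must not split inside either. This sketch
// ignores tagged dollar quotes ($fn$ ... $fn$) and SQL comments.
function splitSqlStatements(script: string): string[] {
  const statements: string[] = [];
  let current = "";
  let inString = false; // inside '...'
  let inDollar = false; // inside $$ ... $$
  for (let i = 0; i < script.length; i++) {
    const ch = script[i];
    if (!inDollar && ch === "'") {
      inString = !inString;
      current += ch;
      continue;
    }
    if (!inString && ch === "$" && script[i + 1] === "$") {
      inDollar = !inDollar;
      current += "$$";
      i++; // consume both dollar signs
      continue;
    }
    if (ch === ";" && !inString && !inDollar) {
      if (current.trim()) statements.push(current.trim());
      current = "";
    } else {
      current += ch;
    }
  }
  if (current.trim()) statements.push(current.trim());
  return statements;
}
```

Each returned statement could then be passed to `tx.$executeRawUnsafe` individually, keeping the `SET LOCAL jit` and advisory-lock `SELECT` as separate executions.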
In `@apps/backend/scripts/run-cron-jobs.ts`:
- Line 28: The inline comment for the wait call is misleading: update the
comment next to the await wait(30_000) call in run-cron-jobs.ts so it reflects
the actual delay (e.g., "Wait 30 seconds to make sure the server is fully
started") or, if the intent was a shorter pause, change the numeric literal
(30_000) to the intended duration (e.g., 3_000) and keep the comment consistent;
ensure the comment and the await wait(...) value match.
In `@apps/backend/src/lib/bulldozer/db/bulldozer-sort-helpers-sql.ts`:
- Around line 573-616: bulldozer_sort_bulk_init_from_table currently hard-codes
ORDER BY "rowSortKey" ASC which conflicts with custom comparators; update the
function signature to accept compare_sort_keys_sql and use it in the dynamic
SELECT (instead of the fixed 'ORDER BY "rowSortKey" ASC, "rowIdentifier" ASC')
so ordered_rows respects the provided comparator, or alternatively change the
implementation to build the initial tree by iterating rows and calling
pg_temp.bulldozer_sort_insert for each row (keeping
pg_temp.bulldozer_sort_ensure_group, pg_temp.bulldozer_sort_build_balanced_group
and pg_temp.bulldozer_sort_put_group_metadata interactions intact) so ordering
logic matches bulldozer_sort_insert/bulldozer_sort_delete.
In `@apps/backend/src/lib/bulldozer/db/example-schema.ts`:
- Around line 103-108: The filter used to build accountEntriesWithCounterparty
currently uses the JSON operator -> which leaves JSON null values as a
non-SQL-NULL and lets rows with counterparty: null through; update the predicate
in the declareFilterTable call (accountEntriesWithCounterparty, fromTable
entriesByAccount) to use the text operator ->> instead (e.g.
("rowData"->>'counterparty') IS NOT NULL) so JSON null and missing keys are
treated as SQL NULL and correctly filtered out.
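The `->` vs `->>` distinction driving this fix can be mirrored in TypeScript (an analogue only: `->` yields a JSON value, so a JSON null is still "not SQL NULL", while `->>` maps both JSON null and a missing key to SQL NULL):

```typescript
// TypeScript analogue of Postgres JSON access: `->` yields a JSON value
// (a JSON null is still a value, so IS NOT NULL passes), while `->>`
// yields text where both JSON null and a missing key become SQL NULL.
// SQL NULL is modeled as undefined/null on the TS side.
type Json = string | number | boolean | null | Json[] | { [k: string]: Json };

// rowData -> 'key': missing key => SQL NULL (undefined here); JSON null stays a value
const jsonArrow = (row: { [k: string]: Json }, key: string): Json | undefined => row[key];

// rowData ->> 'key': missing key and JSON null both => SQL NULL (null here)
const textArrow = (row: { [k: string]: Json }, key: string): string | null => {
  const v = row[key];
  if (v === undefined || v === null) return null;
  return typeof v === "string" ? v : JSON.stringify(v);
};

const rows: { [k: string]: Json }[] = [{ counterparty: "alice" }, { counterparty: null }, {}];
// `->`-style filter wrongly keeps the JSON-null row; `->>`-style drops it.
const keptByArrow = rows.filter(r => jsonArrow(r, "counterparty") !== undefined);
const keptByText = rows.filter(r => textArrow(r, "counterparty") !== null);
```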
In `@apps/backend/src/lib/bulldozer/db/index.ts`:
- Around line 54-67: In toExecutableSqlStatements, always use the sequential
temp-table execution path (the sequential temp-table approach currently guarded
by requiresSortSequentialExecutor) instead of the sibling-CTE branch; remove the
conditional branch that returns the sibling-CTE string and ensure the function
builds and returns the sequential executor SQL (the temp-table / sequential
execution block that currently lives in the other branch), eliminating
requiresSortSequentialExecutor or making it always true so data-modifying
statements (DELETE/INSERT) execute in order and can observe prior changes;
update any related variables
(requiresSortHelpers/requiresSortSequentialExecutor) and tests accordingly.
In `@apps/backend/src/lib/bulldozer/db/tables/group-by-table.ts`:
- Around line 187-188: The compareGroupKeys() (and the duplicate at lines
~271-283) currently returns a constant 0 causing all groups to be treated equal;
replace the stub with a real SQL JSONB comparator and thread it through
listGroups() by updating singleNullSortKeyRangePredicate() to use the same
documented JSONB default comparator, and similarly implement compareSortKeys()
to perform proper JSONB sort-key comparison; specifically, update the
compareGroupKeys and compareSortKeys functions to emit a sqlExpression that
compares JSONB values deterministically (the same expression used by
singleNullSortKeyRangePredicate/listGroups) so group range queries and
downstream operators get a correct ordering.
- Around line 16-24: The declareGroupByTable signature and implementation are
unsound: NewRD is declared but the function always returns OldRD, fromTable
erases the sort-key type with any, and compareGroupKeys returns 0 making all
groups equal. Fix by changing the generic parameters to clearly separate input
row type (OldRD) and output/group row type (NewRD), remove the any by preserving
fromTable's sort-key generic (e.g., Table<GK, SK, OldRD>), and update
declareGroupByTable to return Table<GK, SK, NewRD> (or appropriate nullability)
while transforming OldRD -> NewRD using the provided groupBy mapper; implement
compareGroupKeys to perform a real comparison based on GK (or delegate to a
provided comparator) and throw or assert if assumptions about key shape fail
instead of silently returning 0. Ensure you update uses of fromTable, groupBy,
and compareGroupKeys to match the new generics and transformation contract.
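A comparator of the kind this prompt asks for must impose a deterministic total order on JSON values. A sketch in TypeScript (the type-rank order below is an assumption for the sketch; the real fix must emit an equivalent SQL expression, and PostgreSQL's own jsonb ordering uses its own type precedence):

```typescript
// Deterministic total order over JSON values: rank by type first, then
// compare within the type. What matters is that every caller uses the
// same order, unlike the constant-0 stub this finding flags.
type Json = string | number | boolean | null | Json[] | { [k: string]: Json };

function compareJson(a: Json, b: Json): number {
  const rank = (v: Json): number =>
    v === null ? 0
    : typeof v === "boolean" ? 1
    : typeof v === "number" ? 2
    : typeof v === "string" ? 3
    : Array.isArray(v) ? 4
    : 5;
  if (rank(a) !== rank(b)) return rank(a) - rank(b);
  if (typeof a === "number" && typeof b === "number") return a - b;
  if (typeof a === "string" && typeof b === "string") return a < b ? -1 : a > b ? 1 : 0;
  if (typeof a === "boolean" && typeof b === "boolean") return Number(a) - Number(b);
  // arrays and objects: fall back to canonical JSON text comparison
  const sa = JSON.stringify(a);
  const sb = JSON.stringify(b);
  return sa < sb ? -1 : sa > sb ? 1 : 0;
}
```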
In `@apps/backend/src/lib/bulldozer/db/tables/l-fold-table.ts`:
- Around line 716-740: listRowsInGroup currently orders results with a
hard-coded "ORDER BY ... rowSortKey ASC" which can disagree with the comparator
used when folding (options.fromTable.compareSortKeys); update listRowsInGroup to
use the same ordering expression that the fold computation uses (or an explicit
persisted ordinal) instead of raw "rowSortKey ASC". Specifically, replace the
ASC ordering in both query branches with the sort expression derived from
options.fromTable.compareSortKeys (or a stored emitted ordinal), ensuring the
sortRangePredicate and ordering use the identical comparator logic; refer to
listRowsInGroup, sortRangePredicate, getGroupRowsPath, groupsPath, rowSortKey
and options.fromTable.compareSortKeys to locate where to apply the change.
In `@apps/backend/src/lib/bulldozer/db/tables/left-join-table.ts`:
- Around line 435-457: The queries in listRowsInGroup lack ORDER BY, causing
non-deterministic row order; update both branches of listRowsInGroup to append
deterministic ORDER BY clauses: for the single-group branch (the branch with
only "row") ORDER BY the computed rowIdentifier (the
("row"."keyPath"[cardinality("row"."keyPath")] #>> '{}') expression), and for
the multi-group branch ORDER BY groupKey then rowIdentifier (groupPath keyPath
expression then ("rows"."keyPath"[cardinality("rows"."keyPath")] #>> '{}') ), so
that compareSortKeys' constant sort key is disambiguated by these stable
tie-breakers. Ensure the ORDER BY uses the same expressions/aliases used in the
SELECT so they remain correct after any refactor.
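The tie-breaking idea in miniature: sorting by (groupKey, rowIdentifier) makes output order independent of input order, which is what the missing ORDER BY clauses should guarantee (an in-memory analogue, not the PR's SQL):

```typescript
// In-memory illustration of the suggested fix: ordering by the stable
// tie-breakers (groupKey, then rowIdentifier) makes results deterministic
// even though the table's compareSortKeys is a constant.
type JoinRow = { groupKey: string, rowIdentifier: string };

function deterministicOrder(rows: JoinRow[]): JoinRow[] {
  return [...rows].sort((a, b) =>
    a.groupKey !== b.groupKey
      ? (a.groupKey < b.groupKey ? -1 : 1)
      : (a.rowIdentifier < b.rowIdentifier ? -1 : a.rowIdentifier > b.rowIdentifier ? 1 : 0),
  );
}
```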
In `@apps/backend/src/lib/bulldozer/db/tables/limit-table.ts`:
- Around line 118-124: The ranking/read queries in limit-table.ts use raw JSONB
ordering ("rows"."rowSortKey") which ignores the table's declared comparator;
update the ORDER BY clauses (the ones producing "rank" and the subsequent read
queries that reference "rankedRows") to use the table's comparator by invoking
options.fromTable.compareSortKeys(...) (or the equivalent helper that emits the
comparator-aware SQL expression) instead of ordering by the raw JSONB column,
and ensure the tie-breaker "rows"."rowIdentifier" remains; apply the same fix to
the other occurrences noted (the blocks around lines 138-145, 277-283, 376-404)
so that normalizedLimit and rankedRows reflect the comparator-aware ordering.
In `@apps/backend/src/lib/bulldozer/db/tables/sort-table.ts`:
- Around line 312-334: The recursive CTE uses sort-key bounds
(options.fromTable.compareGroupKeys) to prune groups in the all-groups path (the
"groupMetadatas" block), which can incorrectly drop groups or compare the wrong
JSON shape when groupKey is omitted; update the WHERE clauses that inject
start/end comparisons so they skip calling options.fromTable.compareGroupKeys
(use a no-op condition like 1 = 1) when the query is for the all-groups path
(i.e., when groupKey is not present/omitted), and ensure any actual sort-key
filtering is deferred to the existing sortRangePredicate()/sort-range handling
later; refer to "groupMetadatas", "groupPath", "groupMetadata", and
options.fromTable.compareGroupKeys to locate and change the two conditional
SQLExpression injections for start and end.
---
Nitpick comments:
In `@apps/backend/prisma/schema.prisma`:
- Around line 1250-1258: The BulldozerStorageEngine model is missing
createdAt/updatedAt timestamps used elsewhere for observability; add createdAt
DateTime `@default`(now()) and updatedAt DateTime `@updatedAt` fields to the
BulldozerStorageEngine model (referencing the model name and field identifiers)
so new records record creation time and updates auto-update, or if omission is
intentional for performance, add a comment in the model noting that rationale
instead of leaving them out.
In `@apps/backend/src/lib/bulldozer/db/index.ts`:
- Around line 11-36: The Table type is duplicated here; remove the local
duplicate definition and re-export the canonical interface from the existing
declaration in table-type.ts instead. Replace the local type block in this file
with a re-export of the Table symbol from the module that defines it
(table-type.ts) so there is a single source of truth, and update any local
references in this file to use that re-exported Table type (keep function names
like listGroups, listRowsInGroup, compareGroupKeys, compareSortKeys, init,
delete, isInitialized, registerRowChangeTrigger unchanged). Ensure exports
remain the same shape so other modules importing Table from this file keep
working.
In `@apps/backend/src/lib/bulldozer/db/tables/concat-table.ts`:
- Line 32: The function rawExpression currently constructs an SqlExpression<T>
from an arbitrary string and thus bypasses template literal SQL safety; update
the implementation by either renaming rawExpression to unsafeRawExpression to
make the unsafe contract explicit or add a clear inline comment above
rawExpression calling out that callers are responsible for sanitizing inputs to
avoid SQL injection, referencing the SqlExpression type and any callers that use
rawExpression so they can be audited for safety.
In `@apps/backend/src/lib/bulldozer/db/tables/flat-map-table.ts`:
- Around line 213-214: The compareSortKeys assignment uses sqlExpression with an
extra leading space (sqlExpression` 0 `); update the compareSortKeys lambda to
use sqlExpression`0` (no leading/trailing space) so it matches the style used in
concat-table.ts and removes the inconsistent whitespace around the literal;
locate the compareSortKeys definition and replace the template literal
accordingly.
- Around line 29-30: The current createExpandedRowIdentifier builds identifiers
by concatenating sourceRowIdentifier and the ordinality-based flatIndex
(generated via WITH ORDINALITY), which causes identifier churn when mapper
output shifts; change this to produce stable identifiers by deriving a
content-based key instead of using positional index—e.g., compute a
deterministic hash of the mapped element (or use a stable key field provided by
the mapper) and combine that with sourceRowIdentifier; update
createExpandedRowIdentifier to accept a content-based SqlExpression (e.g.,
mappedElement or mappedElementHash) instead of flatIndex and ensure callers that
invoke createExpandedRowIdentifier supply the stable key/hash rather than the
ordinal.
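The content-based key idea can be sketched as follows. The parameter names mirror the review's wording and are assumptions; the real call sites in flat-map-table.ts build SqlExpressions rather than plain strings, and `JSON.stringify` is only stable here because the inputs have fixed key order — a production version would want canonical JSON serialization:

```typescript
import { createHash } from "node:crypto";

// Derive an expanded-row identifier from the mapped element's content rather
// than its ordinal position in the mapper's output, so identifiers stay
// stable when earlier elements are inserted or removed.
function stableExpandedRowIdentifier(
  sourceRowIdentifier: string,
  mappedElement: unknown,
): string {
  const mappedElementHash = createHash("sha256")
    .update(JSON.stringify(mappedElement))
    .digest("hex")
    .slice(0, 16);
  return `${sourceRowIdentifier}:${mappedElementHash}`;
}
```

Note that if the mapper can emit duplicate elements for one source row, the hash alone collides; a stable key field supplied by the mapper avoids that.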
In `@apps/backend/src/lib/bulldozer/db/utilities.ts`:
- Around line 40-42: quoteSqlStringLiteral currently only escapes single quotes
which can miss edge cases like null bytes and problematic Unicode; update it to
normalize and explicitly handle such characters. Modify the
quoteSqlStringLiteral function to: 1) reject or encode null bytes (e.g., throw
on '\0' or replace with an explicit escape sequence) and 2) normalize input
(e.g., NFC) and optionally escape or validate noncharacter code points and
surrogate pairs before building the SqlExpression; ensure the resulting sql
string still uses the existing single-quote doubling logic and that the function
signature (quoteSqlStringLiteral and the returned SqlExpression<string>) remains
unchanged.
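A minimal sketch of the hardened quoting, assuming a simplified `SqlExpression` shape (the real type in `utilities.ts` is authoritative):

```typescript
// Hypothetical stand-in for SqlExpression<string>.
type SqlStringExpression = { readonly sql: string };

function quoteSqlStringLiteral(input: string): SqlStringExpression {
  // Fail early, fail loud: PostgreSQL text values cannot contain NUL, and a
  // null byte that slips through would silently truncate the literal.
  if (input.includes("\u0000")) {
    throw new Error("quoteSqlStringLiteral: input contains a null byte");
  }
  // Normalize to NFC so visually identical strings produce identical literals.
  const normalized = input.normalize("NFC");
  // Keep the existing single-quote doubling logic unchanged.
  return { sql: `'${normalized.replace(/'/g, "''")}'` };
}
```

The function signature and return shape stay the same, so no call sites need to change.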
- Around line 78-87: Add a docstring to singleNullSortKeyRangePredicate that
explains it computes whether a NULL sort key can satisfy the provided range:
document the options parameters (start, end, startInclusive, endInclusive),
state that when a bound is the literal "start" or "end" it means unbounded, and
explicitly call out the edge case where either bound is exclusive and not
unbounded—this function will return false because NULL cannot satisfy strict >
or < comparisons; also mention the returned SqlExpression<boolean> semantics
(always-true or always-false SQL literal).
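The edge case the docstring must call out can be encoded as a small decision sketch. This is hedged: the `"DEPENDS"` branch stands in for the engine-defined behavior, and the real function returns a `SqlExpression<boolean>` literal, not a string:

```typescript
// "start" / "end" sentinels mean the corresponding bound is unbounded.
function nullSortKeyRangeEdgeCase(options: {
  start: unknown;
  end: unknown;
  startInclusive: boolean;
  endInclusive: boolean;
}): "FALSE" | "DEPENDS" {
  const startUnbounded = options.start === "start";
  const endUnbounded = options.end === "end";
  // A bounded, exclusive edge requires NULL to satisfy a strict > or <
  // comparison, which it never can, so the predicate is always false.
  if (!startUnbounded && !options.startInclusive) return "FALSE";
  if (!endUnbounded && !options.endInclusive) return "FALSE";
  // Otherwise the answer depends on where the engine sorts NULL keys.
  return "DEPENDS";
}
```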
ℹ️ Review info
⚙️ Run configuration
Configuration used: defaults
Review profile: CHILL
Plan: Pro
Run ID: 6843e08f-a066-48c7-8364-28bebef70f53
⛔ Files ignored due to path filters (1)
pnpm-lock.yaml is excluded by !**/pnpm-lock.yaml
📒 Files selected for processing (30)
- .vscode/settings.json
- AGENTS.md
- apps/backend/package.json
- apps/backend/prisma/migrations/20260323120000_add_bulldozer_data/migration.sql
- apps/backend/prisma/migrations/20260323120000_add_bulldozer_data/tests/ltree-queries.ts
- apps/backend/prisma/schema.prisma
- apps/backend/scripts/run-bulldozer-studio.ts
- apps/backend/scripts/run-cron-jobs.ts
- apps/backend/src/lib/bulldozer/bulldozer-schema.ts
- apps/backend/src/lib/bulldozer/db/bulldozer-sort-helpers-sql.ts
- apps/backend/src/lib/bulldozer/db/example-schema.ts
- apps/backend/src/lib/bulldozer/db/index.fuzz.test.ts
- apps/backend/src/lib/bulldozer/db/index.perf.test.ts
- apps/backend/src/lib/bulldozer/db/index.test.ts
- apps/backend/src/lib/bulldozer/db/index.ts
- apps/backend/src/lib/bulldozer/db/table-type.ts
- apps/backend/src/lib/bulldozer/db/tables/concat-table.ts
- apps/backend/src/lib/bulldozer/db/tables/filter-table.ts
- apps/backend/src/lib/bulldozer/db/tables/flat-map-table.ts
- apps/backend/src/lib/bulldozer/db/tables/group-by-table.ts
- apps/backend/src/lib/bulldozer/db/tables/l-fold-table.ts
- apps/backend/src/lib/bulldozer/db/tables/left-join-table.ts
- apps/backend/src/lib/bulldozer/db/tables/limit-table.ts
- apps/backend/src/lib/bulldozer/db/tables/map-table.ts
- apps/backend/src/lib/bulldozer/db/tables/sort-table.ts
- apps/backend/src/lib/bulldozer/db/tables/stored-table.ts
- apps/backend/src/lib/bulldozer/db/utilities.ts
- apps/backend/src/prisma-client.tsx
- apps/dev-launchpad/public/index.html
- claude/CLAUDE-KNOWLEDGE.md
```ts
listRowsInGroup: ({ groupKey, start, end, startInclusive, endInclusive }) => groupKey ? sqlQuery`
  SELECT
    ("row"."keyPath"[cardinality("row"."keyPath")] #>> '{}') AS rowIdentifier,
    "row"."value"->'rowSortKey' AS rowSortKey,
    "row"."value"->'rowData' AS rowData
  FROM "BulldozerStorageEngine" AS "row"
  WHERE "row"."keyPathParent" = ${getGroupRowsPath(groupKey)}::jsonb[]
    AND ${sortRangePredicate(sqlExpression`"row"."value"->'rowSortKey'`, { start, end, startInclusive, endInclusive })}
  ORDER BY rowSortKey ASC, rowIdentifier ASC
` : sqlQuery`
  SELECT
    "groupPath"."keyPath"[cardinality("groupPath"."keyPath")] AS groupKey,
    ("rows"."keyPath"[cardinality("rows"."keyPath")] #>> '{}') AS rowIdentifier,
    "rows"."value"->'rowSortKey' AS rowSortKey,
    "rows"."value"->'rowData' AS rowData
  FROM "BulldozerStorageEngine" AS "groupPath"
  INNER JOIN "BulldozerStorageEngine" AS "groupRowsPath"
    ON "groupRowsPath"."keyPathParent" = "groupPath"."keyPath"
  INNER JOIN "BulldozerStorageEngine" AS "rows"
    ON "rows"."keyPathParent" = "groupRowsPath"."keyPath"
  WHERE "groupPath"."keyPathParent" = ${groupsPath}::jsonb[]
    AND "groupRowsPath"."keyPath"[cardinality("groupRowsPath"."keyPath")] = to_jsonb('rows'::text)
    AND ${sortRangePredicate(sqlExpression`"rows"."value"->'rowSortKey'`, { start, end, startInclusive, endInclusive })}
  ORDER BY groupKey ASC, rowSortKey ASC, rowIdentifier ASC
`,
```
Don't re-sort folded rows with raw column ordering.
The fold is computed from options.fromTable.compareSortKeys, but the read path ignores that comparator and orders directly on the stored rowSortKey value. If a caller supplies a comparator that does not match the column's default ascending order, listRowsInGroup() will expose a different sequence than the one used during recomputation. Read through the source sort table's order, or persist an explicit ordinal for emitted rows, instead of ORDER BY rowSortKey ASC. (postgresql.org)
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@apps/backend/src/lib/bulldozer/db/tables/l-fold-table.ts` around lines 716 -
740, listRowsInGroup currently orders results with a hard-coded "ORDER BY ...
rowSortKey ASC" which can disagree with the comparator used when folding
(options.fromTable.compareSortKeys); update listRowsInGroup to use the same
ordering expression that the fold computation uses (or an explicit persisted
ordinal) instead of raw "rowSortKey ASC". Specifically, replace the ASC ordering
in both query branches with the sort expression derived from
options.fromTable.compareSortKeys (or a stored emitted ordinal), ensuring the
sortRangePredicate and ordering use the identical comparator logic; refer to
listRowsInGroup, sortRangePredicate, getGroupRowsPath, groupsPath, rowSortKey
and options.fromTable.compareSortKeys to locate where to apply the change.
```ts
    row_number() OVER (
      PARTITION BY "rows"."groupKey"
      ORDER BY "rows"."rowSortKey" ASC, "rows"."rowIdentifier" ASC
    ) AS "rank"
  FROM ${quoteSqlIdentifier(oldGroupRowsTableName)} AS "rows"
) AS "rankedRows"
WHERE "rankedRows"."rank" <= ${normalizedLimit}
```
limit is not honoring the source table's declared sort order.
These ranking and read queries order by raw JSONB (rowSortKey ASC) instead of options.fromTable.compareSortKeys(...). The operator is therefore only correct when the table's comparator happens to match PostgreSQL's native jsonb ordering; descending or custom sort semantics will materialize and return the wrong top-N. If natural JSONB ordering is the intended constraint, the API needs to state that explicitly.
Also applies to: 138-145, 277-283, 376-404
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@apps/backend/src/lib/bulldozer/db/tables/limit-table.ts` around lines 118 -
124, The ranking/read queries in limit-table.ts use raw JSONB ordering
("rows"."rowSortKey") which ignores the table's declared comparator; update the
ORDER BY clauses (the ones producing "rank" and the subsequent read queries that
reference "rankedRows") to use the table's comparator by invoking
options.fromTable.compareSortKeys(...) (or the equivalent helper that emits the
comparator-aware SQL expression) instead of ordering by the raw JSONB column,
and ensure the tie-breaker "rows"."rowIdentifier" remains; apply the same fix to
the other occurrences noted (the blocks around lines 138-145, 277-283, 376-404)
so that normalizedLimit and rankedRows reflect the comparator-aware ordering.
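One way to make the ranking and read queries comparator-aware, under the simplifying assumption that comparators are native jsonb ordering, possibly reversed (the names here are hypothetical; arbitrary comparators would instead need a persisted ordinal, as the comments note):

```typescript
type SortDirection = "asc" | "desc";

// Build the ORDER BY fragment for the row_number() window from the table's
// declared direction instead of hard-coding ASC.
function rankingOrderBy(direction: SortDirection): string {
  const dir = direction === "asc" ? "ASC" : "DESC";
  // rowIdentifier stays as the tie-breaker so the ordering is total and stable.
  return `ORDER BY "rows"."rowSortKey" ${dir}, "rows"."rowIdentifier" ASC`;
}

// The same fragment must be reused by every read path so the materialized
// top-N and listRowsInGroup() agree on ordering.
const rankExpression = (direction: SortDirection) =>
  `row_number() OVER (PARTITION BY "rows"."groupKey" ${rankingOrderBy(direction)}) AS "rank"`;
```

If natural jsonb ordering really is the intended contract, documenting that on the table factory is the cheaper fix.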
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
Cursor Bugbot has reviewed your changes and found 2 potential issues.
There are 4 total unresolved issues (including 2 from previous reviews).
```ts
function quoteSqlJsonbLiteral(input: unknown): string {
  return `${quoteSqlStringLiteral(JSON.stringify(input))}::jsonb`;
}
```
Duplicated SQL quoting functions across two files
Low Severity
quoteSqlStringLiteral, quoteSqlIdentifier, and quoteSqlJsonbLiteral are duplicated between run-bulldozer-studio.ts and utilities.ts with identical logic. The studio already imports from the bulldozer library (toExecutableSqlStatements, toQueryableSqlQuery), so these quoting functions could be reused by extracting the .sql property from the library's typed return values instead of re-implementing them.
Additional Locations (1)
Actionable comments posted: 1
🧹 Nitpick comments (2)
apps/backend/src/lib/bulldozer/db/index.ts (2)
13-13: Add comment explaining `any` usage per coding guidelines. Line 13 uses `Table<any, any, any>[]` without explanation. Per coding guidelines, when using `any`, leave a comment explaining why it's necessary and how type safety is preserved.

Suggested documentation

```diff
-  inputTables: Table<any, any, any>[],
+  // `any` is required here because input tables can have heterogeneous generic parameters;
+  // TypeScript lacks existential types to express "Table with some GK/SK/RD". Type safety is
+  // maintained because these tables are only used for dependency tracking and lifecycle management.
+  inputTables: Table<any, any, any>[],
```

As per coding guidelines: "Try to avoid the `any` type. Whenever you need to use `any`, leave a comment explaining why you're using it."

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@apps/backend/src/lib/bulldozer/db/index.ts` at line 13, Add an inline comment next to the inputTables parameter that explains why Table<any, any, any>[] uses any (e.g., dynamic table schemas, third-party types, or runtime-validated shapes) and describe how type safety is preserved (runtime validation, narrowing later in functions like the code that processes inputTables, or use of discriminated unions). Reference the symbol inputTables and the Table<any, any, any> type in your comment so reviewers see the justification and the mitigation strategy.
93-93: Consider validating `statementTimeout` format before interpolation. Direct string interpolation into SQL without validation could cause issues if unexpected input is passed. While `SET LOCAL statement_timeout` will likely reject malformed values, validating the format defensively aligns with "fail early, fail loud" principles.

Suggested validation

```diff
 export function toExecutableSqlTransaction(statements: SqlStatement[], options: { statementTimeout?: string } = {}): string {
+  if (options.statementTimeout != null && !/^\d+\s?(ms|s|min|h)?$/.test(options.statementTimeout)) {
+    throw new Error(`Invalid statement_timeout format: ${options.statementTimeout}`);
+  }
   return deindent`
     BEGIN;
```

As per coding guidelines: "Fail early, fail loud. Fail fast with an error instead of silently continuing."
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@apps/backend/src/lib/bulldozer/db/index.ts` at line 93, Validate options.statementTimeout before interpolating into the SQL string: before the template that builds `${options.statementTimeout ? \`SET LOCAL statement_timeout = '\${options.statementTimeout}';\` : ""}`, add a defensive check that options.statementTimeout is a non-empty string matching an allowed format (for example /^\d+(ms|s|min|h)?$/ or whichever units your DB accepts) and throw a clear error (e.g., new Error("Invalid statementTimeout format")) if it fails; keep the rest of the interpolation unchanged so only validated values are used.
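The suggested guard, extracted as a runnable sketch (the accepted unit set is an assumption taken from the review's regex; PostgreSQL accepts more interval units than listed here):

```typescript
// Reject anything that is not "<digits>" optionally followed by a recognized
// unit, so the value can be safely interpolated into SET LOCAL statement_timeout.
function assertValidStatementTimeout(statementTimeout: string): void {
  if (!/^\d+\s?(ms|s|min|h)?$/.test(statementTimeout)) {
    throw new Error(`Invalid statement_timeout format: ${statementTimeout}`);
  }
}
```

Calling this once at the top of `toExecutableSqlTransaction` keeps the interpolation itself unchanged while failing loudly on injection-shaped input.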
🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.
Inline comments:
In `@apps/backend/src/lib/bulldozer/db/tables/limit-table.ts`:
- Line 374: The ORDER BY in listRowsInGroup is using raw JSONB ordering (ORDER
BY rowSortKey ASC) but the WHERE bounds use options.fromTable.compareSortKeys,
causing inconsistent ordering; update the query so ordering uses the same
comparator expression used for bounds—e.g., project the comparator result (or
call the same compareSortKeys expression/function) in the SELECT as an alias and
ORDER BY that alias, or if the comparator matches JSONB semantics, document that
constraint instead; ensure references to rowSortKey and rowIdentifier remain for
stable tie-breaking.
---
Nitpick comments:
In `@apps/backend/src/lib/bulldozer/db/index.ts`:
- Line 13: Add an inline comment next to the inputTables parameter that explains
why Table<any, any, any>[] uses any (e.g., dynamic table schemas, third-party
types, or runtime-validated shapes) and describe how type safety is preserved
(runtime validation, narrowing later in functions like the code that processes
inputTables, or use of discriminated unions). Reference the symbol inputTables
and the Table<any, any, any> type in your comment so reviewers see the
justification and the mitigation strategy.
- Line 93: Validate options.statementTimeout before interpolating into the SQL
string: before the template that builds `${options.statementTimeout ? \`SET
LOCAL statement_timeout = '\${options.statementTimeout}';\` : ""}`, add a
defensive check that options.statementTimeout is a non-empty string matching an
allowed format (for example /^\d+(ms|s|min|h)?$/ or whichever units your DB
accepts) and throw a clear error (e.g., new Error("Invalid statementTimeout
format")) if it fails; keep the rest of the interpolation unchanged so only
validated values are used.
ℹ️ Review info
⚙️ Run configuration
Configuration used: defaults
Review profile: CHILL
Plan: Pro
Run ID: 2f44a517-e596-4dcb-bfc4-95ac1d1151be
📒 Files selected for processing (12)
- apps/backend/src/lib/bulldozer/db/index.perf.test.ts
- apps/backend/src/lib/bulldozer/db/index.ts
- apps/backend/src/lib/bulldozer/db/tables/concat-table.ts
- apps/backend/src/lib/bulldozer/db/tables/filter-table.ts
- apps/backend/src/lib/bulldozer/db/tables/flat-map-table.ts
- apps/backend/src/lib/bulldozer/db/tables/group-by-table.ts
- apps/backend/src/lib/bulldozer/db/tables/l-fold-table.ts
- apps/backend/src/lib/bulldozer/db/tables/left-join-table.ts
- apps/backend/src/lib/bulldozer/db/tables/limit-table.ts
- apps/backend/src/lib/bulldozer/db/tables/map-table.ts
- apps/backend/src/lib/bulldozer/db/tables/sort-table.ts
- apps/backend/src/lib/bulldozer/db/tables/stored-table.ts
✅ Files skipped from review due to trivial changes (2)
- apps/backend/src/lib/bulldozer/db/tables/l-fold-table.ts
- apps/backend/src/lib/bulldozer/db/index.perf.test.ts
🚧 Files skipped from review as they are similar to previous changes (1)
- apps/backend/src/lib/bulldozer/db/tables/group-by-table.ts
```ts
    ? sqlExpression`${options.fromTable.compareSortKeys(sqlExpression`"row"."value"->'rowSortKey'`, end)} <= 0`
    : sqlExpression`${options.fromTable.compareSortKeys(sqlExpression`"row"."value"->'rowSortKey'`, end)} < 0`
}
ORDER BY rowSortKey ASC, rowIdentifier ASC
```
listRowsInGroup output ordering is inconsistent with bounds filtering.
The WHERE clause correctly uses options.fromTable.compareSortKeys for start/end bounds (lines 364-372), but the final ORDER BY rowSortKey ASC uses raw JSONB ordering. This inconsistency could cause unexpected row ordering when the comparator differs from native JSONB comparison.
Suggested fix
Consider using a consistent ordering approach. If the comparator is expected to match JSONB ordering, document that constraint. Otherwise, the ORDER BY should use a subquery or expression that applies the comparator logic.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@apps/backend/src/lib/bulldozer/db/tables/limit-table.ts` at line 374, The
ORDER BY in listRowsInGroup is using raw JSONB ordering (ORDER BY rowSortKey
ASC) but the WHERE bounds use options.fromTable.compareSortKeys, causing
inconsistent ordering; update the query so ordering uses the same comparator
expression used for bounds—e.g., project the comparator result (or call the same
compareSortKeys expression/function) in the SELECT as an alias and ORDER BY that
alias, or if the comparator matches JSONB semantics, document that constraint
instead; ensure references to rowSortKey and rowIdentifier remain for stable
tie-breaking.
Note
Medium Risk
Introduces a new persisted storage table plus large amounts of SQL logic and raw SQL execution paths (including `$executeRawUnsafe`), which could impact data integrity and migrations if enabled in production. Most additions are dev tooling/tests, but the new migration and schema model warrant careful review.

Overview
- Adds a new `BulldozerStorageEngine` persisted table (Prisma model + migration) using `jsonb[]` key paths, a generated `keyPathParent` column with a self-referential FK, seed root rows, and index coverage; includes migration tests validating hierarchy queries/constraints.
- Introduces Bulldozer Studio (`run-bulldozer-studio.ts`) and dev wiring (`pnpm dev` + `run-bulldozer-studio` script, new `elkjs` dep) to visualize Bulldozer table graphs, inspect table details, and perform init/delete and row mutations via raw SQL.
- Adds substantial Bulldozer DB verification: a new SQL helper bundle for sort-table operations (`BULLDOZER_SORT_HELPERS_SQL`), an example composed schema, and a large Postgres-backed fuzz test suite exercising table operators under randomized mutations/re-inits; also tweaks the cron job runner to delay execution at startup and updates minor docs/editor config.

Written by Cursor Bugbot for commit d3a2daa. This will update automatically on new commits.
Summary by CodeRabbit
New Features
Infrastructure
Chores