Rust SQLite Lookup And Retrieval Tag Audit
- Audit date: 2026-03-18 (Asia/Tbilisi)
- Repository root: `/home/standard/dspy_rag_in_repo_docs_and_impl1_continue`
- Git HEAD during final verification: `4c635c712f2143a91da756dd0568e02674acb2ac`
Scope
This audit records two follow-up changes on top of the upstream repository-benchmark broadening
that landed on origin/master in the same session:
- the repository benchmark summaries now expose per-tag rollups and exclude repo-local `.codex` skill surfaces from the benchmark corpus
- the Rust wrapper now exposes a SQLite `index`/`lookup` path so agents can do cheap tracked-file discovery before escalating to DSPy-backed synthesis
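
The first change can be sketched as follows. This is an illustrative sketch only, not the code in `src/repo_rag_lab/benchmarks.py`: the helper names `is_benchmark_source` and `summarize_by_tag`, the `EXCLUDED_DIRS` constant, and the `tags`/`passed` result fields are all hypothetical.

```python
from collections import defaultdict
from pathlib import PurePosixPath

# Hypothetical constant: directory names kept out of the benchmark corpus.
EXCLUDED_DIRS = {".codex"}


def is_benchmark_source(path: str) -> bool:
    """Return False when any path component is an excluded directory."""
    return not EXCLUDED_DIRS.intersection(PurePosixPath(path).parts)


def summarize_by_tag(results: list) -> dict:
    """Group benchmark results by tag and report a count and pass rate per tag.

    Each result is assumed to be a dict with a "tags" list and a "passed" bool.
    """
    buckets = defaultdict(list)
    for result in results:
        for tag in result["tags"]:
            buckets[tag].append(result)
    return {
        tag: {
            "benchmark_count": len(items),
            "pass_rate": sum(1 for item in items if item["passed"]) / len(items),
        }
        for tag, items in sorted(buckets.items())
    }
```

A benchmark carrying several tags contributes to each of its tag buckets, so per-tag counts can sum to more than the overall benchmark count.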
Executed Commands
Executed successfully in this turn:
- `make hooks-install`
- `uv run python -m compileall src tests`
- `cargo build --manifest-path rust-cli/Cargo.toml`
- `make rust-lookup-index`
- `make rust-lookup QUERY='dspy training'`
- `uv run pytest tests/test_utilities.py tests/test_project_surfaces.py tests/test_cli_and_dspy.py tests/test_repository_rag_bdd.py`
- `uv run repo-rag smoke-test`
- `uv run repo-rag verify-surfaces`
- `uv run repo-rag retrieval-eval --root . --top-k 4 --top-k-sweep 1,2,4,8`
- `uv run pytest tests/test_utilities.py tests/test_repository_rag_bdd.py`
- `make quality`
Observed and then corrected in this turn:
- `make rust-lookup QUERY='dspy training'` failed with `failed to initialize SQLite schema: database is locked` when it was launched in parallel with `make rust-lookup-index`. The serialized rerun above passed, which is the supported workflow.
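
The failure mode can be reproduced in isolation. The sketch below is not the Rust wrapper's code; it assumes SQLite's default rollback-journal locking and uses a hypothetical `files` table to show a second connection being rejected while the first holds a write transaction, then succeeding once the first has committed.

```python
import os
import sqlite3
import tempfile
from typing import Optional, Tuple


def parallel_then_serial(db_path: str) -> Tuple[Optional[str], bool]:
    """Return (error seen while the indexer holds the lock, success after commit)."""
    # isolation_level=None gives explicit transaction control.
    indexer = sqlite3.connect(db_path, isolation_level=None)
    # timeout=0 makes the second connection fail immediately instead of waiting.
    lookup = sqlite3.connect(db_path, timeout=0, isolation_level=None)
    schema = "CREATE TABLE IF NOT EXISTS files(path TEXT)"  # hypothetical schema
    try:
        indexer.execute("BEGIN EXCLUSIVE")
        indexer.execute(schema)
        try:
            # "Parallel" case: schema initialization while the indexer is mid-write.
            lookup.execute(schema)
            parallel_error = None
        except sqlite3.OperationalError as exc:
            parallel_error = str(exc)
        indexer.execute("COMMIT")
        # "Serialized" case: the same statement after the indexer has finished.
        lookup.execute(schema)
        return parallel_error, True
    finally:
        indexer.close()
        lookup.close()


with tempfile.TemporaryDirectory() as tmp:
    error, serial_ok = parallel_then_serial(os.path.join(tmp, "demo.sqlite3"))
    print(error, serial_ok)
```

A busy timeout on the reading side would let it wait rather than fail, but running the index step to completion first avoids the contention entirely.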
Notable Results
- `cargo build --manifest-path rust-cli/Cargo.toml`: passed
- `make rust-lookup-index`: passed and wrote `artifacts/sqlite/repo-file-index.sqlite3` with `indexed=204`, `skipped_binary=3`, `skipped_large=2`
- `make rust-lookup QUERY='dspy training'`: passed; the top ranked hits were `src/repo_rag_lab/notebook_scaffolding.py`, `README.DSPY.MD`, `src/repo_rag_lab/training_samples.py`, and `src/repo_rag_lab/dspy_training.py`
- `uv run pytest tests/test_utilities.py tests/test_project_surfaces.py tests/test_cli_and_dspy.py tests/test_repository_rag_bdd.py`: passed, `39` tests
- `uv run repo-rag smoke-test`: passed with `answer_contains_repository: true`, `mcp_candidate_count: 1`, and `manifest_path: artifacts/azure/repo-rag-smoke.json`
- `uv run repo-rag verify-surfaces`: passed with `issue_count: 0`
- `uv run repo-rag retrieval-eval --root . --top-k 4 --top-k-sweep 1,2,4,8`: passed and reported:
  - `benchmark_count: 8`
  - default `top_k: 4`
  - `pass_rate: 1.0`
  - `fully_covered_rate: 1.0`
  - `average_source_recall: 1.0`
  - `average_source_precision: 0.46875`
  - `average_reciprocal_rank: 1.0`
  - `best_pass_rate_top_k: 4`
  - non-empty `tag_summaries` for benchmark tags such as `agents`, `api`, `azure`, `docs`, `notebooks`, `publication`, `rag`, and `utilities`
- `uv run pytest tests/test_utilities.py tests/test_repository_rag_bdd.py`: passed, `13` tests
- `make quality`: passed with `120` tests and `88.02%` total coverage
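
For readers interpreting the retrieval-eval numbers, the sketch below shows one common way to compute source recall, source precision, and reciprocal rank for a single benchmark. The function name and the exact definitions are assumptions; the repository's own formulas may differ. Under these definitions, perfect recall with a precision below 1.0 means every expected source was retrieved alongside some non-expected ones. If precision is relevant hits divided by `top_k`, then 0.46875 equals 15/32, or 15 relevant hits across 8 benchmarks with 4 retrieved sources each.

```python
def retrieval_metrics(retrieved: list, expected: list) -> dict:
    """Compute recall, precision, and reciprocal rank for one benchmark.

    `retrieved` is the ranked list of source paths; `expected` is the set of
    sources the benchmark considers relevant.
    """
    expected_set = set(expected)
    hits = [path for path in retrieved if path in expected_set]
    recall = len(set(hits)) / len(expected_set) if expected_set else 0.0
    precision = len(hits) / len(retrieved) if retrieved else 0.0
    reciprocal_rank = 0.0
    for rank, path in enumerate(retrieved, start=1):
        if path in expected_set:
            reciprocal_rank = 1.0 / rank  # rank of the first relevant hit
            break
    return {
        "source_recall": recall,
        "source_precision": precision,
        "reciprocal_rank": reciprocal_rank,
    }


# Two expected sources, both found within four retrieved paths.
example = retrieval_metrics(
    retrieved=["a.py", "b.md", "c.py", "d.py"],
    expected=["a.py", "c.py"],
)
print(example)
```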
Current Verification Status
Configured and verified in this turn:
- Compile checks: present and passed through `uv run python -m compileall src tests`
- Rust wrapper build: present and passed through `cargo build --manifest-path rust-cli/Cargo.toml`
- Rust SQLite index and lookup path: present and passed through `make rust-lookup-index` and the serialized `make rust-lookup QUERY='dspy training'`
- Utility, project-surface, CLI, and BDD pytest coverage for the changed user-facing surfaces: present and passed through the targeted `uv run pytest ...` slice above
- Repository smoke test: present and passed through `uv run repo-rag smoke-test`
- Repository-surface verification: present and passed through `uv run repo-rag verify-surfaces`
- Retrieval-quality evaluation utility: present and passed through `uv run repo-rag retrieval-eval --root . --top-k 4 --top-k-sweep 1,2,4,8`
- Baseline utility and BDD pytest slice: present and passed through `uv run pytest tests/test_utilities.py tests/test_repository_rag_bdd.py`
- Full repository quality gate: present and passed through `make quality`
Still absent or not exercised in this turn:
- UI or browser tests: none found in repository configuration
- Full notebook batch execution: notebook lint and surface checks passed through `make quality`, but `make notebook-report` was not rerun end-to-end in this turn
- Live Azure OpenAI and Azure AI Inference probes: not rerun in this turn
- Post-push GitHub Actions evidence: not yet available before the push for this change set
Notes
- The upstream broadened benchmark corpus from `origin/master` was kept intact in this turn; the local retrieval change is the reporting layer around it, not a replacement of those 8 training examples.
- `src/repo_rag_lab/benchmarks.py` now excludes `.codex` from the benchmark corpus so repo-local skill instructions do not inflate retrieval-eval results.
- `summarize_benchmark_results(...)` now includes `tag_summaries`, which flow through the CLI, utility helpers, and notebook scaffolding without changing the answer-generation path itself.
- The Rust lookup path is intentionally lightweight and local-first: it builds an ignored SQLite FTS index over tracked UTF-8 files and prints ranked path/snippet hits before any DSPy step.
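
The index-then-lookup flow in the last note can be approximated in a few lines. This Python sketch is not the Rust implementation: the table name `file_index`, the `MAX_BYTES` cutoff, and the choice of FTS5 with `bm25` ranking are all assumptions, and it requires a SQLite build with FTS5 enabled. The counters reuse the names from the `make rust-lookup-index` output reported above.

```python
import sqlite3

MAX_BYTES = 1_000_000  # hypothetical size cutoff for "large" files


def build_index(conn: sqlite3.Connection, files: dict) -> dict:
    """Index UTF-8 files (path -> raw bytes) and count what was skipped."""
    conn.execute(
        "CREATE VIRTUAL TABLE IF NOT EXISTS file_index "
        "USING fts5(path UNINDEXED, body)"
    )
    stats = {"indexed": 0, "skipped_binary": 0, "skipped_large": 0}
    for path, raw in files.items():
        if len(raw) > MAX_BYTES:
            stats["skipped_large"] += 1
            continue
        try:
            body = raw.decode("utf-8")
        except UnicodeDecodeError:
            # Treat anything that is not valid UTF-8 as binary.
            stats["skipped_binary"] += 1
            continue
        conn.execute(
            "INSERT INTO file_index(path, body) VALUES (?, ?)", (path, body)
        )
        stats["indexed"] += 1
    conn.commit()
    return stats


def lookup(conn: sqlite3.Connection, query: str, limit: int = 4) -> list:
    """Return (path, snippet) rows ranked by BM25, best match first."""
    # Quote each term so user input is matched as plain words, not FTS5 syntax.
    match = " ".join('"' + term.replace('"', '""') + '"' for term in query.split())
    rows = conn.execute(
        "SELECT path, snippet(file_index, 1, '[', ']', '...', 8) "
        "FROM file_index WHERE file_index MATCH ? "
        "ORDER BY bm25(file_index) LIMIT ?",
        (match, limit),
    )
    return rows.fetchall()
```

Because the lookup is a single local query against a prebuilt index, it stays cheap enough to run before deciding whether a DSPy-backed step is needed.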