docs: add deployment, performance tuning guides and streamline gettin… #277

kirit93 · 2026-01-31T03:35:55Z

Updated docs to simplify the introduction, explain library vs. microservice deployment, and give more details on inference.

greptile-apps · 2026-01-31T03:38:51Z

Greptile Overview

Greptile Summary

This PR significantly improves the documentation by consolidating the getting started experience and adding comprehensive guides on deployment and performance optimization.

Major improvements:

- Merged installation.md and quick-start.md into a streamlined index.md that provides installation, setup, and a first dataset example in one place
- Added architecture-and-performance.md with detailed execution model explanation, concurrency formulas, and tuning guidelines
- Added deployment-options.md comparing library vs microservice deployment with decision flowchart
- Introduced a Dev Notes blog section with mkdocs-material blog plugin configuration
- Updated all internal cross-references from deleted pages to the new consolidated index.md

Known issues (already flagged in previous reviews):

- Broken link in docs/index.md to notebooks/README.md (should be notebook_source/README.md)
- Broken link in docs/blog/posts/welcome.md to ../../notebooks/README.md
- Future date in welcome blog post (2026-01-22 should likely be 2025-01-22)
- Heading structure inconsistency in deployment-options.md

The documentation restructuring is well-executed and makes the getting started experience much more cohesive. The new performance and deployment guides fill important gaps.

Confidence Score: 4/5

- This PR is safe to merge with minor documentation issues that should be addressed.
- Documentation changes only with no code modifications. The restructuring is logical and well-executed. The score reflects the presence of broken links and a date issue that were already flagged but should be fixed before or shortly after merge.
- Fix broken links in docs/index.md and docs/blog/posts/welcome.md, and verify the blog post date.

Important Files Changed

| Filename | Overview |
|----------|----------|
| docs/blog/posts/welcome.md | Added welcome blog post (has broken link and future date already flagged) |
| docs/concepts/architecture-and-performance.md | Added comprehensive performance tuning guide with execution model and configuration parameters |
| docs/concepts/deployment-options.md | Added deployment guide comparing library vs microservice (has heading structure issue already flagged) |
| docs/index.md | Streamlined welcome page with installation, setup, and first dataset example (has broken link already flagged) |
| mkdocs.yml | Updated navigation to remove deleted pages, add new guides, and configure blog plugin |

Sequence Diagram

sequenceDiagram
    participant User
    participant IndexMD as docs/index.md
    participant ArchGuide as architecture-and-performance.md
    participant DeployGuide as deployment-options.md
    participant ModelDocs as Model Documentation
    participant Blog as Dev Notes Blog

    User->>IndexMD: Visit documentation homepage
    Note over IndexMD: Now includes installation & quick start<br/>(merged from deleted files)
    
    IndexMD->>User: Show install, setup, first dataset example
    
    User->>DeployGuide: Learn about deployment options
    Note over DeployGuide: New: Library vs Microservice guide
    DeployGuide->>User: Decision flowchart & comparison
    
    User->>ArchGuide: Optimize performance
    Note over ArchGuide: New: Comprehensive performance guide
    ArchGuide->>User: Execution model, tuning parameters,<br/>concurrency formulas
    
    User->>ModelDocs: Configure models
    Note over ModelDocs: Updated links to index.md<br/>(removed quick-start.md references)
    ModelDocs->>ArchGuide: Cross-reference for concurrency
    
    User->>Blog: Read dev notes
    Note over Blog: New: Blog section with welcome post<br/>(mkdocs-material blog plugin)
    Blog->>User: Team insights & deep dives

greptile-apps

5 files reviewed, no comments

docs/index.md

greptile-apps

5 files reviewed, 2 comments

greptile-apps · 2026-01-31T18:24:17Z

docs/concepts/deployment-options.md

+
+The library is the right choice for most users. Choose it if you:
+
+### You Have Access to LLMs


heading structure inconsistent with rest of section - "You Have Access to LLMs" is a level 3 heading but reads like it should be a bullet point or part of the previous section's explanation

greptile-apps · 2026-01-31T18:24:19Z

docs/concepts/performance-tuning.md

+
+- [Inference Architecture](inference-architecture.md): Understanding separation of concerns
+- [Model Configuration](models/model-configs.md): Complete model settings reference
+- [RunConfig Reference](../code_reference/run_config.md): API documentation


broken link to non-existent ../code_reference/run_config.md - the code_reference directory doesn't exist in the repository

…g started

- Add deployment-options.md: Library vs. Microservice decision guide
- Add inference-architecture.md: Separation of concerns with LLM servers
- Add performance-tuning.md: Concurrency and batching optimization guide
- Streamline index.md: Merge installation, add quick example, simplify
- Remove quick-start.md: Content merged into welcome page
- Remove installation.md: Content merged into welcome page
- Update model docs: Add concurrency control sections and cross-references
- Update mkdocs.yml: Add new Architecture section to navigation

greptile-apps

3 files reviewed, 1 comment

greptile-apps · 2026-02-02T18:49:54Z

docs/concepts/inference-architecture.md

+---
+
+---


duplicate horizontal rules (two --- in a row)

Suggested change

---

---

---

## Next Steps

- Remove duplicate max_parallel_requests tables from model-configs.md and inference-parameters.md
- Remove duplicate Concurrency Control section from model-configs.md
- Simplify Concurrency Control in inference-parameters.md to link to performance-tuning.md
- Remove Troubleshooting section from inference-architecture.md (covered in performance-tuning.md)
- performance-tuning.md is now the authoritative source for tuning guidance

greptile-apps

4 files reviewed, no comments

greptile-apps

4 files reviewed, 1 comment

greptile-apps · 2026-02-02T21:17:38Z

Additional Comments (1)

mkdocs.yml
missing blog plugin configuration - the PR adds blog files (.authors.yml, blog/index.md, blog/posts/welcome.md) but doesn't configure the MkDocs blog plugin

  - blog
  - search

nabinchha · 2026-02-02T21:14:49Z

docs/concepts/architecture-and-performance.md

+
+---
+
+## Execution Model


I'd perhaps call out that this is our current column-wise dataset generator, with other dataset generators in the works.

nabinchha · 2026-02-02T21:16:32Z

docs/concepts/architecture-and-performance.md

+```
+┌─────── Batch 1 (buffer_size records) ────────────────────────────────────────┐
+│                                                                               │
+│  Column 1 (Sampler):  ════════►  (non_inference_max_parallel_workers)        │


It may be worth calling out that this is just an example. While LLM columns will always come after samplers, expression columns may come before certain LLM columns.

nabinchha · 2026-02-02T21:18:07Z

docs/concepts/architecture-and-performance.md

+```python
+from data_designer.config import RunConfig
+
+run_config = RunConfig(buffer_size=2000)


Maybe show how to set this? data_designer.set_run_config(...)

Should we follow the dd.config.RunConfig pattern in these examples?

nabinchha · 2026-02-02T21:21:48Z

docs/concepts/architecture-and-performance.md

+|-------------------|-------------------|
+| NVIDIA API Catalog | 4-8 |
+| Self-hosted vLLM (single GPU) | 8-16 |
+| Self-hosted vLLM (multi-GPU) | 16-64 |


This depends on the vLLM server stack and model performance. It could easily be as high as 1024, for example. It would be good to suggest running some benchmarks that record how long generation takes across different values of parallelism, until the vLLM server is saturated and the SDG run time no longer decreases.
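
The benchmark suggested here could be sketched roughly as follows. This is a minimal illustration, not part of Data Designer: `sweep_parallelism`, `measure`, and the `min_gain` threshold are hypothetical names, and `measure(p)` is assumed to run one generation job at parallelism `p` and return its wall-clock duration in seconds. Candidates are assumed to be listed in ascending order.

```python
from typing import Callable


def sweep_parallelism(
    measure: Callable[[int], float],
    candidates: list[int],
    min_gain: float = 0.05,
) -> tuple[dict[int, float], int]:
    """Time a run at each parallelism value and report where gains flatten.

    `measure(p)` is a hypothetical callable that performs one generation run
    with parallelism `p` and returns its duration in seconds.
    """
    timings: dict[int, float] = {}
    best = candidates[0]
    for p in candidates:
        timings[p] = measure(p)
        if p == best:
            continue
        # Relative speed-up over the best setting found so far.
        gain = (timings[best] - timings[p]) / timings[best]
        if gain < min_gain:
            # Run time stopped improving; the server is likely saturated.
            break
        best = p
    return timings, best
```

In practice each measurement is noisy, so repeating every setting a few times and comparing medians would give a more trustworthy saturation point than a single run.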

greptile-apps

5 files reviewed, no comments

greptile-apps

5 files reviewed, 3 comments

greptile-apps · 2026-02-02T22:40:04Z

docs/index.md

+
+<div class="grid cards" markdown>
+
+-   :material-book-open-variant: **[Tutorials](notebooks/README.md)**


broken link - notebooks/README.md doesn't exist (file is at notebook_source/README.md). Check if there's a build step that creates it or update the link.

greptile-apps · 2026-02-02T22:40:05Z

docs/blog/posts/welcome.md

+
+Watch this space for technical deep dives into synthetic data generation, model customization, and more. We'll be sharing insights from our work and the broader community.
+
+In the meantime, check out our [Welcome guide](../../index.md) and [tutorial notebooks](../../notebooks/README.md) to get started with Data Designer.


broken link - ../../notebooks/README.md doesn't exist (file is at ../../notebook_source/README.md)
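
Broken relative links like this one can be caught before review with a small script. The following is a minimal sketch using only the standard library; `find_broken_links` is a hypothetical helper, and it only checks files that exist on disk, so it would also flag targets that a build step generates later (the possibility raised for notebooks/README.md).

```python
import re
from pathlib import Path

# Matches inline Markdown links such as [text](target); image links match too.
LINK_RE = re.compile(r"\[[^\]]*\]\(([^)\s]+)\)")


def find_broken_links(docs_root: Path) -> list[tuple[str, str]]:
    """Return (source file, link target) pairs whose relative target is missing."""
    broken = []
    for md_file in sorted(docs_root.rglob("*.md")):
        text = md_file.read_text(encoding="utf-8")
        for target in LINK_RE.findall(text):
            # Skip external URLs, mail links, and same-page anchors.
            if target.startswith(("http://", "https://", "mailto:", "#")):
                continue
            # Drop any #fragment before resolving the path on disk.
            path_part = target.split("#", 1)[0]
            if not (md_file.parent / path_part).exists():
                broken.append((md_file.relative_to(docs_root).as_posix(), target))
    return broken
```

Calling `find_broken_links(Path("docs"))` from the repository root would list each offending file alongside the link target that could not be resolved.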

greptile-apps · 2026-02-02T22:40:06Z

docs/blog/posts/welcome.md

@@ -0,0 +1,26 @@
+---
+date: 2026-01-22


date is in the future (2026-01-22). Should this be 2025-01-22?
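
Whether a front-matter date counts as "in the future" depends on the reference date used for the comparison. A minimal sketch, assuming the day this PR was opened (2026-01-31, per the timestamps in this thread) as the reference; `is_future` is an illustrative helper, not part of MkDocs or the blog plugin.

```python
from datetime import date


def is_future(post_date: str, reference: date) -> bool:
    """Return True if an ISO-format date string falls after the reference date."""
    return date.fromisoformat(post_date) > reference


# Assumption: the PR's opening date serves as the reference point.
pr_opened = date(2026, 1, 31)
print(is_future("2026-01-22", pr_opened))  # prints False
```

Relative to that reference, 2026-01-22 is already in the past, so the date is worth verifying against the intended publication day before changing the year.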

greptile-apps

5 files reviewed, no comments

kirit93 requested a review from a team as a code owner January 31, 2026 03:35

greptile-apps bot reviewed Jan 31, 2026

johnnygreco reviewed Jan 31, 2026

docs/index.md (comment resolved)

greptile-apps bot reviewed Jan 31, 2026

kirit93 added 2 commits February 2, 2026 10:40

docs: add tasteful emojis to new documentation pages

dbdde75

kirit93 force-pushed the doc-improvements branch from 858f5e6 to 90f3db3 Compare February 2, 2026 18:46