Skip to content
Open
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
68 commits
Select commit Hold shift + click to select a range
dc041f7
Add generation type to ModelConfig
nabinchha Nov 25, 2025
0d6b830
pass tests
nabinchha Nov 25, 2025
254fd8a
added generate_text_embeddings
nabinchha Nov 25, 2025
1126ea1
tests
nabinchha Nov 25, 2025
744bc8f
remove sensitive=True old artifact no longer needed
nabinchha Nov 25, 2025
b913f8d
Slight refactor
nabinchha Nov 26, 2025
052db7a
slight refactor
nabinchha Nov 26, 2025
5504c8d
Added embedding generator
nabinchha Nov 26, 2025
4b6f877
chunk_separator -> chunk_pattern
nabinchha Nov 26, 2025
04fc0f3
update tests
nabinchha Nov 26, 2025
26d6da1
rename for consistency
nabinchha Nov 26, 2025
6facbd2
Restructure InferenceParameters -> CompletionInferenceParameters, Bas…
nabinchha Nov 26, 2025
2c1b267
Remove purpose from consolidated kwargs
nabinchha Nov 26, 2025
4b1492b
WithModelConfiguration.inference_parameters should should be typed wi…
nabinchha Dec 2, 2025
c445caf
Type as WithModelGeneration
nabinchha Dec 2, 2025
4b8aa2b
Add image generation modality
nabinchha Dec 2, 2025
2c5933f
update return type for generate_kwargs
nabinchha Dec 3, 2025
c6c29d4
make generation_type a field of ModelConfig as opposed to a prop reso…
nabinchha Dec 3, 2025
06a724b
remove regex based chunking from embedding generator
nabinchha Dec 3, 2025
6b9733f
Merge branch 'main' into nmulepati/feat/support-embedding-and-image-g…
nabinchha Dec 23, 2025
81949e6
Merge branch 'main' into nmulepati/feat/support-embedding-and-image-g…
nabinchha Feb 3, 2026
f291033
save progress
nabinchha Feb 4, 2026
e0a4657
Merge branch 'main' into nmulepati/feat/125-support-image-generation
nabinchha Feb 5, 2026
1506ab5
Simplify to ImageInferenceParams. Persist images in create mode to disk
nabinchha Feb 6, 2026
ed9787b
support generation of multiple images
nabinchha Feb 6, 2026
7dea87a
clean up visualization
nabinchha Feb 6, 2026
31cc24e
clean up some util methods + add tests
nabinchha Feb 6, 2026
0f07f7b
Streamline integration for image generation
nabinchha Feb 6, 2026
2aae6cc
streamline generation
nabinchha Feb 7, 2026
1677f06
track images generated in usage
nabinchha Feb 9, 2026
3b4acf1
fix image usage tracking
nabinchha Feb 9, 2026
33b4211
test clean up
nabinchha Feb 9, 2026
fad791e
Small refactor for simplicity
nabinchha Feb 9, 2026
54ebcc8
update ImageInferenceParams
nabinchha Feb 9, 2026
3aad608
add example tutorial for image generation
nabinchha Feb 9, 2026
f252c37
support multi-modal context in ImageColumnConfig
nabinchha Feb 10, 2026
d6a0f2f
updated tutorial notebook
nabinchha Feb 10, 2026
f5c6cf9
organize image artifacts by column name
nabinchha Feb 10, 2026
71e2bac
address pr comments
nabinchha Feb 10, 2026
46138d8
fix license headers
nabinchha Feb 10, 2026
b187ff4
Merge branch 'main' into nmulepati/feat/125-support-image-generation
nabinchha Feb 10, 2026
deb5fc2
generate collab notebooks
nabinchha Feb 10, 2026
d11d049
move pillow to lib dep from notebook
nabinchha Feb 10, 2026
511e1f2
update uv lock"
nabinchha Feb 10, 2026
2b22df8
remove legacy flag from display_sample_record
nabinchha Feb 10, 2026
9239544
remove unnecessary override of generate kwargs
nabinchha Feb 10, 2026
3a779aa
Restore some changes not needed
nabinchha Feb 10, 2026
33b6cd9
use a specific image generation exception instead of generic ModelAPI…
nabinchha Feb 10, 2026
3a98caf
more cleanup
nabinchha Feb 10, 2026
cd39941
more tests for hf image folder upload
nabinchha Feb 10, 2026
52e023d
Fix test
nabinchha Feb 10, 2026
c53a1dc
set init=False for media_storage
nabinchha Feb 10, 2026
8f813b1
handle image url in _display_image_if_in_notebook
nabinchha Feb 10, 2026
281859b
Merge branch 'main' into nmulepati/feat/125-support-image-generation
nabinchha Feb 10, 2026
782a346
Fix path traversal vulnerability in MediaStorage subfolder handling
nabinchha Feb 10, 2026
2d7a202
Fix PIL format detection in detect_image_format
nabinchha Feb 10, 2026
5fca3a6
Fix Pydantic v2 compatibility in ArtifactStorage
nabinchha Feb 10, 2026
cf2b364
Address PR review comments
nabinchha Feb 10, 2026
5aa7e10
Use regex for base64 character validation in is_base64_image
nabinchha Feb 10, 2026
ecaeb72
move to a constant
nabinchha Feb 10, 2026
622b1c4
fix pyproject.toml
nabinchha Feb 10, 2026
400e97b
regen colab notebooks
nabinchha Feb 10, 2026
469a3d2
raise a ValueError if we fail to detect image format
nabinchha Feb 10, 2026
1e43394
Fix diffusion image gen
nabinchha Feb 11, 2026
8f6be9b
Add requests to config pyproject.toml
nabinchha Feb 11, 2026
d85ccb3
Merge branch 'main' into nmulepati/feat/125-support-image-generation
nabinchha Feb 11, 2026
87dcab1
address pr feedback from andre
nabinchha Feb 11, 2026
12cc1fe
Merge branch 'main' into nmulepati/feat/125-support-image-generation
nabinchha Feb 11, 2026
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
66 changes: 34 additions & 32 deletions docs/colab_notebooks/1-the-basics.ipynb
Original file line number Diff line number Diff line change
Expand Up @@ -2,7 +2,7 @@
"cells": [
{
"cell_type": "markdown",
"id": "c79eea7a",
"id": "96178d08",
"metadata": {},
"source": [
"# 🎨 Data Designer Tutorial: The Basics\n",
Expand All @@ -14,7 +14,7 @@
},
{
"cell_type": "markdown",
"id": "2476f160",
"id": "1d02a1d6",
"metadata": {},
"source": [
"### πŸ“¦ Import Data Designer\n",
Expand All @@ -26,7 +26,7 @@
},
{
"cell_type": "markdown",
"id": "3646f62e",
"id": "2292d817",
"metadata": {},
"source": [
"### ⚑ Colab Setup\n",
Expand All @@ -37,7 +37,7 @@
{
"cell_type": "code",
"execution_count": null,
"id": "3348e5c8",
"id": "8af621fc",
"metadata": {},
"outputs": [],
"source": [
Expand All @@ -48,7 +48,7 @@
{
"cell_type": "code",
"execution_count": null,
"id": "19cd9249",
"id": "70e6a11c",
"metadata": {},
"outputs": [],
"source": [
Expand All @@ -66,7 +66,7 @@
{
"cell_type": "code",
"execution_count": null,
"id": "5a6d13a9",
"id": "41031828",
"metadata": {},
"outputs": [],
"source": [
Expand All @@ -76,7 +76,7 @@
},
{
"cell_type": "markdown",
"id": "d445af5b",
"id": "0b480b10",
"metadata": {},
"source": [
"### βš™οΈ Initialize the Data Designer interface\n",
Expand All @@ -89,7 +89,7 @@
{
"cell_type": "code",
"execution_count": null,
"id": "4df0031d",
"id": "d434a8e2",
"metadata": {},
"outputs": [],
"source": [
Expand All @@ -98,7 +98,7 @@
},
{
"cell_type": "markdown",
"id": "0f69b576",
"id": "f88f6792",
"metadata": {},
"source": [
"### πŸŽ›οΈ Define model configurations\n",
Expand All @@ -115,7 +115,7 @@
{
"cell_type": "code",
"execution_count": null,
"id": "65d9be99",
"id": "4261574c",
"metadata": {},
"outputs": [],
"source": [
Expand Down Expand Up @@ -145,7 +145,7 @@
},
{
"cell_type": "markdown",
"id": "72582d09",
"id": "bbbc3d58",
"metadata": {},
"source": [
"### πŸ—οΈ Initialize the Data Designer Config Builder\n",
Expand All @@ -160,7 +160,7 @@
{
"cell_type": "code",
"execution_count": null,
"id": "8d7992b4",
"id": "92c0cf35",
"metadata": {},
"outputs": [],
"source": [
Expand All @@ -169,7 +169,7 @@
},
{
"cell_type": "markdown",
"id": "741a15a0",
"id": "44246c7d",
"metadata": {},
"source": [
"## 🎲 Getting started with sampler columns\n",
Expand All @@ -186,7 +186,7 @@
{
"cell_type": "code",
"execution_count": null,
"id": "c3879c70",
"id": "07d20f3f",
"metadata": {},
"outputs": [],
"source": [
Expand All @@ -195,7 +195,7 @@
},
{
"cell_type": "markdown",
"id": "1575ef81",
"id": "9d3c87b0",
"metadata": {},
"source": [
"Let's start designing our product review dataset by adding product category and subcategory columns.\n"
Expand All @@ -204,7 +204,7 @@
{
"cell_type": "code",
"execution_count": null,
"id": "87a88d7b",
"id": "c646b021",
"metadata": {},
"outputs": [],
"source": [
Expand Down Expand Up @@ -285,7 +285,7 @@
},
{
"cell_type": "markdown",
"id": "8c74b738",
"id": "ff18b032",
"metadata": {},
"source": [
"Next, let's add samplers to generate data related to the customer and their review.\n"
Expand All @@ -294,7 +294,7 @@
{
"cell_type": "code",
"execution_count": null,
"id": "4eb1da1f",
"id": "78846d99",
"metadata": {},
"outputs": [],
"source": [
Expand Down Expand Up @@ -331,7 +331,7 @@
},
{
"cell_type": "markdown",
"id": "4324d869",
"id": "97059bfc",
"metadata": {},
"source": [
"## 🦜 LLM-generated columns\n",
Expand All @@ -346,7 +346,7 @@
{
"cell_type": "code",
"execution_count": null,
"id": "1302a503",
"id": "98c66eff",
"metadata": {},
"outputs": [],
"source": [
Expand Down Expand Up @@ -382,7 +382,7 @@
},
{
"cell_type": "markdown",
"id": "7cf8241b",
"id": "ff2d52b9",
"metadata": {},
"source": [
"### πŸ” Iteration is key – preview the dataset!\n",
Expand All @@ -399,7 +399,7 @@
{
"cell_type": "code",
"execution_count": null,
"id": "6fc6cf39",
"id": "6e622478",
"metadata": {},
"outputs": [],
"source": [
Expand All @@ -409,7 +409,7 @@
{
"cell_type": "code",
"execution_count": null,
"id": "c929e068",
"id": "1addc7d8",
"metadata": {},
"outputs": [],
"source": [
Expand All @@ -420,7 +420,7 @@
{
"cell_type": "code",
"execution_count": null,
"id": "dfb04e2a",
"id": "7af4b9c3",
"metadata": {},
"outputs": [],
"source": [
Expand All @@ -430,7 +430,7 @@
},
{
"cell_type": "markdown",
"id": "adb879da",
"id": "91d0ee89",
"metadata": {},
"source": [
"### πŸ“Š Analyze the generated data\n",
Expand All @@ -443,7 +443,7 @@
{
"cell_type": "code",
"execution_count": null,
"id": "ff58dd9f",
"id": "e1e3aed0",
"metadata": {},
"outputs": [],
"source": [
Expand All @@ -453,7 +453,7 @@
},
{
"cell_type": "markdown",
"id": "57c7355d",
"id": "6eaa402e",
"metadata": {},
"source": [
"### πŸ†™ Scale up!\n",
Expand All @@ -466,7 +466,7 @@
{
"cell_type": "code",
"execution_count": null,
"id": "df49db99",
"id": "f6b148d4",
"metadata": {},
"outputs": [],
"source": [
Expand All @@ -476,7 +476,7 @@
{
"cell_type": "code",
"execution_count": null,
"id": "2bbc48dd",
"id": "f4e62e5b",
"metadata": {},
"outputs": [],
"source": [
Expand All @@ -489,7 +489,7 @@
{
"cell_type": "code",
"execution_count": null,
"id": "dc0673fa",
"id": "7d426ab0",
"metadata": {},
"outputs": [],
"source": [
Expand All @@ -501,7 +501,7 @@
},
{
"cell_type": "markdown",
"id": "7688217b",
"id": "449d003c",
"metadata": {},
"source": [
"## ⏭️ Next Steps\n",
Expand All @@ -512,7 +512,9 @@
"\n",
"- [Seeding synthetic data generation with an external dataset](https://nvidia-nemo.github.io/DataDesigner/latest/notebooks/3-seeding-with-a-dataset/)\n",
"\n",
"- [Providing images as context](https://nvidia-nemo.github.io/DataDesigner/latest/notebooks/4-providing-images-as-context/)\n"
"- [Providing images as context](https://nvidia-nemo.github.io/DataDesigner/latest/notebooks/4-providing-images-as-context/)\n",
"\n",
"- [Generating images](https://nvidia-nemo.github.io/DataDesigner/latest/notebooks/5-generating-images/)\n"
]
}
],
Expand Down
Loading