AI & Digital Manufacturing Analytics worked example

AI Training Data Balance with production lots represented of 120 lots: a worked example

Push production lots represented up to 120 lots and the picture changes. This example computes every intermediate figure at that operating point. a data scientist needs to estimate training samples across lots and process runs

The inputs for this scenario

  • Production lots represented: 120 lots (raised for this scenario; the documented default is 48)
  • Runs captured per lot: 6 runs/lot (unchanged)
  • Labeled samples per run: 35 samples/run (unchanged)

Working through the calculation

  • Applying the documented formula (Total AI training samples = production lots represented × runs captured per lot × labeled samples per run) to the inputs above produces each figure below.
  • At this operating point the engine returns 25,200 samples for total ai training samples, the number this scenario is built around.
  • At this operating point the engine returns 1,260 hr for estimated labeling review hours.
  • At this operating point the engine returns 6 lots for production lots represented.
  • At this operating point the engine returns 35 samples/run for labeled samples per run.

How this compares with the baseline

  • Against the tool's baseline example, where production lots represented sits at 48 lots and the headline result is 10,080 samples, this scenario comes in 150% above the baseline at 25,200 samples.
  • It multiplies production lots, runs captured per lot, and labeled samples per run to give total training samples, then derives an estimated labeling review workload in hours. The value of this scenario is the size of the gap it exposes: that gap, priced out over a year, is the budget you can justify spending to close it.

Results at a glance

  • Total AI training samples: 25,200 samples (headline result)
  • Estimated labeling review hours: 1,260 hr
  • Production lots represented: 6 lots
  • Labeled samples per run: 35 samples/run

Run it with your numbers

  • Every input above is editable in the live AI Training Data Balance calculator, which recalculates instantly and can be shared with the inputs intact.

Last reviewed 2026-05-12.