Published Reproduction Package
The corpus is fixed by seed 20260619. The download package contains the CSV cases, a machine-readable summary, and a ZIP of the Python generator. The generator uses only Python standard-library modules for case generation and the heuristic baselines.
Results
Many public container-loading calculators expose only one answer for one manually entered case. This report publishes the case generator and the full input corpus, so the result is not a single showcase example.
Each case loads a single carton type into a fixed 40HQ-style interior space of 12032 x 2352 x 2698 mm (length x width x height). All six axis-aligned carton orientations are allowed. SmartPacker T1 is compared with transparent baselines that a reader can inspect and rerun.
| Comparison | Wins | Ties | Losses | Average gain |
|---|---|---|---|---|
| T1 depth 3 vs one-cut heuristic | 7,961 | 2,039 | 0 | +2.633% cartons |
| T1 depth 3 vs single-grid baseline | 9,109 | 891 | 0 | +9.784% cartons |
| T1 depth 3 vs T1 depth 2 | 5,301 | 4,699 | 0 | +0.453% cartons |
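The single-grid baseline is named in the table but not defined in this section. A minimal sketch, assuming it means "pick the one axis-aligned orientation whose uniform grid holds the most cartons", could look like the following. The function name `single_grid_count` is illustrative and is not taken from the published generator.

```python
from itertools import permutations

# Interior length, width, height in mm, as stated in this report.
CONTAINER_MM = (12032, 2352, 2698)


def single_grid_count(carton, container=CONTAINER_MM):
    """Best carton count from one uniform grid over all axis-aligned orientations.

    Assumption: this is one plausible reading of "single-grid baseline";
    the published generator may define it differently.
    """
    best = 0
    # set() drops duplicate orientations when two carton sides are equal.
    for dims in set(permutations(carton)):
        count = 1
        for room, side in zip(container, dims):
            count *= room // side  # whole cartons that fit along this axis
        best = max(best, count)
    return best


print(single_grid_count((600, 400, 350)))  # 720
```

For the 600 x 400 x 350 mm carton used in the public spot check, two orientations tie at 720 cartons under this reading, which leaves visible unused space along the width and height.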
- Median fill rate: 95.953% across the 10,000 depth-3 T1 results.
- Best transparent baseline: the one-cut baseline tries a single guillotine cut that splits the space into two slabs, then fills each slab with its own grid.
- Volume bound: the CSV includes a raw volume upper bound (container volume divided by carton volume, rounded down), but that count is not always geometrically reachable.
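The one-cut idea can be sketched as follows. This is an illustration under stated assumptions, not the published generator's code: it assumes the cut is perpendicular to one axis, that the first slab holds whole layers of one orientation, and that the remainder is filled with its own best uniform grid. All function names are hypothetical.

```python
from itertools import permutations


def grid_count(dims, space):
    """Cartons of oriented size `dims` that fit in `space` as one uniform grid."""
    count = 1
    for side, room in zip(dims, space):
        count *= room // side
    return count


def single_grid_count(carton, space):
    """Best uniform grid over the distinct axis-aligned orientations."""
    return max(grid_count(dims, space) for dims in set(permutations(carton)))


def one_cut_count(carton, space):
    """Best total from one guillotine cut (assumed reading of the baseline)."""
    best = single_grid_count(carton, space)  # the no-cut case
    for axis in range(3):
        for dims in set(permutations(carton)):
            side = dims[axis]
            for layers in range(1, space[axis] // side + 1):
                slab = list(space)
                rest = list(space)
                slab[axis] = layers * side               # slab of whole layers
                rest[axis] = space[axis] - layers * side  # what the cut leaves
                total = grid_count(dims, slab) + single_grid_count(carton, rest)
                best = max(best, total)
    return best


container = (12032, 2352, 2698)
carton = (600, 400, 350)
print(single_grid_count(carton, container), one_cut_count(carton, container))
```

Because the no-cut case is included, this sketch can never score below the single grid, and because both regions are disjoint boxes inside the container, it can never exceed the raw volume bound.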
Public App Spot Checks
We also ran a dated, no-login public-app spot check on June 19, 2026. The exact test case was a 12032 x 2352 x 2698 mm loading space with 600 x 400 x 350 mm cartons, allowing all six orientations where the public app exposed orientation controls. This is a small public verification sample, not a vendor-neutral leaderboard.
| Tool | Access | Dimension match | Reported result | How to read it |
|---|---|---|---|---|
| SmartPacker public MCP | No account | Exact | 876 cartons, 96.4% | Reference result from the public depth-2 live-layout service. |
| Pier2Pier 3D Load Calculator | No account | Exact | 820 cartons, 90.2% | Strict public competitor spot check. SmartPacker loaded 56 more cartons, about 6.8% more than Pier2Pier on this case. |
| SeaRates Load Calculator | No account | Built-in 40' High Cube, not exact custom dimensions | 770 cartons, 84% | Useful public-app datapoint, but not a strict exact-dimension comparison. |
| CargoesPi Container Loading Calculator | No account | Exact entered dimensions | 920 cartons, 101.2% | Not used as a valid competitor score because it exceeds the 908-carton raw volume upper bound. |
| EasyCargo | Sign-up / free-trial gate | Not run | No public no-login result | Requires account access before comparable calculation. |
| Goodloading | Account / demo gate | Not run | No public no-login result | Requires account or demo access before comparable calculation. |
| CargoLoader3D | Login gate | Not run | No public no-login result | Requires login before comparable calculation. |
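The fill percentages and the 908-carton bound in the table can be rechecked with a few lines of arithmetic. The sketch below uses only the dimensions and counts reported above; SeaRates is left out because it ran on a built-in container rather than the exact dimensions. The helper name `fill_percent` is illustrative.

```python
from math import prod

CONTAINER_MM = (12032, 2352, 2698)  # loading space from the spot check
CARTON_MM = (600, 400, 350)         # carton from the spot check

# Exact-dimension results copied from the table above.
reported = {
    "SmartPacker public MCP": 876,
    "Pier2Pier": 820,
    "CargoesPi": 920,
}

# Raw volume upper bound: container volume over carton volume, rounded down.
bound = prod(CONTAINER_MM) // prod(CARTON_MM)


def fill_percent(count):
    """Share of the container volume occupied by `count` cartons, in percent."""
    return 100 * count * prod(CARTON_MM) / prod(CONTAINER_MM)


print("volume bound:", bound)  # volume bound: 908
for tool, count in reported.items():
    status = "exceeds volume bound" if count > bound else "within volume bound"
    print(f"{tool}: {count} cartons, {fill_percent(count):.1f}% fill, {status}")
```

A count above the bound implies more carton volume than the container holds, which is why the 920-carton result is excluded rather than scored.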
Depth 3 Release Policy
This report uses local count-only SmartPacker T1 depth-3 batch results. It is not a claim that the public MCP placed-layout endpoint runs depth 3.
The public MCP service remains a fast depth-2 live-layout service because it must return placed cartons and a 3D image within the public timeout. Depth 3 will be released in the downloadable T1 desktop app and a local CLI, so anyone can run the benchmark on their own PC and inspect generated loading plans locally without adding load to the public server.
Reproduce
To regenerate the same cases and heuristic baselines from the extracted script:
python container_loading_benchmark_generator.py --out-dir . --skip-packcli
After the local T1 CLI package is released, run the same generator with the CLI path to add SmartPacker T1 counts:
python container_loading_benchmark_generator.py --out-dir . --packcli path\to\PackCli.exe
The current published CSV already includes the depth-0 through depth-3 T1 count columns from the internal June 19, 2026 run.
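Once the CSV is available, a win/tie/loss tally can be computed with the standard library. The sketch below is hedged on two points: the column names `one_cut_count` and `t1_depth3_count` are hypothetical (the published header may differ), and "average gain" is assumed to be the mean of per-case percentage gains. The inline sample rows are made up for illustration and are not benchmark data.

```python
import csv
import io

# Made-up sample with HYPOTHETICAL column names; replace with the real CSV.
SAMPLE = """case_id,one_cut_count,t1_depth3_count
1,700,720
2,500,500
3,96,100
"""


def tally(rows, base_col, t1_col):
    """Return (wins, ties, losses, mean percentage gain) of T1 over a baseline."""
    wins = ties = losses = 0
    gains = []
    for row in rows:
        base, t1 = int(row[base_col]), int(row[t1_col])
        if t1 > base:
            wins += 1
        elif t1 == base:
            ties += 1
        else:
            losses += 1
        if base > 0:  # skip percentage gain when the baseline loaded nothing
            gains.append(100 * (t1 - base) / base)
    mean_gain = sum(gains) / len(gains) if gains else 0.0
    return wins, ties, losses, mean_gain


rows = list(csv.DictReader(io.StringIO(SAMPLE)))
wins, ties, losses, mean_gain = tally(rows, "one_cut_count", "t1_depth3_count")
print(wins, ties, losses, f"{mean_gain:.3f}")  # 2 1 0 2.341
```

To run it on the published file, swap `io.StringIO(SAMPLE)` for an opened file handle and adjust the two column names to match the actual header.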
By Stratum
The per-stratum rows compare T1 depth 3 with the one-cut heuristic: their wins and ties sum to the 7,961 and 2,039 reported in the overall comparison.
| Stratum | Cases | Wins | Ties | Losses | Average gain | Average T1 fill |
|---|---|---|---|---|---|---|
| small_dense | 1,000 | 964 | 36 | 0 | +1.743% | 98.947% |
| medium_carton | 1,000 | 874 | 126 | 0 | +2.615% | 95.908% |
| large_carton | 1,000 | 600 | 400 | 0 | +3.508% | 85.887% |
| flat_panel | 1,000 | 909 | 91 | 0 | +3.295% | 94.126% |
| tall_column | 1,000 | 841 | 159 | 0 | +3.224% | 93.922% |
| long_case | 1,000 | 483 | 517 | 0 | +1.713% | 86.458% |
| near_divisor | 1,000 | 773 | 227 | 0 | +2.063% | 96.413% |
| awkward_remainder | 1,000 | 861 | 139 | 0 | +3.053% | 94.837% |
| mixed_scale | 1,000 | 775 | 225 | 0 | +2.525% | 92.455% |
| realistic_carton | 1,000 | 881 | 119 | 0 | +2.593% | 96.045% |
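As a consistency check, the stratum rows can be summed and compared with the overall comparison table. The sketch below copies the figures from the table above and assumes nothing beyond equal stratum sizes, which the Cases column confirms.

```python
# (stratum, wins, ties, losses, average gain in %) copied from the table above.
ROWS = [
    ("small_dense", 964, 36, 0, 1.743),
    ("medium_carton", 874, 126, 0, 2.615),
    ("large_carton", 600, 400, 0, 3.508),
    ("flat_panel", 909, 91, 0, 3.295),
    ("tall_column", 841, 159, 0, 3.224),
    ("long_case", 483, 517, 0, 1.713),
    ("near_divisor", 773, 227, 0, 2.063),
    ("awkward_remainder", 861, 139, 0, 3.053),
    ("mixed_scale", 775, 225, 0, 2.525),
    ("realistic_carton", 881, 119, 0, 2.593),
]
CASES_PER_STRATUM = 1000

# Every row should account for all of its cases.
rows_complete = all(w + t + l == CASES_PER_STRATUM for _, w, t, l, _ in ROWS)

total_wins = sum(r[1] for r in ROWS)
total_ties = sum(r[2] for r in ROWS)
total_losses = sum(r[3] for r in ROWS)
# Strata are equal in size, so a plain mean of the stratum gains is valid.
mean_gain = sum(r[4] for r in ROWS) / len(ROWS)

print(rows_complete, total_wins, total_ties, total_losses, round(mean_gain, 3))
# True 7961 2039 0 2.633
```

The totals reproduce the T1 depth 3 vs one-cut heuristic row of the overall table (7,961 wins, 2,039 ties, 0 losses, +2.633%).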
Scope and Caveats
- This is a single-SKU benchmark. It does not test mixed-SKU loading, pallet rules, fragile cargo, axle weight, loading sequence, or manual edit workflows.
- The transparent baselines are published for reproducibility. They are not a substitute for every commercial product's private algorithm.
- The named public-app results above are single-case spot checks. Bulk automated testing of competitor web apps can be fragile and may violate account or rate-limit rules.
- Future report revisions should add downloadable desktop-app and CLI verification instructions once the public local package is upgraded.
Next Step
The next product step is to upgrade the downloadable T1 desktop app and add a local CLI package. That release will make depth-3 verification self-contained: run the CLI for the benchmark count, then open the same case in the local app to inspect the loading plan.
Questions and comments: zhzx@zhihuo.com.