The Mechanical Bottleneck

Epistemic status: Research synthesis with interpretation. I'm writing as someone reading the field, not someone building in it. The empirical claims are sourced. The three implications I draw are at different confidence levels, flagged where they get speculative.

The thing that will pace humanoid robot deployment over the next decade is not a chip. It is not data. It is not even labor. It is a small geared part most engineers can't name, made by a handful of Japanese suppliers, and the global supply caps humanoid production at roughly 500,000 units a year regardless of how good the underlying AI gets.

This is the central finding of Epoch AI's April 2026 piece, "How Fast Could Robot Production Scale Up?" It is, as far as I can tell, the cleanest published work on physical-AI deployment timelines. The bottleneck the piece names is precision reducers: planetary gearboxes, cycloidal/RV reducers, strain-wave gears. Tiny mechanical components made by firms most AI people couldn't list if asked.

The post draws out what that one rigorous empirical finding implies for three things that aren't getting enough attention: timeline forecasting, industrial policy, and embodied AI safety.

Capability got cheap. Deployment didn't. The bottleneck is mechanical, not silicon, and the AI discourse hasn't mapped it.

What Epoch found

The piece is a bottom-up bill-of-materials analysis across five form factors: humanoids, quadrupeds, robotic arms, wheeled robots, drones. Authors Jean-Stanislas Denain and Yann Rivière mapped global component supply against production demand, then compared the result against historical mobilization cases including WWII aircraft, Tesla Shanghai factory builds, and Ukraine FPV drone production.

The current state of humanoid production is small. About 16,000 units per year in 2025, doubling every six months. The doubling rate is anomalous, Epoch notes. Most of those units are research platforms and marketing demonstrations, not productive work. The real test is what the production curve looks like once capability arrives and somebody actually needs millions of humanoids.

Their projection, under a demand-shock scenario triggered at end of 2027, is that humanoid production reaches 1.5 to 3 million units per year by end of 2030 clearly, and 5 to 10 million plausibly. Drones could reach 100 to 200 million per year without especially hard limits. Quadrupeds 8 to 15 million. These are not the kind of numbers that move you off your timeline if you were expecting an AI rollout shaped like the smartphone wave. Smartphones hit a billion units shipped in six years from the iPhone launch. The Epoch projection puts humanoids at less than 1 percent of that volume over a similar window.

What constrains the rate is not the things the AI discourse usually worries about. Cameras come off production lines at 7 billion units per year globally. MEMS sensors at 31 billion. Bearings at tens of billions. Batteries at 2 billion kWh per year. None of these are binding. None of these sit on the critical path.

The chokepoint is precision reducers. Planetary gearboxes at roughly 3 million units per year. Cycloidal and RV reducers at roughly 2 million. Strain-wave gears at roughly 12 million, the least restrictive of the three. Humanoids use somewhere between 20 and 40 reducers each, depending on hand dexterity. Tesla Optimus, with its high-degree-of-freedom hands, could triple per-robot reducer demand. The math caps humanoid production at roughly 500,000 per year at current capacity, no matter how many factories you build.

The supply concentration matters. Harmonic Drive Systems (HDS, listed on the Tokyo Stock Exchange) and Nabtesco, both Japanese, dominate the precision-reducer market. HDS scaled its Ariake plant from 150,000 to 220,000 units per month by August 2022 to meet industrial-robot demand, with the Hotaka facility adding another 40,000 monthly. Total HDS capacity sits around 3 million units per year. Nabtesco is the dominant supplier in RV reducers used in industrial-robot joints. China has been pushing localization aggressively. Leaderdrive holds 30 to 40 percent of China's harmonic-reducer market and counts Tesla among its customers. But a Jamestown Foundation analysis notes that while 80 percent of RV reducers are now assembled in China, roughly 90 percent of the machine tools used to make them are imported, mostly from Japan.

Why mechanical precision is the binding constraint

The first-principles answer to why mechanical precision is the bottleneck rather than something else comes from looking at how each potential bottleneck scales.

Chips scale with fabrication capacity, and the world has been pouring capex into fab construction since the CHIPS Act. The marginal chip for a humanoid is not where the bottleneck lives. Batteries are similarly in a phase of aggressive capacity expansion driven by EVs. Cameras and MEMS sensors get produced at volumes orders of magnitude larger than humanoid demand could plausibly require.

Labor doesn't bottleneck because factory operations workers, which Epoch estimates at 40,000 to 120,000 for 10 million humanoids per year, can be trained in parallel during the construction phase. The construction phase itself is the timing constraint. Auto-plant retrofits run 6 to 10 months. Greenfield Chinese factory builds run 6 to 9 months. Greenfield Western builds run 2 years or more.

Precision reducers are different because the manufacturing process is itself a deep skill stack. These are gear-on-gear interfaces machined to micrometer tolerances, with proprietary metallurgy and grinding processes that took specific Japanese firms decades to refine. They are not a thing you can produce by buying CNC machines and hiring trained workers. The machine tools themselves, the precision grinders and gear-cutters used to make reducers, are largely Japanese-made. China's domestic reducer production has scaled, but the Jamestown analysis flagged that it sits on top of imported machine tools from the same source.

This is a constraint with the shape of TSMC, not the shape of Foxconn. TSMC isn't a chokepoint because chip fabrication is conceptually hard. It is a chokepoint because the tacit knowledge of running advanced fabs is concentrated in one firm. Precision reducers are similar. The capacity expansion path does not bottleneck on capital, raw materials, or labor. It bottlenecks on tacit metallurgical knowledge and on the upstream machine tools that themselves bottleneck on the same Japanese firms.

Year 1 of any demand shock, Epoch notes, is mostly construction rather than production. By end of 2028, even with aggressive scale-up, the global stock of newly-produced humanoids would be in the low hundreds of thousands.

Reducer capacity could 2x by 2028 and 4x by 2030 on baseline trajectory, or 10x to 30x under demand shock. The 10x-to-30x range is where most of the projection's uncertainty lives. It is also where the question becomes interesting: how fast can Japanese firms expand without losing the precision that made them the suppliers in the first place? That is not a question with an obvious answer.

Implication 1: Timeline forecasting

This finding reshapes timeline forecasting for AI in a specific way. The AI safety and forecasting community has done thoughtful work on capability timelines and very little on deployment.

Ajeya Cotra's "Forecasting Transformative AI with biological anchors" (Open Philanthropy, 2020-2022) is the most-cited capability timeline analysis. It anchors compute requirements to biological reference points and projects hardware-and-spending trends. Deployment is bracketed as a downstream consequence with minimal physical-world friction modeled.

Holden Karnofsky's "Most Important Century" (Cold Takes, 2021) introduces PASTA, his term for the process of automating scientific and technological advancement, and assumes that once AI can automate research, physical deployment follows quickly via AI-designed robotics. He flags this as a load-bearing assumption.

Leopold Aschenbrenner's "Situational Awareness" (June 2024) makes essentially the same move. Intelligence-explosion thesis, with physical deployment treated as a fast follow-on. The deployment treatment is the weakest part of the document.

The exception is Dario Amodei's "Machines of Loving Grace" (October 2024). Amodei explicitly addresses deployment lag, coining the phrase "limits to compressed 21st century." He estimates 5 to 10 years for biology breakthroughs to deploy through clinical trials and regulatory approval. He does not analyze robotics supply chains specifically, but he is the only frontier-lab CEO publicly grappling with the fact that capability does not equal deployment.

The Epoch piece fills a real hole in this literature. It is one of the few rigorous bottom-up deployment analyses from the AI-aware community.

What does the analysis change about timeline forecasting? Three things.

First, it sharpens the capability/deployment distinction. If you believe AI capability will arrive in the next three to five years and you're thinking about what the world looks like after that, the Epoch numbers should pull your humanoid-deployment estimates downward. Not because capability slips. Because deployment was always going to take three to five more years after capability already arrives.

Second, it gives a falsifiable reference class. Industrial robots took fifty years to reach a million deployed units, according to International Federation of Robotics data. They are now at around 3.5 million in operational stock. Humanoid bull projections, including Tesla's stated target of "millions per year" by the late 2020s and Figure's commercial deployment framing, imply collapsing that 50-year history into roughly five years. The Epoch analysis says the industry-wide ceiling under demand shock is 5 to 10 million per year by end of 2030, which would require essentially the entire global reducer supply chain to be captured by one or two players. That is not impossible, but it is the implausible case, not the central one.

Third, it explains the Rodney Brooks pattern. Brooks has maintained an annual predictions scorecard for over a decade tracking robotics and self-driving forecasts. The pattern across the field is consistent. Capability demos arrive roughly on schedule. Deployment-at-scale runs 3 to 10x slower than industry forecasts. The Epoch analysis suggests the slowdown isn't psychology or marketing. It is the supply chain. If you forecast deployment as though it tracks capability, you systematically miss by an order of magnitude.

The cleanest data point on the gap between marketing and reality: Agility Robotics' Salem, Oregon factory, with stated capacity around 10,000 humanoids per year. That is the verifiable number. Tesla and Figure imply millions. Agility shows ten thousand. The ratio is the gap between marketing and what the supply chain currently supports.

Implication 2: Industrial policy

The implications for industrial policy are sharper and probably more actionable than the timeline question. Strategic chokepoint thinking has been applied to semiconductors but not yet to reducers, despite the geographic concentration being arguably tighter.

China has been pushing this question explicitly for a decade. Made in China 2025 named speed reducers, servomotors, and controllers, which together represent about 70 percent of robot bill-of-materials by value, as indigenization priorities.

The results are mixed and the literature is honest about it. A peer-reviewed PageRank-on-patents study by Liu et al. (ScienceDirect, 2025) found that Made in China 2025 lifted midstream and downstream robotics innovation quality but failed to move upstream component quality, which is exactly the reducer and sensor and controller layer. Leaderdrive's 30 to 40 percent share of China's harmonic-reducer market is real growth, but, as noted, sits on imported Japanese machine tools.

The US response has been narrow. The most concrete recent action is the Section 232 national-security investigation of robotics and industrial machinery imports, initiated by Commerce on September 2, 2025 (Federal Register notice 2025-18749). Public comments closed October 17, 2025; tariff or import-restriction decisions plausibly land in spring 2026. This is a trade tool, not a capex subsidy. The Humanoid ROBOT Act of 2025 (S.3275) extends Section 889-style federal procurement bans to humanoids from PRC, Iran, DPRK, and Russia-linked entities. That is a procurement-restriction bill, not a capacity-building bill.

The Information Technology and Innovation Foundation published "A Time to Act: Policies to Strengthen the US Robotics Industry" in July 2025, by Robert Atkinson. The numbers are striking. Japan produces 46 percent of global robotics output. By 2024 imports outweighed exports 4 to 1. Atkinson recommends expanding NIST Manufacturing USA programs and the Advanced Robotics for Manufacturing institute, plus restrictions on Chinese robotics imports. The recommendations stop short of invoking Defense Production Act Title III authorities or proposing CHIPS-style appropriation for robotics components.

The rare-earth angle compounds this. Robots need high-density permanent magnets, specifically NdFeB with heavy rare-earth additions of terbium or dysprosium, for the servo motors that pair with reducers. China's October 2025 MOFCOM Notices No. 61 and 62 require licenses for these magnets, with extraterritorial licensing for magnets made overseas using Chinese technology starting December 1, 2025. The notices were suspended until November 10, 2026, but the regulatory framework is in place. Industrial robots use roughly 300 to 500 grams of rare-earth-doped magnets per motor, and a humanoid has 20 to 40 motors. The supply-chain exposure is non-trivial.

What would real policy that takes the bottleneck seriously look like? Probably some combination of explicit chokepoint mapping treating precision reducers and rare-earth magnets together; capex incentives for domestic reducer manufacturing, which would in turn require investing in the machine-tool supply chain that the Liu et al. analysis shows is the harder upstream problem; and allied coordination with Japan rather than competition. The current US framing of trade tools and procurement bans is closer to the early-2010s semiconductor framing than to the post-2020 framing. The conceptual upgrade has not happened yet.

Implication 3: Embodied AI safety

The safety implications are the most speculative of the three. The embodied-AI safety literature is thin, fragmented, and dominated by self-driving cars, which the broader physical-AI community treats as a sibling discipline rather than the main event.

The conceptual foundation is Stuart Russell's Human Compatible (2019) and the Center for Human-Compatible AI's work on Cooperative Inverse Reinforcement Learning (Hadfield-Menell, Russell, Abbeel, Dragan, NeurIPS 2016). Russell's domestic-robot thought experiment, the robot that cooks the cat because it wasn't told cats are loved, is the canonical intuition pump. CIRL formalizes preference inference from behavior. This is rigorous theory with no deployed safety system attached to it.

Anthropic's Responsible Scaling Policy addresses biological, chemical, radiological, nuclear, and cyber risks plus autonomy, but does not name physical AI or robotics as a capability threshold in the public versions I have seen. OpenAI shuttered its robotics team in 2021 and has no published safety research on its more recent humanoid investments. The robotic foundation model labs themselves, including Physical Intelligence (π0), Open X-Embodiment, and Figure (Helix), have published essentially no public safety research either.

The empirical safety evidence comes almost entirely from autonomous vehicles. The Cruise robotaxi incident on October 2, 2023, in San Francisco, remains the single most documented physical-AI safety failure on public record. A Cruise vehicle dragged a pedestrian roughly 20 feet after she was struck into its path by a human-driven car. The California Public Utilities Commission suspended Cruise's permit on October 24, 2023. The Quinn Emanuel independent report (January 2024), commissioned by Cruise itself, documented that the company initially showed regulators a video that cut off before the dragging. The failure was both technical, in that the vehicle's reaction model didn't handle the secondary-impact case, and organizational, in that the company concealed it from regulators. Cruise's parent eventually wound down the program.

Tesla's Autopilot has generated a parallel safety pattern: an NHTSA recall in December 2023 covering approximately 2 million vehicles, with 13 documented fatal crashes. That is a regulator-driven case rather than a single dramatic incident.

Waymo has published the most rigorous public safety case. The Waymo Safety Hub and Kusano et al.'s 2024 comparison study report approximately 25 million rider-only miles with significant claimed reductions in police-reported and injury crashes versus human-driver baselines. The comparison-class selection is Waymo's, but the methodology is the strongest in the field.

The Epoch deployment-lag analysis changes this picture in three places, ordered by how grounded the observation is.

The slow-rollout-helps-safety argument breaks in the military vertical. Paul Scharre's Army of None (2018) and ongoing CNAS work document the trajectory. The OpenAI-Anduril partnership announced in December 2024 and the Anthropic-Palantir-AWS defense deal in November 2024 are real moves into a market where the customers, state militaries, are explicitly willing to absorb unit cost premiums that would crush commercial deployment. The reducer constraint applies to commercial humanoids assuming peacetime allocation. It does not apply if a state actor commits to scaling regardless of cost. The Epoch analysis implicitly assumes a peacetime industrial pattern. This is the strongest grounded counter to the slow-rollout-helps-safety case.

Physical-AI safety research has more runway than the discourse implies, at least on the commercial side. If commercial humanoid deployment is paced by supply chains running on industrial-time, the field has five to ten years to develop, test, and deploy safety techniques before the population of deployed embodied agents is large enough that incidents become routine. The current state, in which there are zero public safety publications from any major robotic foundation model lab, is therefore the bigger problem than the slow rollout. The runway exists. The research is not filling it.

And most speculatively: there is a thesis nobody has made canonically yet, that embodied AI is where alignment debates become empirical. Robots produce real-world actions with real-world consequences in a way text generators don't. As humanoids reach the deployed scale Epoch projects, the LLM discourse's mostly-theoretical alignment debates will have physical analogues. I'm flagging the thesis because the literature is open for someone to make it well. I'm not in a position to be that someone here.

Cruxes

The Epoch analysis is a snapshot, not a final answer. Five things could change it.

Recursive self-improvement is the largest deferred question. Epoch explicitly brackets the case where AI starts designing reducer manufacturing plants or developing the metallurgical processes that currently take Japanese firms decades to refine. If that happens, the bottleneck story changes. The analysis assumes ordinary human engineers run the scale-up. That is a reasonable simplification for current conditions and a bad one for any future where the AI is itself optimizing the supply chain.

Software substituting for hardware is the second crux. Physical Intelligence demonstrated precision manipulation tasks using relatively simple grippers, with fewer actuators and less reducer demand per unit, by improving the learned policy. If foundation models can compensate for mechanical imprecision the way deep learning has compensated for hand-engineered features, the bill-of-materials math shifts. Robots could become viable at lower precision tiers, which would change which suppliers matter and what the cap actually is.

Cohort versus price effects is the third. Industrial robots took 50 years to reach a million deployed units partly because of supply constraints and partly because the unit economics did not work for most applications until well into that window. If humanoid prices follow a fast cost curve, both because the AI labor cost is in the training run rather than the marginal robot and because EV-style battery and motor economies will apply, the reference class might be wrong. The bottleneck still binds the production curve, but the demand curve and the deployment curve might decouple from it differently.

Geopolitical fragmentation is the fourth and least quantifiable. The Epoch analysis assumes global trade keeps working. A Taiwan crisis, escalating sanctions, or a serious break in the Japan-China-US technology relationship would either collapse the supply chain or fragment it in ways that produce different and unpredictable constraints. The current rare-earth export-control framework is a small preview of the kinds of moves that could matter much more.

The fifth thing the analysis does not address is demand origin. Why does a shock happen, and when? Epoch's projection runs from an EOY 2027 trigger date, which is a placeholder. The whole analysis is "conditional on a shock, how fast can production scale" rather than "when will a shock happen and what kind." That is the right scope for the question they were asking, but it means the deployment timing is really two timelines stacked, capability arrival and then production response, and the post only addresses the second.

Where this leaves me

What I am watching for over the next eighteen months: whether HDS or Nabtesco announces capacity expansion specifically tied to humanoid demand at scale; whether the Section 232 investigation produces anything more substantial than tariffs; whether the major humanoid companies publish anything resembling a safety framework; and whether anyone writes the CSIS-style brief that frames the reducer-and-magnet chokepoint as the strategic problem it appears to be.

The most useful posture an outsider can take on physical AI right now is not prediction. It is noticing which questions the discourse has not gotten to yet.

The place I am most likely to be wrong is the software-substituting-for-hardware crux. If Physical Intelligence or another lab demonstrates production-grade manipulation at lower reducer counts than current humanoid architectures assume, the bottleneck story shifts and most of the post's three implications shift with it. I'd want to hear from anyone watching that closely.