Caixin

In Depth: Humanoid Robots Need a Job

Published: Aug. 15, 2025  7:14 p.m.  GMT+8
00:00
00:00/00:00
Listen to this article 1x
Humanoid robots sit at the World Robot Conference in Beijing on Aug. 8. Photo: Liu Peilin/Caixin
Humanoid robots sit at the World Robot Conference in Beijing on Aug. 8. Photo: Liu Peilin/Caixin

This year’s World Robot Conference (WRC) in Beijing, which wrapped up on Tuesday, hosted more than 200 robotics companies that showcased over 1,500 products, including humanoid robots capable of doing all sorts of things such as running, dancing and boxing.

Behind the slick demonstrations at the five-day event was a stark reality. While these robots can mimic certain human actions, they struggle to interact with their environment in real time or perform more complicated tasks. This has confined their uses largely to simpler roles in fields like customer service, with their application in industrial settings just beginning. For them to do more will require key technological breakthroughs.

loadingImg
You've accessed an article available only to subscribers
VIEW OPTIONS

Unlock exclusive discounts with a Caixin group subscription — ideal for teams and organizations.

Subscribe to both Caixin Global and The Wall Street Journal — for the price of one.

Share this article
Open WeChat and scan the QR code
DIGEST HUB
Digest Hub Back
Explore the story in 30 seconds
  • Over 200 companies at the 2024 Beijing World Robot Conference showcased 1,500+ products, but humanoid robots remain limited to simple tasks due to technological and data challenges.
  • Humanoid robots are expensive, with prices from tens to hundreds of thousands of yuan; advanced dexterous hands significantly increase costs.
  • Despite high costs and limited commercial deployment, investment and interest in embodied AI and humanoid robotics remain strong, fueling a competitive but risky funding environment.
AI generated, for reference only
Explore the story in 3 minutes

The 2024 World Robot Conference (WRC) in Beijing recently concluded, featuring participation from over 200 robotics companies and showcasing more than 1,500 products, including humanoid robots capable of running, dancing, and boxing. Despite these impressive demonstrations, significant technological limitations remain. These humanoid robots can mimic human actions but still struggle to interact dynamically with their environments or perform complex tasks, restricting their practical use mainly to simpler roles like customer service, while broader industrial adoption is just beginning. The advancement of more useful robots will require major breakthroughs in technology [para. 1][para. 2].

According to industry insiders, commercial viability remains an issue, with profitability for even the supposedly market-ready wheeled humanoid robots being distant. Some attempts to make robots more affordable, such as Unitree Robotics pricing models as low as 40,000 yuan ($5,566), mean compromises in capability. At WRC, robot prices ranged from tens to hundreds of thousands of yuan, with more advanced robots (capable of folding clothes or making coffee) costing up to 500,000 yuan. For household ubiquity, costs across the entire supply chain must drop considerably [para. 3][para. 4][para. 5].

Investor enthusiasm, however, persists. Kang Yu from Shoucheng Holdings noted the speed with which embodied AI financing is proceeding, with several firms eyeing IPOs. Embodied AI, which gives intelligent machines the ability to interact physically with the real world, is a key focus [para. 6][para. 7].

A significant debate exists over the main bottleneck for embodied AI: model architecture versus the quality and quantity of training data. Unitree Robotics CEO Wang Xingxing argues that, while hardware is sufficiently advanced, AI models lag behind large language models and data accumulation strategies that worked in language AI are less effective here. Others, like X Square Robot and Robot Era, believe VLA (vision-language-action) models or a mixture of models and high-quality real-world data are critical to progress, with each emphasizing different aspects as the core challenge [para. 8][para. 9][para. 10][para. 11][para. 12][para. 13][para. 14].

Further, production and hardware costs present formidable obstacles. High-end consumer pricing persists due to the need for powerful computing systems and sophisticated components like dexterous hands. For example, Digit Huaxia’s humanoid robot Xing Xing Xia starts at 198,000 yuan, justified by the need for complex interaction. More advanced dexterous hands, which can handle intricate tasks, can cost from 10,000 yuan for basic models to 50,000 yuan or more for models suitable for commercial applications like coffee-making. As tasks grow in complexity, hardware costs climb proportionately [para. 17][para. 18][para. 19][para. 20][para. 21][para. 22].

Despite scalability challenges, funding for embodied AI is surging. Investor interest was boosted by a high-profile dancing robot performance during the Chinese Spring Festival Gala and substantial fundraising rounds among key Chinese robotics startups. Most investment is directed toward data collection—costs for comprehensive data gathering centers can easily reach into hundreds of millions of yuan. Local governments, such as Tianjin, are supporting the industry, as seen with new AI data facilities [para. 24][para. 25][para. 26][para. 27][para. 28][para. 29].

Some voices caution that a financial bubble is forming, suggesting that only firms generating substantial annual revenue (at least 400 million yuan) will survive the coming consolidation. This signals an impending shakeout where only companies with sustainable long-term potential may persist [para. 30][para. 31].

AI generated, for reference only
Who’s Who
Unitree Robotics
Unitree Robotics is a Chinese company that participated in this year’s World Robot Conference. They have lowered the starting price of at least one of their humanoid robot models to around 40,000 yuan ($5,566). The CEO, Wang Xingxing, believes robot hardware is advanced enough and the bottleneck is embodied AI models, not data accumulation. The company recently completed a Series C funding round in June, following a high-profile appearance at the Spring Festival Gala.
AgiBot
AgiBot is an embodied AI firm that has launched end-to-end vision-language-action (VLA) models. They are among the companies that have embraced the use of "data factories" to collect data for training robots, sometimes by employing workers to operate robots or wear specialized gear. They also generate simulated environmental data to train robots for human-like tasks.
Galbot
Galbot (星动纪元) is one of several embodied AI firms mentioned in the article. Along with AgiBot and Galaxea AI, Galbot has launched end-to-end vision-language-action (VLA) models, signifying their involvement in developing advanced AI for robots that can interact with the physical world.
Galaxea AI
Galaxea AI is one of several embodied AI firms that have launched end-to-end VLA (vision-language-action) models in the past two years. Recent reports indicate that Galaxea AI, alongside other developers like DexForce Technology and Tars, successfully raised funds last month, contributing to a "funding frenzy" in the embodied AI sector in China.
X Square Robot
X Square Robot is a Chinese embodied AI company. Its CEO, Wang Qian, believes that future general-purpose robot models will incorporate "vision," "language," and "action," advocating for VLA models. He stresses the critical importance of high-quality data for technological progress, with X Square Robot developing its own data collection and processing methods to overcome this bottleneck.
Robot Era
Robot Era is a Chinese humanoid robotics firm founded by Chen Jianyu. The company recently raised funds during an investment spree in embodied AI in China. Chen believes that model architecture is the top concern for embodied AI, followed by high-quality data, with robot hardware quality being the least critical factor.
AI² Robotics
AI² Robotics is a Chinese embodied AI developer. Mo Lei, their vice president, suggested that current embodied AI models lack sufficient power. This makes humanoid robots difficult to use "out-of-the-box" and necessitates manual tuning for factory applications.
DexForce Technology
DexForce Technology recently raised funds, as part of a surge of investment in embodied AI developers in China. They are among several companies that have secured funding, indicating a high level of investor interest in the field despite current limitations in humanoid robot capabilities.
Tars
Tars is one of several embodied AI firms, including DexForce Technology, Galaxea AI, and Robot Era, that recently raised funds in a significant investment spree in China's embodied AI sector. These companies primarily aim to use the capital to gather data for training their AI models.
Digit Huaxia
Digit Huaxia (狗尾草智能科技) is a Shenzhen-based robotics company. They offer a humanoid robot named Xing Xing Xia, priced at 198,000 yuan. This higher price is attributed to its expensive computing system, which powers its "brain" for commercial applications requiring human interaction.
Botyard Intelligence
Liu Xin is the CEO of Botyard Intelligence, a Shenzhen-based company specializing in dexterous hands for robots. He explains that while lower-spec dexterous hands are inexpensive and sufficient for display purposes like dancing, they generally aren't capable enough for complex tasks required in industrial settings or daily activities like making coffee.
AI generated, for reference only
Subscribe to unlock Digest Hub
SUBSCRIBE NOW
NEWSLETTERS
Get our CX Daily, weekly Must-Read and China Green Bulletin newsletters delivered free to your inbox, bringing you China's top headlines.

We ‘ve added you to our subscriber list.

Manage subscription
PODCAST
Caixin Deep Dive: Why Singapore Sovereign Fund Sues Chinese EV-Maker Nio
00:00
00:00/00:00