In Depth: Humanoid Robots Need a Job
Listen to the full version

This year’s World Robot Conference (WRC) in Beijing, which wrapped up on Tuesday, hosted more than 200 robotics companies that showcased over 1,500 products, including humanoid robots capable of doing all sorts of things such as running, dancing and boxing.
Behind the slick demonstrations at the five-day event was a stark reality. While these robots can mimic certain human actions, they struggle to interact with their environment in real time or perform more complicated tasks. This has confined their uses largely to simpler roles in fields like customer service, with their application in industrial settings just beginning. For them to do more will require key technological breakthroughs.

Unlock exclusive discounts with a Caixin group subscription — ideal for teams and organizations.
Subscribe to both Caixin Global and The Wall Street Journal — for the price of one.
- DIGEST HUB
- Over 200 robotics companies showcased 1,500+ products at the 2024 World Robot Conference in Beijing, including humanoids capable of running, dancing, and boxing.
- Humanoid robots face high costs and limited capabilities, restricting them mainly to simple customer service roles; significant price cuts and advances in embodied AI are needed for mass adoption.
- Despite immature technology and high production costs, investor interest and funding in embodied AI remain high, though industry consolidation may occur soon.
The 2024 World Robot Conference (WRC) in Beijing drew over 200 robotics companies showcasing more than 1,500 products, including advanced humanoid robots that can run, dance, and box. Despite these impressive demonstrations, most robots displayed face significant technical challenges and are limited to simple customer service roles, with industrial applications still in their infancy due to a lack of real-time interaction and complex task abilities. Major technological breakthroughs are needed to expand their utility. Even cost-competitive models, such as Unitree Robotics’ humanoid robots priced as low as 40,000 yuan ($5,566), remain limited in capability, and no companies have achieved profitability at current price points. More sophisticated robots for tasks like folding clothes or making coffee can cost up to 500,000 yuan, making widespread household adoption unlikely unless supply chain costs drop significantly [para. 1][para. 2][para. 3][para. 4][para. 5].
Investor interest, however, remains strong, with high hopes for embodied artificial intelligence (AI)—AI-enabled robots with physical forms—driving rapid financing and numerous firms preparing for initial public offerings. The race is fueled by the belief that embodied AI systems could transform multiple sectors, yet tremendous challenges persist [para. 6][para. 7].
A central debate at the WRC concerned the primary bottleneck in developing embodied AI. While Unitree CEO Wang Xingxing asserts that robot hardware has matured and the real constraint is lagging AI models, others in the industry disagree. Some companies invest heavily in collecting operational and simulated data to train robots, following the vision-language-action (VLA) model approach, which seeks to integrate sensory input, reasoning, and physical response. Yet, developing real-world, high-quality data at scale remains expensive and complex. X Square Robot’s CEO emphasizes both architecture and data, citing decades of trial and error favoring end-to-end, general-purpose models, while Robot Era’s founder stresses model design as the key factor, followed by data quality and hardware [para. 8][para. 9][para. 10][para. 11][para. 12][para. 13][para. 14][para. 15][para. 16].
Despite progress, embodied AI models are not sufficiently advanced to allow out-of-the-box industrial deployment, necessitating manual tuning and reliance on traditional industrial robotics technologies for many tasks. Only a fraction of robot features (e.g., advanced object recognition and dexterity) represent current embodied AI; the rest are improvements on older capabilities [para. 17].
Pricing remains a significant barrier. Even as manufacturers introduce more affordable robot models, most remain high-end products targeted at commercial buyers due to expensive computing systems and advanced components. For example, Digit Huaxia’s humanoid robot starts at 198,000 yuan. Dexterous robot hands, crucial for complex tasks, vary significantly in price and ability, with low-end models (10,000 yuan) insufficient for demanding tasks like coffee-making, necessitating upgrades to models costing 50,000 yuan or more [para. 18][para. 19][para. 20][para. 21][para. 22].
Despite limited commercial applications so far, a funding frenzy is underway, especially after notable public showcases like Unitree’s performance at the Spring Festival Gala and several high-profile funding rounds among Chinese AI developers. Much of the raised capital is directed toward massive data collection efforts, with estimates suggesting over 1 billion yuan may be needed to train a humanoid robot suitable for retail work. Local governments, such as Tianjin’s, are supporting the industry with dedicated AI data facilities [para. 23][para. 24][para. 25][para. 26][para. 27][para. 28].
Nevertheless, concerns of a speculative bubble and industry shake-out loom. Some investors warn that only companies generating substantial revenues (at least 400 million yuan) will survive the next year, with many at risk of failure as the market corrects itself [para. 29][para. 30].
- Unitree Robotics
- Unitree Robotics is a Chinese company in the humanoid robot industry. Its CEO, Wang Xingxing, believes that robot hardware is advanced enough, but embodied AI model development lags. The company has lowered the starting price for some humanoid models to around 40,000 yuan but this leads to limited capabilities. Unitree Robotics recently completed a Series C funding round.
- Shoucheng Holdings Ltd.
- Shoucheng Holdings Ltd. is a state-linked tech investment company. Kang Yu, their general manager of the board office, noted accelerated investor interest in embodied AI, with many firms nearing IPOs.
- AgiBot
- AgiBot is an embodied AI firm that has launched end-to-end VLA (vision-language-action) models. They are among several companies actively involved in the development of embodied AI, with some setting up "data factories" to collect operational or simulated environment data for training robots.
- Galbot
- Galbot is an embodied AI firm that has launched end-to-end vision-language-action (VLA) models. Some companies like Galbot are setting up "data factories" or generating simulated environment data to train robots to perform human-like tasks.
- Galaxea AI
- Galaxea AI is a company that develops embodied artificial intelligence. They are among several firms that have successfully launched end-to-end vision-language-action (VLA) models. In a recent investment spree, Galaxea AI, along with other developers, successfully raised funds.
- X Square Robot
- X Square Robot is a Chinese embodied AI company. Its CEO, Wang Qian, asserts that future general-purpose robot models will incorporate "vision," "language," and "action" (VLA models) and that end-to-end models offer the most viable technical path. X Square Robot emphasizes the importance of high-quality data and has developed its own collection equipment and processing models to address this bottleneck.
- Robot Era
- Robot Era is a Chinese humanoid robotics firm. Its founder, Chen Jianyu, views model architecture as the most critical factor for embodied AI development, followed by high-quality data, with robot hardware quality ranking third.
- AI² Robotics
- AI² Robotics is a Chinese embodied AI developer. Mo Lei, their vice president, highlighted that current embodied AI models lack sufficient power, necessitating manual tuning for humanoid robots used in factory settings and preventing out-of-the-box utility.
- Digit Huaxia
- Digit Huaxia is a Shenzhen-based robotics firm that manufactures humanoid robots. Their robot model, Xing Xing Xia, starts at 198,000 yuan. The high price is attributed to its expensive computing system, necessary for commercial use requiring human interaction.
- Botyard Intelligence
- Botyard Intelligence is a Shenzhen-based company that specializes in developing dexterous hands for robots. Its CEO, Liu Xin, notes that the price of these hands varies significantly based on their capabilities, with more advanced models designed for complex tasks costing more. Low-spec, inexpensive hands are often used for aesthetic purposes in dancing robots, while higher-priced, more capable hands are necessary for practical applications like factory work or making coffee.
- DexForce Technology
- DexForce Technology is an embodied AI developer that recently raised funds during an investment spree in the embodied AI sector in China. This funding trend highlights significant investor interest in advanced robotics despite current limitations in commercial scalability.
- Tars
- Tars is a Chinese embodied AI developer that recently raised funds, alongside other companies like DexForce Technology, Galaxea AI, and Robot Era. The article does not provide further details about Tars, such as its specific products or founding date.
- Lanchi Ventures
- Lanchi Ventures is an investment firm whose partner, Cao Wei, has expressed concerns about a potential bubble forming in the embodied artificial intelligence (AI) sector. The company plans to prioritize investments in firms with long-term development potential, anticipating that some less robust companies may not survive into the latter half of next year.
- PODCAST
- MOST POPULAR