Article | Sleepy, Strange Thinking
In December 2025, the long-rumored “Bean Phone” finally made its debut. It packed the Bean Phone Assistant technology preview into the Nubia M153 engineering prototype, with a launch price of 3499 yuan. The first batch of about 30,000 units sold out on the day of release. I remember that in the early days of its release, its price at the seafood market surged several times. The Beating Newsroom even bought two units.
The reason was not that it was a very usable phone. On the contrary, the first-generation Bean Phone, as a “technology preview version,” did not offer a good user experience. What excited us was that, for the first time, it pulled AI out of the chat box and transformed it from a chatbot into an AI Agent that could control a phone.
On the Bean Phone, AI could see the screen, understand the content you were browsing, hear you speak, switch between different apps, and directly help you with many tasks, such as checking tickets, price comparisons while shopping, coupon redemption, and photo editing. Although for sensitive tasks like payments, the user still needed to confirm, it could independently complete many operations that we used to click through one by one in the past.
Although it was still a bit clumsy, sometimes slow to respond, and would freeze at times, like someone who just learned to use a smartphone, it indeed allowed us to intuitively feel how convenient AI could be in daily life. Later, the Lobster was born and became a global sensation. The AI Agent became another iPhone moment in the AI field after ChatGPT was introduced, and a bunch of manufacturers and entrepreneurs began selling computers and phones preloaded with OpenClaw. The Bean Phone was ahead of them by at least one version, and it could even be said that the Bean Phone was a pioneer in this wave of Agent craze.
Unfortunately, the Bean Phone soon faced siege from major companies. WeChat, Taobao, Alipay, banking apps, and other scenarios began to block access or operations. Some said it was a “ban,” while others said it only triggered risk control, but for users, it made no difference – they just couldn’t use it anymore.
We are very regretful. The Bean Phone was certainly not a mature consumer electronics product, but it showed the industry a glimpse of the next-generation gateway. So even though the wave of excitement around the Bean Phone has passed, we still haven’t completely let go of this matter. Until recently, our daily information collection captured thousands of job postings, and analysis revealed that Byte seemed to be restarting its phone development.
Three Dimensions, One Clue
We crawled three dimensions from ByteDance’s official job page, namely AI Innovation Business, Mobile OS, and Douyin Mobile Assistant. After deduplication based on job ID, we further crawled the details, cross-referencing the job title, job description, and key requirements for keywords.
Unlike the recruitment of a regular AI App team, in this batch of ByteDance’s job openings, positions related to mobile systems, camera, touch, connectivity, battery life, heat dissipation, chip adaptation, structural design, overall device process, and production line testing also appeared. These terms are uncommon in internet companies; they are things that only mobile phone manufacturers, supply chain companies, and engineering teams deal with every day. ByteDance is hiring for factory-related positions. However, this does not necessarily mean ByteDance will develop its own mobile phone brand, but it does confirm that they are reinitiating the R&D work on mobile-level terminals.
Douyin Mobile Assistant: From Answering Questions to Performing Tasks for You
Let’s start with the Douyin Mobile Assistant. We conducted a more focused screening, searching for positions where the term “Douyin Mobile Assistant” appeared in the original data in the name, description, and requirements. In total, we found 83 such positions, which can be divided into three categories, forming the shape of a system-level AI Agent.
The first category of positions is responsible for enabling AI to act as an Agent. For example, the job posting for “Agent Development Engineer – Douyin Mobile Assistant” mentions the need to enable AI to perform task decomposition, context organization, tool invocation, memory retrieval, state management, result verification, and error recovery. These are the basic capabilities of all AI Agents we currently use.
The second category of positions is responsible for giving AI Agents a good memory. Positions appear for “perception and memory,” “user memory,” “personal knowledge graph,” and “long-term preferences.” If we want AI Agents to truly integrate into our lives, we cannot have them treat us as strangers every day; they need reliable and stable long-term memory. Of course, this easily touches upon privacy and boundary issues, but from the recruitment materials, ByteDance has at least started to consider “memory” as one of the most important capabilities of the Bean Coin mobile assistant.
The third category of positions is responsible for enabling the AI Agent to unleash those capabilities on the phone. If the Bean Coin mobile assistant is to operate the phone on behalf of the user, it cannot merely exist in the cloud, nor can it be just an app. It needs to have a full set of capabilities, including models, memory, task execution, edge deployment, system applications, audio and video, communication, testing, and quality assurance, in order to understand user speech, comprehend the environment, collaborate across devices, be always ready, and not cause trouble.
Mobile OS: The Real Challenge for the Agent Lies in the Phone’s Underlying System
Let’s look at the mobile OS. There are 236 positions related to the mobile OS, mainly based in Beijing, Shanghai, and Shenzhen. In the job descriptions, the recurring terms include kernel, chip, driver, camera, display, audio, network, power consumption, heat management, and mass production delivery. These are all terms that are closer to hardware and the underlying system of the phone.
As an example, the responsibilities of the “Kernel Leader – Mobile OS” position state that the individual must lead the memory and storage team in adapting and developing the kernel for a new Qualcomm platform, ensuring the system can cooperate with mainstream mobile chips and manage the memory and storage in the phone effectively. These capabilities are crucial for an AI Agent to achieve real-time responsiveness and handle tasks in the background.
Furthermore, terms such as SoC, BSP, and RTOS appear in the job descriptions. SoC can be roughly understood as the core chip of the phone, BSP is a set of underlying software that allows the system to communicate and cooperate with the hardware, and RTOS is often used in scenarios with high responsiveness and power consumption requirements. Therefore, the signal released by the mobile OS positions is that ByteDance is recruiting individuals who understand the mobile-level end system. They must at least know where the AI Agent running on the phone might encounter permission issues, power consumption challenges, system stability issues, and which problems need to be solved together with the chip, manufacturer, and testing team.
Location: Shenzhen – Signals of Hardware and Mass Production
It is also necessary to separately highlight those positions located in Shenzhen. If a position in Beijing leans more towards models, algorithms, and platforms, and a position in Shanghai leans more towards product and engineering, then a position in Shenzhen is often related to hardware, the supply chain, testing, and mass production. For a project that only involves cloud services, Shenzhen is not as crucial; but once it involves physical products, Shenzhen becomes very important.
What we see in relevant positions in Shenzhen are exactly these things. Some positions are titled Human-Computer Interaction Design, covering hardware physical interaction, software interface interaction, and multi-end interactive experience. These positions not only consider how to design the interfaces on the screen but also the feel of the physical device, buttons, how to wake it up, and how to interact with other devices. Then there are positions closer to the engineering site, such as interconnection, power consumption, short-range communication, baseband, whole machine process, structure, and test process. These terms are not as catchy as “intelligent agent,” “multimodal,” and “world model.” However, in the end, consumer electronics are determined by these things.
ByteDance Can’t Just Be an App
In the past, the phone was the container for apps; in the AI era, the phone might become the body of an agent. If the phone is just a container for apps, then a company like ByteDance can use content, algorithms, and product strength to build its kingdom through individual apps. But if the phone becomes the body of an agent, the user first issues a task, and whoever can take on the task will have the opportunity to decide the next steps.
In this scenario, apps will be downgraded to callable tools. This will make all super apps uncomfortable because agents naturally bypass the middleman. Therefore, the real challenge may not be whether Douyin can open an app, but whether others are willing to let it open. And an AI that can make decisions for users cannot be easily granted access like a regular app.
For an agent to move from the chatbox to the action layer, it must deal with a bunch of dirty work that used to be outside the AI team’s scope. They need to know when the system will kill the background process, when an operation will trigger risk control, why the phone is overheating, why the factory’s yield rate is not improving. These were things that the AI team used to not worry about, but now they are unavoidable. So ByteDance is recruiting for these positions. It may not necessarily launch a phone, but ByteDance definitely cannot just be an app in someone else’s phone anymore.
For a major tech company to become the next-generation user gateway, it cannot always rely on someone else’s operating system.
[BlockBeats]
ByteDance’s Mobile AI Agent Revival: Implications for Crypto’s Next Frontier
ByteDance’s apparent reboot of mobile development efforts, evidenced by extensive hiring across AI, mobile OS, and hardware domains, represents a significant strategic shift that could reshape the digital landscape. For crypto investors, this development signals several emerging opportunities and risks that extend far beyond traditional mobile ecosystems.
The Strategic Imperative Behind ByteDance’s Move
ByteDance isn’t simply developing another smartphone; they’re positioning themselves to control the next-generation user interface—the AI Agent that operates within the device itself. The “Bean Phone” concept, despite its initial limitations, demonstrated that AI could transcend chatbot functionality to become an active agent capable of understanding screen content, navigating apps, and performing tasks on behalf of users.
The critical insight here is that ByteDance recognizes the fundamental shift from “phones as app containers” to “phones as agent bodies.” In this new paradigm, users issue tasks rather than opening apps, creating a direct challenge to the walled gardens of platforms like WeChat, Taobao, and Alipay. For crypto investors, this signals the potential disruption of traditional digital gatekeepers—a dynamic that has historically benefited decentralized alternatives.
Crypto Market Implications and Investment Angles
1. AI Agent + Blockchain Convergence Projects
The most direct beneficiaries of this trend will be projects successfully integrating blockchain with sophisticated AI Agents. While ByteDance’s approach remains centralized, the underlying technology creates a template for decentralized alternatives. Projects that can offer:
- User-controlled AI agents with verifiable execution on-chain
- Decentralized marketplaces for specialized AI Agent services
- Tokenized AI capabilities that can be composed and monetized
Stand out particularly well. The AI Agent market is evolving from simple chat interfaces to task-oriented systems that perform actions, creating natural use cases for blockchain verification and micropayment systems.
2. Privacy and Data Sovereignty Solutions
As AI Agents gain deeper system access and more comprehensive memory capabilities—explicitly mentioned in ByteDance’s hiring for “user memory” and “personal knowledge graph”—the privacy implications become profound. This creates tailwinds for:
- Zero-knowledge proof systems that enable AI functionality without raw data exposure
- Self-sovereign identity solutions that give users control over their digital personas
- Decentralized data marketplaces where users can tokenize access to their personal data
ByteDance’s centralized approach inevitably faces trade-offs between functionality and privacy, creating opportunities for crypto-native solutions that don’t require such compromises.
3. Decentralized Compute and DePIN Ecosystems
The computational demands of sophisticated AI Agents—particularly those requiring real-time responsiveness and edge deployment—highlight the limitations of centralized cloud infrastructure. This favors:
- Decentralized compute networks like Akash (AKT) and Render (RNDR) that can offer specialized AI inference capabilities
- DePIN (Decentralized Physical Infrastructure Networks) that manage edge computing nodes
- Orchestration protocols that can distribute AI workloads across heterogeneous networks
The move toward edge AI—where processing happens closer to the device—aligns perfectly with the decentralized compute thesis, particularly for latency-sensitive applications.
4. Tokenized Economies for AI Services
As AI Agents become capable of performing complex tasks, the need for efficient microtransaction systems becomes paramount. Traditional payment rails are ill-suited for the high-volume, low-value transactions that characterize many AI Agent use cases. Crypto-native solutions offer:
- Programmable micropayments for AI service consumption
- Reputation systems built on-chain to establish trust in AI capabilities
- Incentive mechanisms for users to contribute to AI training or model refinement
The Douyin Mobile Assistant’s focus on “task decomposition, context organization, tool invocation, memory retrieval, and state management” suggests a complex operational stack that could benefit from token-based coordination.
Risks and Challenges
Despite the opportunities, several risks deserve attention:
-
Centralization Momentum: Tech giants like ByteDance have significant advantages in data, users, and capital that could delay the decentralization narrative. Their approach of “AI Agents as system-level features” may prove more immediately compelling than crypto alternatives.
-
Regulatory Uncertainty: AI Agents that control devices and perform actions on behalf of users will inevitably face heightened regulatory scrutiny, potentially creating compliance challenges for crypto-native projects.
-
Execution Risk: The complexity of building truly functional AI Agents—particularly those that can reliably operate across different apps and contexts—should not be underestimated. Many promising concepts may fail to deliver practical utility.
-
Market Timing: The crypto market’s volatility means that even well-constructed AI-blockchain convergence projects may face significant headwinds during broader market downturns.
Strategic Recommendations for Investors
-
Focus on Differentiation: Rather than competing directly with centralized tech giants on functionality, crypto projects should emphasize verifiable execution, user control, and composability—areas where decentralized solutions have inherent advantages.
-
Prioritize Vertical Integration: The most promising projects are those that control multiple layers of the stack—from data collection and model training to inference and user interface. Crypto-native projects should aim for similar integration where possible.
-
Monitor Adoption Signals: Pay close attention to real-world usage metrics of AI Agents, particularly in markets with high crypto adoption. Early indicators of agent-driven behaviors could preced broader market shifts.
-
Balance Innovation and Practicality: While ambitious visions are important, prioritize projects with clear paths to near-term utility and revenue generation. The AI Agent revolution will likely unfold gradually, creating opportunities for pragmatic solutions.
ByteDance’s renewed focus on mobile AI Agents represents more than just a product strategy—it’s a bet on the future of human-computer interaction. For crypto investors, this trend underscores the importance of building decentralized alternatives that offer not just functionality, but verifiability, user control, and new economic models. The companies that successfully navigate this intersection of AI and blockchain may well define the next decade of digital innovation.