personalrobot.dev | meta

personalrobot.dev

#Personal Robot | Development

#Digital Twin of Person

#Non Verbal Cue

#Motion Control

#AI models deployed in embedded systems at edge | Brushless DC motors | Hall effect sensors | Optical encoders | Sensorless motor control | Field-oriented control | Artificial intelligence at edge | Three fundamental modalities: vision, sound, and motion | Using AI models to infer information about device environment | Linear algorithms | Software and hardware combination | Deploying multiple AI models in embedded devices requires edge processors designed to run AI | Embedded systems using AI can be considered open | Sensor fusion utilizes combined data from multiple sensors | AI-based vision systems are more adaptable to natural variations inherent in object inspection | Objects can be identified and inspected more quickly with greater flexibility | Strong multimodal AI, a single model will process multiple types of data | Control algorithms will use inputs generated by AI, inferred from multiple sources of data | AI inferencing in data flow | AI-enabled image sensors are perfect for gesture detection | Event detection based on sound is an active area of development | On device learning in real time

#Robotic Platform

#Machine Learning

#Computer Vision

#Gesture Control

#Stuntronics

#Superhuman manoeuvres

#Task Intelligence

#Transferlearning

#Untethered dynamic movement

#Onboard sensing

#Flips, twists and poses with repeatability and precision

#Extremely precise aerial motion control

#Differentiable simulation

#Reinforcement learning

#Dynamic robotic performance

#Gymnastic action

#Autonomous figure

#Flips with repeatability and precision

#Twists with repeatability and precision

#Poses with repeatability and precision

#personalrobot.ai

#personalrobot.app

#personalrobot.dev

#Problem Detecting

#Help Calling

#autonomousrobot.app

#autonomousrobot.dev

#nanotech skin

#Prompt engineering

#Speech Recognition

#Large language model (LLM)

#Improving control policies

#Upgrading actuation

#Minimizing joint complexity

#Three-fingered grippers

#Indoor deployments

#Deploying n fleets

#Functionally safe 3D vision

#Stepping toward greater levels of autonomy

#Making robots knowledgeable about different types of objects

#How to grasp objects

#Avoiding humanoid winter

#1550nm LiDAR | Advantages: safety, range, and performance in various environmental conditions | Enhanced Eye Safety: absorbed more efficiently by cornea and lens of eye, preventing light from reaching sensitive retina | Longer Detection Range | Improved Performance in Adverse Weather Conditions such as as fog, rain, or dust | Reduced Interference from Sunlight and Other Light Sources | More expensive due to complexity and lower production volumes of their components

#SLAM | Simultaneous Localization and Mapping

#Ultra sonic piezo motor

#Vector database

#Resistive RAM (ReRAM) technology | onsemi Treo platform to provide embedded non-volatile memory | ReRAM integration into Bipolar CMOS DMOS (BCD) process | Potential alternative to flash memory | Demand for faster, more efficient, and scalable memory solutions increasing | Lower power consumption | Less vulnerable to common hacking tactics | ReRAM can be integrated easily into chip designs without interfering with power analog components

#A-list celebrity home protector | Burglaries targeting high-end items | Burglary report on Lime Orchard Road | Burglar had smashed glass door of residence | Ransacked home and fled | Couple were not home at the time | Unknown whether any items were taken | Lime Orchard Road is within Hidden Valley gated community of Los Angeles in Beverly Hills | Penelope Cruz, Cameron Diaz, Jennifer Lawrence, Adele and Katy Perry have purchased homes there, in addition to Kidman and Urban | Kidman and Urban bought their home for $4.7 million in 2008 | 4,100-square-foot, five-bedroom home built in 1965 and sits on 1¼-acre lot | Property large windows have views of the canyons | Theirs is one of several celebrity properties burglarized in Los Angeles and across country recently | Connected to South American organized-theft rings

#Professional athlete home protector | South American crime rings | Targeting wealthy Southern California neighborhoods for sophisticated home burglaries | Behind burglaries at homes of professional athletes and celebrities | Theft groups conduct extensive research before plotting burglaries | Monitoring target whereabouts and weekly routines via social media | Tracking travel and schedules | Conducting physical surveillance at homes | Attacks staged while targets and their families are away | Robbers aware of where valuables are stored in homes prior to staging break-ins | Burglaries conducted in short amount of time | Bypass alarm systems | Use Wi-Fi jammers to block Wi-Fi connections | Disable devices | Cover security cameras | Obfuscate identities

#Vision Language model | Data preparation, alignment techniques and optimization methods necessary for embedding visual understanding capabilities within resource-constrained environments

#ROS 2 | The second version of the Robot Operating System | Communication, compatibility with other operating systems | Authentication and encryption mechanisms | Works natively on Linux, Windows, and macOS | Fast RTPS based on DDS (Data Distribution Service) | Programming languages: C++, Python, Rust

#Dexterous robot | Manipulate objects with precision, adaptability, and efficiency | Dexterity involves fine motor control, coordination, ability to handle a wide range of tasks, often in unstructured environments | Key aspects of robot dexterity include grip, manipulation, tactile sensitivity, agility, and coordination | Robot dexterity is crucial in: manufacturing, healthcare, logistics | Dexterity enables automation in tasks that traditionally require human-like precision

#Agentic AI | Artificial intelligence systems with a degree of autonomy, enabling them to make decisions, take actions, and learn from experiences to achieve specific goals, often with minimal human intervention | Agentic AI systems are designed to operate independently, unlike traditional AI models that rely on predefined instructions or prompts | Reinforcement learning (RL) | Deep neural network (DNN) | Multi-agent system (MAS) | Goal-setting algorithm | Adaptive learning algorithm | Agentic agents focus on autonomy and real-time decision-making in complex scenarios | Ability to determine intent and outcome of processes | Planning and adapting to changes | Ability to self-refine and update instructions without outside intervention | Full autonomy requires creativity and ability to anticipate changing needs before they occur proactively | Agentic AI benefits Industry 4.0 facilities monitoring machinery in real time, predicting failures, scheduling maintenance, reducing downtime, and optimizing asset availability, enabling continuous process optimization, minimizing waste, and enhancing operational efficiency

#Multipurpose commercial humanoid | Potential for useful and reliable and affordable humanoids | Difficult problem making highly technical piece of hardware and software compete effectively with humans in labor market | Robots are not hard to build; but they are hard to make useful and make money with | Whole perception pipeline running at the framerate of sensors nowadays | All the technology is here now | Starting with surrogate robot from someone else to get autonomy team going while building own robot in parallel | Giving out a significant chunk of the company to early joiners | Combined efforts of the research community enables commercialization | Building team is really important

#Humanoid robots and fashion future | Shanghai, humanoid robots transcend fashion hype, reimagining design, challenging beauty norms, and unlocking metaverse opportunities | Convergence of fashion and technology | Human-machine collaboration in fashion | Genuine, emerging trend | Creativity, production, and human-machine interaction | Robots are becoming experimental platforms | Integration of robots into runway | Aesthetic Reinvention: designing beyond the human form | Fostering Human-Robot Collaboration From Runway to Production and Retail | Challenging Beauty Norms | Paving Way for Future Trajectories: The Metaverse of Fashion

#Large Language Model (LLM) | Foundational LLM: ex Wikipedia in all its languages fed to LLM one word at a time | LLM is trained to predict the next word most likely to appear in that context | LLM intellugence is based on its ability to predict what comes next in a sentence | LLMs are amazing artifacts, containing a model of all of language, on a scale no human could conceive or visualize | LLMs do not apply any value to information, or truthfulness of sentences and paragraphs they have learned to produce | LLMs are powerful pattern-matching machines but lack human-like understanding, common sense, or ethical reasoning | LLMs produce merely a statistically probable sequence of words based on their training | LLMs are very good at summarizing | Inappropriate use of LLMs as search engines has produced lots of unhappy results | LLM output follows path of most likely words and assembles them into sentences | Pathological liars as a source for information | Incredibly good at turning pre-existing information into words | Give them facts and let them explain or impart them

#Retrieval Augmented Generation. (RAG LLM) | Designed for answering queries in a specific subject, for example, how to operate a particular appliance, tool, or type of machinery | LLM takes as much textual information about subject, user manuals and then pre-process it into small chunks containing few specific facts | When user asks question, software system identifies chunk of text which is most likely to contain answer | Question and answer are then fed to LLM, which generates human-language answer in response to query | Enforcing factualness on LLMs

#Ethernet Cameras | Ethernet Vision

#Unitree R1 humanoid | Agile mobility: 24-26-DOF for adaptation to complex scenarios; its 2-DOF head enhances environmental perception | Lightweight structure, easy maintenance: ≤121cm agile form, ultra-lightweight at about 25kg, ready out-of-the-box to empower | Integrated with Large Multimodal Model for voice and images: Fully open control interfaces for joints and sensors, with support for mainstream simulation platforms | Height Width and Thickness（Stand): 1210x357x190mm | Degree of Freedom(Total Joints): 24 | Single Leg Degrees of Freedom: 6 | Single Arm Degrees of Freedom: 5 | Waist Degrees of Freedom: 2 | Head Degrees of Freedom: None | Dexterous Hand: NOT | Joint output bearing: Crossed roller bearings, Double Hook Ball Bearings | Joint motor: Low inertia high-speed internal rotor PMSM(permanent magnet synchronous motor,better response speed and heat dissipation) | Maximum Torque of Arm Joint: 约 2kg | Calf + Thigh Length: 675 | Forearm + Upper Arm Length: 435 | Joint Movement Space: Waist Joint：Y±150° R±30°, Knee Joint：-10°~+148°, Hip Joint：Y:±157° P:-168° ~+146° R:-60° ~+100° | Electrical Routing: Hollow + Internal Routing | Joint Encoder: Dual + single encoder | Cooling System: Local air cooling | Power Supply: Lithium battery | Basic Computing Power: 8-core high-performance CPU | Microphone Array: 4-Mic Array | Speaker: YES | WiFi 6 | Bluetooth 5.2 | Humanoid Binocular Camera | NVIDIA Jetson Orin Optional (40-100 Tops) | Smart Battery (Quick Release) | Charger | Manual Controller | Battery Life: about 1h | Upgraded Intelligency: OTA | Warranty Period: 8 Months

#Large Behavior Model (LBM) | Controlling the entire robot actions | Joint research partnership between Boston Dynamics and Toyota Research Institute | Collaboration aims to create a general-purpose humanoid assistant | Whole-body movements: walking, crouching, and lifting to complete tasks that involve sorting and packing

#AI generalist robot | Developing end-to-end language-conditioned policies | Taking full advantage of capabilities of humanoid form factor, including taking steps, precisely positioning its feet, crouching, shifting its center of mass, and avoiding self-collisions | Building policies process: 1. Collect embodied behavior data using teleoperation on both real-robot hardware and in simulation, 2. Process, annotate, and curate data to easily incorporate it into machine learning pipeline, 3. Train neural-network policy using all of the data across all tasks | 4. Evaluate the policy using a test suite of tasks | Policy maps inputs consist of images, proprioception, language prompts to actions that control robot at 30Hz | Leveraging diffusion transformer together with flow matching loss to train model | Dexterous manipulation including part picking, regrasping | Subtasks triggered by passing a high-level language prompt to the policy | Reacting intelligently when things go wrong | With Large Behavior Model (LBM), training process is the same whether it is stacking rigid blocks or folding a t-shirt: if you can demonstrate it, robot can learn it | Speeding up the execution at inference time without requiring any training time changes

#Teleoperation | High-Quality Data Collection for Model Training | Control system allows to perform precise manipulation while maintaining balance and avoiding self-collisions | VR headset for operators to fully immerse themselves in the robot workspace and have access to the same information as the policy, with spatial awareness bolstered by a stereoscopic view rendered using head mounted cameras reprojected to the user viewpoint | Custom VR software provides teleoperator with a rich interface to command robot, providing them real-time feeds of robot state, control targets, sensor readings, tactile feedback, and system state via augmented reality, controller haptics, and heads-up display elements | One-to-one mapping between user and robot (i.e. moving your hand 1cm would cause robot to also move by 1cm) | To support mobile manipulation, tracking on feet added and teleoperation control extended to support stance mode, support polygon, and stepping intent to match that of operator

#Policy | Toyota Research Institute.Large Behavior Model | Diffusion Policy-like architecture | Boston Dynamic policy | Diffusion Transformer-based architecture | Flow-matching objective | Conditioned on proprioception, images | Accepting language prompt that specifies objective to robot | Image data comes in at 30 Hz | Network uses a history of observations to predict an action-chunk | Observation space consists of images from robot head-mounted cameras along with proprioception | Action space includes joint positions for left and right grippers, neck yaw, torso pose, left and right hand pose, and left and right foot poses | Shared hardware and software across two robots aids in training multi-embodiment policies that can function across both platforms, allowing to pool data from both embodiments | Quality assurance tooling allows to review, filter, and provide feedback on data collected

#Simulation | Allows to quickly iterate on teleoperation system and write unit and integration tests | Performing informative training and evaluations that would otherwise be slower, more expensive and difficult to perform repeatably on hardware | Simulation stack is faithful representation of hardware and on-robot software stack | Ability to share data pipeline, visualization tools, training code, VR software and interfaces across both simulation and hardware platforms | Benchmarking policy and architecture choices | Incorporating simulation as a significant co-training data source for multi-task and multi-embodiment policies deployed on hardware

#Whole body control and enhanced teleoperation | Sim-first approach | Imitation learning | Motion planner-based workflow for generating data in manipulation tasks | Dexterous manipulation tasks | Dictionary observation space for perception and proprioception | Automatic Domain Randomization (ADR) | Remote manipulation and locomotion control | Dexterous retargeting: the process of translating human hand configurations to robot hand joint positions for manipulation tasks | Human-provided subtask segments | Multiphase planning (approach, contact, retreat) | Dynamic object attachment and detachment | Synthetic data generation (SDG) | Robot knowing its location | Loco-manipulation (coordinated execution of locomotion and manipulation) | Policy evaluation framework for scalable simulation-based experimentation

#Immediate.Measures to Increase American Mineral Production

#Critical minerals in Artificial Intelligence | At the core of AI transformation lies a complex ecosystem of critical minerals, each playing a distinct role | Boron: used to alter electrical properties of silicon | Silicon: fundamental material used in most semiconductors and integrated circuits | Phosphorus: helps establish the alternating p-n junctions necessary for creating transistors and integrated circuits | Cobalt: used in metallisation processes of semiconductor manufacturing | Copper: primary conductor in integrated circuits | Gallium: used in compound semiconductors such as gallium arsenide (GaAs) and gallium nitride (GaN) | Germanium: used in high-speed integrated circuits and fibre-optic technologies | Arsenic: employed as a dopant in silicon-based semiconductors | Indium phosphide: widely used in optical communications | Palladium: used in production of multi-layer ceramic capacitors (MLCCs) | Silver: the most conductive metal used in specialised integrated circuits and circuit boards | Tungsten: serves as a key material in transistors and as a contact metal in chip interconnects | Gold: used in bonding wires, connectors, and contact pads in chip packaging | Europium: enables improved performance in lasers, LEDs, and high-frequency electronics essential to AI systems and optical networks | Yttrium: improves the efficiency and stability of materials like GaN and InP, supporting advanced applications in photonics, high-speed computing, and communications technologies

#Critical minerals for Optics, Imaging & Advanced Materials | Graphite: high-speed electronics, advanced sensors, and thermal management systems | Copper: short-distance data transmission in AI data centres | Germanium: a key material in thermal imaging, night-vision optics, and fibre-optic communication systems | Indium: optical communication systems | Praseodymium: specific types of lasers and optical materials | Neodymium:solid-state lasers | Holmium: specialised laser systems, particularly medical and scientific applications

#Critical minerals for Power Supply & Batteries | Lithium: portable electronics, wearables, electric vehicles | Graphite: stores lithium ions during charging process and releases them during discharge | Manganese: used in various lithium-ion battery chemistries | Cobalt: critical to the performance of premium mobile and computing devices | Nickel: crucial for electric vehicles, high-performance electronics, and energy-intensive AI systems

#Silicon Photonics | Chip-scale implementation of opto-electronic systems on silicon substrates | Electro-optic transceivers in both the short distance datacom and high-performance coherent optical communications segments | Light detection and ranging, LiDAR | Optical coherence tomography | Material integration | Advanced assembly concepts | Advanced signal processing schemes | Emerging applications in biology | Emerging computation platforms | aiXscale Photonics spin off

#Humanoid Robots and Fashion Future | Humanoid robots transcend fashion hype | Re-imagining design | Challenging beauty norms | Unlocking metaverse opportunities | Convergence of fashion and technology | Human-machine collaboration in fashion | Genuine, emerging trend | Creativity, production, and human-machine interaction | Robots are becoming experimental platforms | Integration of robots into runway | Aesthetic Reinvention: designing beyond the human form | Fostering Human-Robot Collaboration From Runway to Production and Retail | Challenging Beauty Norms | Paving Way for Future Trajectories: The Metaverse of Fashion | Istitutomarangoni | Fashion schools | Milano School of Fashion | Firenze School of Fashion & Art | With AI the most intriguing part of the results is the serendipity, the unknown, the unexpected outcomes that can inspire new design ideas and lead to creative breakthroughs

#Humanoids Rise | Figure’s $1 Billion | Musk $1 Trillion | Figure AI is now one of the most valuable robotics companies | Autonomous capabilities in industrial and household settings | Figure targets 100 000 annual production capacity by 2028 making ~$12 000 per humanoid | Humanoid market could hit $7 trillion by 2050, with billion humanoids | Potentially multiple humanouds per household (similar to cars) | Listed firms with humanoid projects: Hexagon (HEXA), UBTech (9880), Hyundai (Boston Dynamics)

#Robotics development platform | Autonomous mobile robots (AMRs) | Robot arms | Manipulators | Humanoids | Simulation | Robot learning frameworks | GPU accelerated libraries | AI models | Reference workflows

#OpenPI | Open-source robotics framework | Developed by Physical Intelligence | Supports large-scale training and evaluation of general-purpose robotic models like π₀ and π₀.₅, both of which are open-source and available via the OpenPI GitHub repo