Researchers have undertaken the formidable task of improving the independence of visually impaired people through the innovative Project Guide. This initiative seeks to empower people who are blind or have low vision by leveraging machine learning (ML) on Google Pixel phone devices, allowing them to walk or run independently. The project revolves around a waist-mounted phone, a designated guide on a pedestrian path, and a sophisticated combination of audio cues and obstacle detection to guide users safely through the physical world.
Project Guideline emerges as an innovative solution for computer vision accessibility technology. Unlike conventional methods that often involve external guides or guide animals, the project uses on-device ML designed for Google Pixel phones. The researchers behind Project Guideline have devised a comprehensive method that employs ARCore to track user position and orientation, a DeepLabV3+-based segmentation model to detect guidance, and a monocular depth ML model to identify obstacles. This unique approach allows users to independently navigate outdoor trails marked with a painted line, marking a significant advancement in assistive technology.
Delving into the intricacies of Project Guideline’s technology reveals a sophisticated system at work. The core platform is designed with C++, seamlessly integrating essential libraries such as MediaPipe. ARCore, a fundamental component, estimates the position and orientation of the user as they travel the designated path. Simultaneously, a segmentation model processes each frame, generating a binary mask that outlines the guide. The added points create a 2D map of the guidance trajectory, ensuring a complete representation of the user’s environment.
The control system dynamically selects target points on the line, providing a navigation signal that considers the user’s current position, speed and direction. This innovative approach eliminates noise caused by irregular camera movements during activities such as running, offering a more reliable user experience. Including obstacle detection, facilitated by a depth model trained on a diverse data set known as SANPO, adds an additional layer of security. The model is adept at discerning the depth of various obstacles, including people, vehicles, poles, and more. Depth maps are converted into 3D point clouds, similar to the line segmentation process, forming a comprehensive understanding of the user’s environment. The entire system is complemented by a low-latency audio system, which guarantees the delivery of audio signals in real time to guide the user effectively.
In conclusion, Project Guideline represents a transformative step in the accessibility of computer vision. The researchers’ meticulous approach addresses the challenges faced by visually impaired people and offers a holistic solution that combines machine learning, augmented reality technology and audio feedback. The decision to open the Project Guide further emphasizes the commitment to inclusion and innovation. This initiative not only improves user autonomy but also sets a precedent for future advances in assistive technology. As technology evolves, Project Guideline serves as a beacon illuminating the path to a more accessible and inclusive future.
Review the GitHub and Blog. All credit for this research goes to the researchers of this project. Also, don’t forget to join. our 33k+ ML SubReddit, 41k+ Facebook community, Discord channel, and Electronic newsletterwhere we share the latest news on ai research, interesting ai projects and more.
If you like our work, you’ll love our newsletter.
Madhur Garg is a consulting intern at MarktechPost. He is currently pursuing his Bachelor’s degree in Civil and Environmental Engineering from the Indian Institute of technology (IIT), Patna. He shares a great passion for machine learning and enjoys exploring the latest advancements in technologies and their practical applications. With a keen interest in artificial intelligence and its various applications, Madhur is determined to contribute to the field of data science and harness the potential impact of it in various industries.
<!– ai CONTENT END 2 –>