20
4
Pakistan
1 year of experience
Data science enthusiast
Text2Room is a groundbreaking innovation that redefines the process of 3D content creation. This transformative method harnesses the power of pre-trained 2D text-to-image models to generate room-scale textured 3D meshes from simple text prompts. Through an iterative scene generation process, Text2Room renders 3D meshes from diverse camera angles and seamlessly fuses missing details using advanced algorithms, resulting in immersive and captivating environments. What sets Text2Room apart is its two-stage viewpoint selection strategy โ the Generation Stage creates the main scene layout, while the Completion Stage intelligently fills gaps, ensuring a complete and coherent 3D representation. By democratizing 3D content creation, Text2Room eliminates complexities and accelerates the process, making it accessible to a wide range of industries including AR/VR content, gaming, and architectural visualization. Text2Room isn't just a product; it's a creative revolution that empowers users to turn their imagination into tangible, interactive experiences. With limitless applications and an ever-growing market demand for immersive 3D content, Text2Room stands at the forefront of innovation, shaping the future of content creation.
We developed a tool using the Yi-34B-200k model that will take scientific abstracts and summarize them for a person deeply interested in life extension, which we will call a biohacker in this context. These people want to extend their lives to 200-500 years or even longer and take a wide variety of supplements and drugs to help them live longer. There is much scientific research in life extension and going through scientific papers to find the latest research is a daunting task for many biohackers who lack the education given to a medical doctor or researcher. By making summaries of these articles in a form that is easy to understand, it will help biohackers follow the latest scientific research more easily.
The problem we are trying to solve is that of accessibility of the internet by people to disabilities. Accessibility in the sense that people with disabilities eg, people with ADHD, find it hard to navigate through complex and long webpages and end up getting less from what is meant to benefit them the most. Our solutiion is Accessify, A chrome extension that uses generative AI (Gemini Ultra APIs) to be able to simplify the process of navigating through webpages. Gemini Takes in the webpage, summarises and gives a more comprehensible, understandable and storylike version of the website which can make it easy for them to navigate thorugh the site. Accessify also offers additional features such as images summarisation and it is also able to explain an image inthe context pf the words that is found around it. This will help those with poor vision, color blindness, etc to better understand their webpages. Also, a Text-to-Speech model also makes it easy for people who are blind or who just prefer spoken words to listen to the better version of the website. In the future, we hope to make Accessify open source and others can contribute to it. Other features that we will want to include oin the feature is saving preference data to be able to provide better services to our audience, improving the view of the extension, add other feaures like contrast enhancement, and integrate it with an IoT device - Braille writer that can help translate Words to Braille for those with visual impairment to use. We hope to see people with disabilitiies flourish despite their seemingly disadvantaged state, but with accessify, we can change the game. Thank you.