Pakistan
1 year of experience
Text2Room is a groundbreaking innovation that redefines the process of 3D content creation. This transformative method harnesses the power of pre-trained 2D text-to-image models to generate room-scale textured 3D meshes from simple text prompts. Through an iterative scene generation process, Text2Room renders 3D meshes from diverse camera angles and seamlessly fuses missing details using advanced algorithms, resulting in immersive and captivating environments. What sets Text2Room apart is its two-stage viewpoint selection strategy – the Generation Stage creates the main scene layout, while the Completion Stage intelligently fills gaps, ensuring a complete and coherent 3D representation. By democratizing 3D content creation, Text2Room eliminates complexities and accelerates the process, making it accessible to a wide range of industries including AR/VR content, gaming, and architectural visualization. Text2Room isn't just a product; it's a creative revolution that empowers users to turn their imagination into tangible, interactive experiences. With limitless applications and an ever-growing market demand for immersive 3D content, Text2Room stands at the forefront of innovation, shaping the future of content creation.
14 Aug 2023
This cloud-hosted platform utilizes Clarifai and Open Source Llama 2 models to deliver a revolutionary AI experience. [Conceptual Foundation] At the core of this endeavor are dual Large Language Models (LLMs). These are not just any AI models; they are purpose-built to emulate the two hemispheres of the human brain. One LLM excels in analytical and logical reasoning, mimicking the left hemisphere's capabilities. In contrast, the second LLM focuses on symbolic understanding and creative interpretation, akin to the right hemisphere of the brain. [Harmonization Mechanism] To ensure these two divergent models work in concert, we reintroduce the foundational model as a mediating model. This simpler AI serves as a bridge, deciding when to utilize logical analytics and when to engage in artistic ideation. It integrates the outputs of both LLMs into a cohesive and nuanced chain of thought, thus creating an AI that can think dichotomously. [User Interface] The Web User Interface (WebUI) serves as the touchpoint for human interaction. It allows users to manage and interact with both LLMs and the Mediating Model. Designed with accessibility in mind, the WebUI offers a transparent look into how the AI thinks, reasons, and makes decisions. [Technical Integrity] As a full-stack project, we've designed both front-end and back-end components using standard web technologies and machine learning frameworks. This ensures a robust, scalable, and adaptable system capable of evolving as AI and web technologies advance. [Objectives and Impact] The ultimate goal is more than just technical achievement; it's to craft an elegant solution that balances the analytical and creative facets of thought, much like a human brain. The project reflects both the scientific rigor and artistic creativity inherent in complex problem-solving. Your engagement with this project offers a glimpse into the future of AI—a future where machines don't just calculate and sort but truly think and create.
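The mediating step described above can be pictured with a small routing sketch. It is illustrative only: the two model functions are hypothetical stand-ins rather than real Clarifai or Llama 2 calls, and the keyword heuristic is an assumption, since the description does not say how the mediator decides.

```python
# Hypothetical stand-ins for the two LLMs; a real system would call hosted models here.
def analytical_model(prompt: str) -> str:
    return f"[analytical] step-by-step answer to: {prompt}"


def creative_model(prompt: str) -> str:
    return f"[creative] imaginative take on: {prompt}"


# Assumed keyword hints; the actual mediating model's decision rule is not described.
ANALYTICAL_HINTS = ("calculate", "prove", "compare", "debug", "how many")
CREATIVE_HINTS = ("imagine", "story", "poem", "design", "brainstorm")


def mediate(prompt: str) -> str:
    """Route a prompt to one model, or consult both and merge their outputs."""
    text = prompt.lower()
    wants_logic = any(hint in text for hint in ANALYTICAL_HINTS)
    wants_art = any(hint in text for hint in CREATIVE_HINTS)
    if wants_logic and not wants_art:
        return analytical_model(prompt)
    if wants_art and not wants_logic:
        return creative_model(prompt)
    # Mixed or ambiguous prompt: combine both perspectives into one reply.
    return analytical_model(prompt) + "\n" + creative_model(prompt)
```

Falling back to both models for ambiguous prompts keeps the sketch close to the stated goal of integrating the two outputs into one chain of thought.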
28 Aug 2023
"StableReverse" is an app that empowers users to explore, comprehend, and analyze Python code repositories hosted on GitHub. This innovative project simplifies the challenging task of reverse engineering code by offering a comprehensive suite of features and an intuitive interface, making it accessible to a broad spectrum of users, from experienced developers to data scientists and curious learners. StableReverse leverage GPT3 for analyzing the repo filse system and StableCode for writing the code. Use Cases: Code Debugging: Developers can use "StableReverse" to understand and debug unfamiliar code segments, identifying issues and improving software quality. Algorithm Exploration: Data scientists and researchers can explore complex algorithms and data processing techniques implemented in open-source projects. Learning Tool: Students and learners can gain insights into coding practices by studying and reverse engineering real-world code. Open-Source Contribution: Contributors to open-source projects can quickly grasp project structures and coding conventions. Code Auditing: Security experts can use "StableReverse" to identify potential vulnerabilities and security issues in codebases. Innovation Exploration: Innovators and entrepreneurs can explore existing codebases for inspiration and to understand emerging technologies.
25 Aug 2023
SonicVision: The Pinnacle of Interactive Storytelling and Sensory Immersion
In the ever-evolving landscape of gaming and interactive experiences, SonicVision stands as a groundbreaking innovation. Developed to be showcased at the AudioCraft Hack-a-Thon 2023, this transformative platform promises to redefine the way users engage with digital worlds.

A Harmonious Blend of Art and Sound
At the core of SonicVision is a revolutionary amalgamation of generative music and dynamic art, all woven into compelling stories that users can not only experience but also shape. Imagine entering a fantastical world where every decision you make not only progresses the story but also influences the art and music that envelop you. With SonicVision, this is not just a possibility; it's the standard experience.

The Sonic Wonders of AudioCraft
A crucial component that drives the platform is AudioCraft, an AI-driven music generation system that goes beyond mere background scores. Developed in-house, AudioCraft uses state-of-the-art AI models to generate music across all genres and styles. Whether you're venturing into an enchanted forest or a post-apocalyptic city, AudioCraft crafts the perfect auditory atmosphere, complete with sound effects that impeccably align with every situation.

OpenAI: The Dungeon Master of Your Dreams
SonicVision's immersive storytelling experience is powered by OpenAI's ChatGPT, which serves as the Dungeon Master of your interactive journey. This is not just a chatbot; it's a narrative genius. It utilizes a tailored prompt layer that does more than merely guide the story. ChatGPT dynamically commands the visual and musical elements of the game, adding layers of depth and interactivity previously unexplored in digital storytelling.
31 Aug 2023
In the era of global mobility, the challenges of relocating to a new country are more relevant than ever. Navi Lingua emerges as a comprehensive solution to these challenges, serving as both a guide and a guardian for individuals embarking on this life-changing journey. Understanding and respecting a new culture is often the first hurdle in any relocation process. Navi Lingua offers interactive lessons that provide a nuanced understanding of local customs and traditions. It goes beyond mere tips and do's and don'ts, aiming for users to become integrated members of their new community. Legal procedures for securing residency in a new country can be daunting. Navi Lingua demystifies this process, providing straightforward, comprehensive information on the government procedures required for residency. Language barriers can be one of the most isolating aspects of moving abroad. Navi Lingua leverages advanced AI language resources and real-time translation tools not just to overcome these barriers but to erase them. Safety is a paramount concern, especially in unfamiliar settings. Navi Lingua offers a built-in safety net feature, providing emergency contact numbers, essential safety guidelines, and real-time news updates that could be vital to the user. The app is built on robust technologies: ViteJS and React for the frontend and FastAPI for the backend, powered by Meta's SeamlessM4T multilingual model. For deployment, it uses Vercel, known for its efficiency and reliability. In essence, Navi Lingua isn't just an app; it's a comprehensive ecosystem tailored to the unique challenges and opportunities that come with relocating to a new country. It stands as a testament to how technology can be harnessed to enrich human experience, offering a user-friendly interface to navigate the complexities of life abroad.
8 Sep 2023
AI-SteerEDU aims to improve online learning platforms by giving individual learners the opportunity to leave feedback. Learners can also ask questions in different languages and get responses in the same language: if a learner gives feedback in French, the response is also in French. Llama 2 gives us the opportunity to fine-tune its models on custom datasets to get targeted results. Our main aim is to collect feedback from students on online learning and coaching platforms and to recommend the same courses to other students according to their interests. Online learning platforms are used by millions of students worldwide, but finding the right courses and materials can be overwhelming. Our challenge is to create a personalized recommendation system that helps learners discover relevant courses and resources effectively.
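One simple way to picture interest-based recommendation is to score each course by how many of its topics match a learner's interests. The sketch below is a hypothetical baseline, not AI-SteerEDU's method: the function name, the topic tags, and the overlap score are all assumptions, and no Llama 2 model is involved.

```python
def recommend(interests: set[str], courses: dict[str, set[str]], top_n: int = 3) -> list[str]:
    """Return up to top_n course titles ranked by topic overlap with the interests."""
    scored = []
    for title, topics in courses.items():
        overlap = len(interests & topics)
        if overlap:  # skip courses that share no topic with the learner
            scored.append((overlap, title))
    # Highest overlap first; ties are broken alphabetically by title.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [title for _, title in scored[:top_n]]
```

In a fuller system the interest set might be extracted from student feedback by a language model, with this kind of overlap score serving only as a starting point.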
15 Sep 2023