27
9
Pakistan
4 years of experience
Hi, I'm Faran Taimoor Butt, A passionate Full Stack Software Engineer, Robotics Engineer, Computer Vision Enthusiast & Certified Computer Hacking Forensic Investigator superhero_man from Pakistan. Completed my bachelor degree from Air University I’m currently working as Research Intern at NCAI Islamabad Pakistan
With the technological leap in generative AI, there are countless new businesses and startups which are using AI image generation in their pipeline. A non significant amount of these ventures can get supercharged with the ability to convert their 2D images to immersive environments and 3D objects A simple and intuitive tool to produce immersive 360° views using Dall-E 2 API as Proof of Concept Further iterations include integrating this with unity engine so the immersive environment can be manipulated and populated with 3D objects like in a video game. This use case has been verified by our first customers and by the industry leader StabilityAI.
4 Mar 2023
Voice Out is a revolutionary AI translation assistant that aims to solve the problem of inaccurate and ineffective communication between people from different linguistic backgrounds. In today's globalized world, communication is essential, and language barriers often cause misunderstandings and hinder productivity. Traditional translation software requires the user to speak with high accuracy and clarity to ensure a correct translation, which can be difficult for non-native speakers or in noisy environments. Voice Out uses cutting-edge deep learning algorithms to analyze the nuances of a speaker's voice, identify commonly used phrases, and provide accurate translations in real-time. This enables users to express themselves naturally, even if their grammar or pronunciation is not perfect. Additionally, Voice Out's intelligent learning capabilities allow it to adapt to a user's unique voice and vocabulary, making communication more seamless over time. Voice Out's user-friendly interface displays translations in real-time, allowing users to adjust their speech or ask follow-up questions. It also has the ability to translate both spoken and written language, making it a versatile tool for a wide range of communication needs. With Voice Out, individuals and businesses alike can communicate more effectively and efficiently across language barriers, unlocking new opportunities for collaboration, understanding, and growth.
7 Apr 2023
Our idea is to use AI, ML and open source technologies to revolutionize the healthcare fields - specifically Mental Wellness and Eye Care using Autonomous AI Agents. These Autonomous AI Agents are using SuperAGI and deployed on AWS. We have given the choice to users on our landing page to use Eye Health or AI Therapist for their purposes. The Eye Health uses Eye Care AI Agent to track blinks of users and let them know how their eyes are and what is the blink status of their eyes using real time live webcam feed. This will help to notify users and recommend them best strategies for eye care and early stage detection and prevention from diseases like Dry Eyes Disorders. We have the AI Therapist that uses Mental Wellness AI Agent to track the moods of the user from a live webcam feed and the SuperAGI helps in replying to users based on the mood track what they should do in order to be in a good mood, this helps users to track real time their moods and get some suggestions from AI Therapist. People of all ages have the tendency to have long working hours and exposures to screens and devices. These causes issues for eyes and also mental health gets affected due to such behaviors, mental wellness and some basic guidance as initial steps for healthcare can be provided by the AI Therapist and that can help users to track their mental moods, in future we can also allow them to have chatbots and have conversations with AI Therapist and Eye Care Agent. Eye Care Agent can help in tracking blinks and how good their eye health is and can suggest if their blink count per minute is less than 12, it can suggest the users to have some time off, do exercises, take a break, engage in healthy activities and take care of themselves. Screen time and usage of users can also be tracked and it can help to suggest users what steps to be taken in terms of productivity, creative and efficient parental controls, gaining better leverage on data using insights from AI & ML
21 Aug 2023
We have created an extension named IntelliSum that uses LLama2 and Clarifai for Intelligent Summarization and extraction of text from a given URL, using Stable Diffusion it can also generate AI Prompts and Images for given prompts. We have made this extension that is compatible with browsers and helps users to generate summary of texts, extract arguments from a piece of text, generate AI prompts and generate images according to the AI prompts. We have used technologies like JavaScript, Python, Llama2, Clarifai and other web technologies for creating this extension, we further in future want to extend our scope to fake news and fakeness detection in articles and AI Assitants
28 Aug 2023
This app lets the user automatically scrape websites by letting Stablecode generate JavaScript code to parse a part of the HTML and convert it to a CSV. Here is how our app works: The user enters a URL and the app returns the page's HTML. The user can then tell the AI where the data is located, and Stablecode will then write javascript file to parse the HTML and convert the relevant part into a csv. We are taking inputs from the user, that input will be HTML link that user wants to scrape and parse, we have the option given to user to select an element and write a prompt accordingly to scrap and parse that element from the given HTML link. Stablecode will generate JavaScript code to parse part of HTML and convert it into CSV format. This will help users to scrape websites and get JavaScript code for the selected element as well as the Downloadable CSV format code for the selected element.
25 Aug 2023
We have used Clarifai for image recognition, we give user the option to upload image and the AI can generate prompt according to the image it recognizes, based on that it generates music prompt to be passed to Musicgen for generating Music. Currently, with the MusicGen environment we have it takes approx 20 seconds - 22 seconds to generate music. The audio output we kept is upto 6 seconds for the user to hear it and download it. We also have an option to detect and recognize live webcam feed, the AI will generate prompt according to the image recognized and will generate prompt for MusicGen to generate music. In addition, the user can also simply write the prompts on their own to generate music.
31 Aug 2023