I am Software Engineer with focus on Machine Learning and Artificial Intelligence. I am Problem solver and try to find the optimal solution of real life problem
Evernote is used by 250 million people worldwide to collect ideas, document meetings, and much more. It runs as a local application or in the cloud and has preceded comparable applications by the big tech companies. Once the note collection has grown over the years, it can become cumbersome to keep an overview and refer to past notes. Large language models offer help to improve search functions and build knowledge from old notes. With the dawn of open-source large language models such as Falcon by the Technology Innovation Institute, new possibilities have been opened. Since these models can be run on less powerful customer hardware, private Evernote data can now be processed, preserving privacy.
24 Sep 2023
ResearchWriterGPT: An Advanced Multimodal Research Paper Writing Assistant" is an innovative project designed to revolutionize academic writing. It combines the language and vision capabilities of GPT-4 with Clarifai's advanced AI tools to assist in drafting research papers. The tool is adept at processing both textual and visual data, ensuring comprehensive coverage from abstract creation to conclusion formulation, all in APA format. The project stands out for its integration of multimodal capabilities, including image and chart scanning, and analysis. It provides direct access to prominent academic databases like Google Scholar and Arxiv, streamlining the literature review process by aggregating and filtering relevant information. Furthermore, the tool enhances user experience by supporting interactive dialogues, including audio interaction, and allows for PDF analysis and conversion. Its innovative GPT-4 Vision feature broadens the application scope by enabling detailed image analysis, including reading texts and interpreting charts from various sources like research photos and medical images. In addition, the integration of Retrieval-Augmented Generation (RAG) with Clarifai creates a vast knowledge base, processing over 1.7 million STEM articles from ArXiv for quick, contextually relevant responses. This system not only supports efficient document management but also advances AI interaction for academic research. The technology backbone of ResearchWriterGPT includes GPT-4 Turbo for text generation, GPT-4 Vision for image processing, DALL-E API for image generation, and Clarifai’s RAG system for enriched data handling. Future expansions envision incorporating advanced Clarifai models for face sentiment analysis and object detection, predictive bibliography features, and comprehensive writing support covering all aspects of a research paper.
22 Jan 2024
Interactive Language Adventures: Dive into real-life scenarios, embark on quests, and engage in dialogues that mimic everyday situations. From ordering coffee in a bustling café to exploring cultural landmarks, our interactive adventures make language learning exciting and practical. Cultural Exploration: Language is deeply intertwined with culture. Our platform not only teaches you words and phrases but also immerses you in the rich tapestry of traditions, customs, and histories. Discover the world through the lens of language. Personalized Learning: Every learner is unique. Our AI-powered platform adapts to your pace, learning style, and proficiency level. Whether you’re a beginner or an advanced learner, Speakjourney tailors its lessons to meet your needs, ensuring a personalized and effective learning experience. Real Connections: Learning a language is about connecting with people. Through our platform, you can interact with native speakers, fellow learners, and language enthusiasts. Forge friendships, practice conversations, and expand your global network
12 Jan 2024
Neue View is an OCR solution powered by AI that aims to overcome the challenges of digitizing written data and converting it into insightful information, data entry, and visualization. This web-based OCR system makes it easy and efficient for non-technical users to digitize and analyze information. The two main problems that Neue View solves are; first, digitizing written data is a challenging task for every business. It is time-consuming, expensive, and not scalable in today's digital era. Secondly, there is an impending problem of an aging society where, by 2050, 20% of the population will be 65 years and above. This issue will demand new ways to maintain the same productivity. Doing manual data entry alone for 1% of US businesses would cost around 18 billion dollars annually. Users can capture or upload images from various devices using a web application. They can then select a language output and receive digitized data. The new view will convert the image to editable text that the user can export in various file formats. Users can also edit the text to improve the accuracy of our AI model and make conversational inputs to gain insights from the data. A new feature also allows users to create visualizations for better understanding of the information.
23 Feb 2024
The idea is to use Gemini's power of multi-modal power to get a better idea of a product using pictures, text, and documents. Our system will use Gemini AI to analyze product descriptions, and Compare different products and identify the best value proposition. Review product reviews by previous buyers and the repeat purchasing pattern by the same buyer. Recommend superior alternatives based on user preferences and market trends. Allow users to input information about their personality traits for personalized recommendations. Provide an insightful report explaining the reasoning behind each recommendation. Feature a chatbot interface to guide users through the product evaluation process. Product description could be provided through various means like:- URL of the product page, PDF copy of Product Description, Photo of Product itself, A click on the product page through a Chrome Plug-in, or pre-integrated in an e-commerce site, or in a mobile shopping app, where a photo of the product could be taken in real-time
25 Mar 2024