Pakistan
3 years of experience
I am a Machine Learning Engineer with 3 years of experience in audio, video, text, and image models. I have worked across the full data science lifecycle, including data collection, data cleaning, generating insights from data, model training, and model deployment. I love data science because I love extracting meaning from data. My skills include TensorFlow, PyTorch, scikit-learn, pandas, NumPy, and more. Models I have worked on: CNNs (1D, 2D, and 3D), U-Net, YOLO, ResNet50, etc. Problems I have solved: audio noise removal, object detection, video classification, regression analysis, semantic segmentation, etc.
This project addresses the language barrier Yoruba speakers face with image generation models, which work best with English prompts. We took a two-track approach: data collection and model development. Because Yoruba datasets are scarce, particularly for image generation prompts, we curated our own: we selected English sentences suitable as image generation prompts and translated them into Yoruba using a dictionary-based approach. We then trained a custom translator model to translate Yoruba into English. This intermediate step lets Yoruba prompts be passed to existing image generation models. In testing, we achieved 85% accuracy on the test set. Images are generated through the SDXL API. As a result, users can generate images from prompts written in their native language, and collecting our own data and training custom models lets us work around the scarcity of Yoruba resources. Looking ahead, we plan to extend the dataset to additional languages such as Fon and Dendi. Our ultimate goal is a model that generates images directly from Yoruba, Fon, and Dendi prompts without translating into English. In summary, the project addresses a pressing need within the Yoruba-speaking community and lays the groundwork for future work on multilingual image generation.
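The two tracks described above can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the project's actual code: the function names, the toy lexicon, and its placeholder tokens (which are not real Yoruba) are all hypothetical, and `translate` and `generate` are stand-ins for the trained Yoruba-to-English translator and the call to the SDXL API.

```python
from typing import Callable, Dict, List, Tuple


def build_parallel_pairs(
    english_prompts: List[str], lexicon: Dict[str, str]
) -> List[Tuple[str, str]]:
    """Create (target-language, English) training pairs by word-by-word lookup.

    Words missing from the lexicon are kept unchanged. This is an assumed
    simplification of a dictionary-based approach, not the project's method.
    """
    pairs = []
    for prompt in english_prompts:
        translated = " ".join(lexicon.get(word.lower(), word) for word in prompt.split())
        pairs.append((translated, prompt))
    return pairs


def generate_from_prompt(
    prompt: str,
    translate: Callable[[str], str],
    generate: Callable[[str], object],
) -> object:
    """Translate a native-language prompt to English, then generate an image."""
    english_prompt = translate(prompt)
    return generate(english_prompt)


# Hypothetical placeholder lexicon: the tokens are NOT real Yoruba words.
toy_lexicon = {"cat": "TOK_CAT", "river": "TOK_RIVER"}
pairs = build_parallel_pairs(["A cat near the river"], toy_lexicon)

# Stand-in callables so the sketch runs without a model or network access.
result = generate_from_prompt(
    "some prompt",
    translate=lambda text: text.upper(),
    generate=lambda text: f"image({text})",
)
```

Passing the translator and the image generator in as callables keeps the pipeline independent of any particular model or API client, so either stage could be swapped, for example for a future model that accepts Yoruba prompts directly.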
TestFastTrack is an app designed to improve IELTS test preparation by providing immediate, intuitive feedback. Traditional learning methods often leave users struggling to understand their mistakes; TestFastTrack addresses this by using real IELTS test passages and images to create a clear feedback loop. Users receive instant, detailed explanations of their errors, which supports actionable learning rather than rote memorization. The app's visual approach and progress tracking help learners engage with the material and monitor their improvement over time. With over 3.5 million IELTS test-takers annually, TestFastTrack meets the growing demand for efficient, personalized learning tools.