Name: Koshik Debanath
Profile: Software Engineer
Email: koshik.debanath@gmail.com
Linkdin: Koshik Debanath
Github: Koshik Debanath
Google Scholar: Koshik Debanath
Skills
- Natural Language Processing (NLP)
- Large Language Models (LLM)
- MLOps
- PyTorch
- TensorFlow
- Exploratory Data Analysis
- Computer Vision
- Data Visualization
- GPT-3
- Artificial Intelligence (AI)
- Data Science
- Time Series Analysis
- Deep Learning
- Machine Learning
- Python (Programming Language)
- Fine Tuning LLM's
- Computer Vision
- Convolutional Neural Networks
- Recurrent Neural Networks
- Long Short-Term Memory (LSTM)
- Generative Adversarial Networks (GAN)
- Cloud Computing
- Data Mining
- Prompt Engineering
About me
Hello! I am Koshik Debanath. I completed my Bachelor's Degree in Computer Science & Engineering (CSE) from Rajshahi University of Engineering & Technology (RUET). I am a Data Scientist and Machine Learning Engineer with expertise in Deep Learning, Convolutional Neural Networks,Time Series Analysis, Natural Language Processing and LLM's. With a background in mathematics, computer science, and statistics, I have honed my skills in developing complex algorithms and models to extract insights and drive value from large, structured and unstructured data sets.
My passion for machine learning and data science has led me to specialize in a number of areas, including deep learning, convolutional neural networks, and time series analysis. I have experience building and deploying models for a variety of applications, such as image and speech recognition, natural language processing, and predictive maintenance.
I am always eager to learn and keep up-to-date with the latest developments in the field, as I believe this is crucial to staying ahead in the rapidly-evolving world of data science. As a result, I enjoy collaborating with fellow experts and working on challenging projects that push the boundaries of what is possible in machine learning and data science. If you are interested in connecting with me, discussing new ideas, or collaborating on projects, please do not hesitate to reach out. I am always looking for new opportunities to apply my skills and knowledge, and to help drive innovation in the field of data science.
Hire Me! 👇
Achievements & Competitions
Engaging in challenges to learn, grow, and test my skills. Here are some highlights.
Machine Hack Master
Global Ranking: 539/8861
An online platform for Machine Learning competitions, showcasing consistent performance.
View ProfilePredicting House Prices
Rank: 24/2885
Bengaluru House Price Prediction Competition on MachineHack.
View DetailsData Science Championship
South Zone Rank: 73/554
Student Championship Competition on MachineHack.
View DetailsAnalytics Olympiad 2022
Rank: 82/1029
National Level Analytics Competition by MachineHack.
View DetailsData Science Championship 2024
Rank: 7/1059 (Top 10!)
Student Championship Competition on MachineHack.
View DetailsPROJECTS
PUBLICATIONS
Research papers published in prestigious conferences and journals
Work Experience
My professional journey and key contributions.
Software Engineer - I
Universal Machine Inc.
Contributing to the development of innovative software solutions, with a focus on browser extensions, API integrations, and AI-powered applications to enhance user experience and automate complex tasks.
Key Projects & Contributions:
YouTube Live Stream Bot
Description: A Chrome Extension designed to automate interactions within YouTube Live chat, leveraging AI for intelligent response generation and user engagement.
Key Responsibilities & Achievements:
- Developed the Chrome Extension foundation using JavaScript, Chrome APIs (e.g., `chrome.runtime`, `chrome.tabs`), and asynchronous programming patterns for non-blocking operations.
- Integrated YouTube Data API for real-time fetching of live chat messages and programmatically posting responses.
- Incorporated OpenAI API (e.g., GPT models) for generating contextually relevant and engaging AI-driven chat replies.
- Engineered AI features including conversational history management using `chrome.storage` for persistence and applied prompt engineering techniques to improve context understanding and response quality.
- Implemented secure Google OAuth 2.0 flow using `chrome.identity` API for user authentication and authorization to access Google services.
- Designed and implemented robust error handling and logging mechanisms for external API calls and extension operations.
Tech Stack: JavaScript, Chrome Extension APIs (Storage, Identity, Runtime, Tabs), YouTube Data API, OpenAI API, Google OAuth, HTML, CSS, Asynchronous JavaScript.
Skills Leveraged/Developed:
Data Scientist
Manaknightdigital
Leveraging data to solve complex business problems through statistical analysis, data visualization, machine learning, deep learning, and API development. My work encompasses a diverse range of data-centric tasks and projects.
Key Responsibilities & Technical Focus:
- Developing and deploying Machine Learning & Deep Learning models.
- Performing comprehensive Data Analysis & creating insightful Visualizations.
- Utilizing Statistical Tools for robust data interpretation.
- Implementing NLP techniques: Text Normalization, BERT, Sentiment Analysis, Word Embedding.
- Building and integrating solutions with Python, MySQL, Flask.
- Working with libraries like TensorFlow, Scikit-learn, Keras, Seaborn.
- Conducting Time Series Analysis for forecasting and trend identification.
- Fetching and analyzing data from GPT models.
Skills:
Key Projects:
Chatbot Development (Atlas Copco)
Goal: Create a chatbot to answer product-related questions based on provided data, managing token limits for GPT models.
Solution: Utilized GPT-4 with data from Excel (product info: Model Name, Price, Capacity, Links). Developed a Flask-based interface. Implemented strategies to handle GPT token size constraints effectively.
Tech Stack: OpenAI GPT-4, Flask, openpyxl, Pandas, NumPy, NLTK, Scikit-learn, TensorFlow, Keras.
Fraud Transaction Detection
Goal: Build an ML model to detect fraudulent transactions from historical data.
Solution: Performed EDA and Feature Extraction. Applied and compared multiple ML algorithms (XGBoost, SVC, Logistic Regression). Achieved 90% accuracy with XGBoost after hyperparameter optimization. Deployed the model for production.
Tech Stack: Scikit-learn, XGBoost, Flask, Pandas.
Michel AI Parabroker ChatBot (Cynario.ai)
Goal: Scrape data from various lenders and enable users to query this data effectively.
Solution: Implemented Retrieval Augmented Generation (RAG) for searching. Used Pinecone for vector data storage and Cohere Reranking to refine search results from scraped PDF and web data.
Tech Stack: PyPDF2, BeautifulSoup, Gunicorn, GPT-4, Pinecone, RAG, Cohere, Flask.
Football Analyst ChatBot (Kaizenwin)
Goal: Create a chatbot to analyze football pass data for specific teams/players and predict future match outcomes.
Solution: Scraped data from diverse football sources. Leveraged RAG, Pinecone, and Cohere to process and retrieve insights from the large dataset.
Tech Stack: BeautifulSoup, PyPDF2, RAG, Pinecone, Cohere.
Image Generation (Aieventbooth)
Goal: Allow users to generate images based on presets applied to a given input image.
Solution: Used Stable Diffusion for image generation. Trained the model on custom images. Image generation is triggered by user-defined presets.
Tech Stack: Transformers, PyTorch, Gunicorn, Stable Diffusion.
AI Energy Tender Matching
Goal: Match incoming tenders against an organization's privacy policies and internal data, explaining the rationale for matches.
Solution: Chunked organizational data and fine-tuned models (Llama-2-7B/13B, Mistral-7B). Used cosine similarity for initial matching and GPT-4 for detailed explanations of matched data.
Tech Stack: Transformers, PyTorch, Gunicorn, Llama-2, Mistral, GPT-4.