PROFILE
Hi_ I am a data scientist with strong skills in Python, SQL, Excel and Power BI. I specialise in transforming unstructured data into meaningful insights that improve business decision-making and drive growth. I work across the data lifecycle, from problem definition and preparation to analysis, modeling and reporting. My goal is to deliver clear, data-driven solutions that connect technical outputs with business impact.
Education
Certificate in Data Science
Africa Leadership Experience 2025Skills
- Programming & Analysis: Python (pandas, NumPy, scikit-learn) SQL, Excel
 - Machine Learning: Regression, Classification, Clustering, Model Validation
 - Visualisation & BI Tools: Excel, Power BI, Matplotlib, Seaborn
 - Data Handling: Data cleaning, transformation and exploratory analysis
 - Soft Skills: Problem-solving, teamwork, communication, continuous learning
 
Awards
2nd Place in Kaggle Competition
ALX Competition ~ 2025Achieved 2nd place in the ALX Movie Recommendation Project 2025 Kaggle competition by building and optimising a high-performing recommendation model.
ViewLANGUAGES
- English
 - Swahili
 
EXPERIENCE
Data Entry
Mount Kenya Ewaso Water Partnership (MKEWP)
                                        Responsibility: Led farmer outreach, collected data on farming and fuel use, promoted energy-efficient stoves and used soil testing results to guide land use and crop choices for improved productivity.
                                        
                                        Achievement: Reduced firewood use by 40% through support for energy-efficient stove adoption. Increased farm productivity by 25% through targeted soil testing and crop planning.
                                    
PROJECTS
Segment customers based on purchasing behavior to optimise marketing and reduce return rates.
Personal
                                        Responsibility: Analysed over 70,000 transactions, created features for clustering and used K-means to segment customers into four groups based on recency, frequency and return behavior.
                                        
                                        Achievement: Reduced return rates by 18% by targeting high-risk customers with personalised discounts.
                                    
Develop a machine learning model to predict flight prices in India and integration it to app
Personal
                                        Responsibility: Cleaned and analysed flight booking data, engineered features, built regression models and deployed a Streamlit web app to deliver real-time ticket price predictions based on user inputs.
                                        
                                        Achievement: Improved price prediction accuracy by 15% with Random Forest (R² 0.95), revealed 40% holiday price gaps and cut user search time by 30% through a Streamlit app.
                                    
Sentiment analysis of Hilton Hotel reviews in London to measure customer satisfaction
Personal
                                        Responsibility: Performed text preprocessing on 5000+, cleaned TripAdvisor reviews, utilising tokenization and lemmatization, followed by the application of TF-IDF. Multiple models including Logistic Regression, Random Forest, XGBoost and LSTM were trained for sentiment classification.
                                        
                                        Achievement: Achieved 89% accuracy with LSTM, identifying 72% positive and 18% negative reviews, while highlighting recurring issues with staff service and room cleanliness and providing  insights for hotel management to enhance guest experience.
                                    
Analysed Olist e-commerce data to improve efficiency and customer experience
Personal
                                        Responsibility: Cleaning data, analysing customer behavior, regional performance and building RFM segments to guide retention and logistics strategies.
                                        
                                        Achievement: Drove regional marketing and logistics changes by revealing São Paulo 41.9% revenue share and key delivery, payment and product trends.
                                    
Monitoring SLA performance and improving service case resolution and satisfaction in UK.
Personal
                                        Responsibility: Processed 30,000+ service cases, segmented data, built SLA dashboards, tracked trends and integrated CSAT/CES metrics.
                                        
                                        Achievement: Identified 90% of pending cases met SLA, tracked 92% resolution, flagged spikes in failed transactions and fraud and mapped branch performance across six cities.
                                    
Certificates
Professional Skills for the Digital Workforce.
Builds essential skills for the digital job market. Focuses on communication, leadership and career readiness for high-demand roles across industries.
View CertificateData Science.
Developed practical skills in data analysis and visualization on Power BI, Excel, Python, SQL, Statistics, Machine Learning and Deep Learning and its implementation.
View CertificateReferences
Mount Kenya Ewaso Water Partnership (MKEWP)
- (+254) 703 122151
 - dennis.gikunda@mkewp.org