Close

Akash Kumar

Data Engineer

Download Resume

About Me

I am a highly motivated and detail-oriented data engineer with experience in designing, building, and maintaining data pipelines. I have a strong foundation in programming languages such as Python and have experience working with technologies such as Flask for building APIs, MySQL and Neo4j for data storage and retrieval from databases, and AWS for cloud computing. My expertise in data engineering has been honed through hands-on projects, where I have worked on extracting, transforming, and loading data from various sources, creating reports based on client requirements and building scalable data solutions. I am dedicated to delivering high-quality work and am always looking for opportunities to grow my skills and knowledge. I am excited to explore new opportunities as a data engineer and to continue making an impact in the field.

Experience

Knowledge Foundry Business Solution

Data Engineer

Full-time

As a data engineer at my company, I am currently working on a project that involves utilizing various technologies to manage and analyze data. I am using Python as the primary programming language, along with Flask for building APIs, and handle multiple ETL processes to ensure that data is consistent and up-to-date. My role includes creating APIs based on client needs and generating reports to meet their requirements. For data storage and retrieval, I am using both MySQL and Neo4j, and I have been performing complex queries on the databases to extract the required information. Recently, I have been working on an ETL process that transfers data from MySQL to AWS Neptune, as we are transitioning from Neo4j to AWS Neptune for graph database management. I am excited to be a part of this project and continue to develop my skills as a data engineer.

Knowledge Foundry Business Solution

Data Engineer

Intership

During my internship as a data engineer, I had the opportunity to work on various aspects of the data pipeline, including extracting, transforming, and loading (ETL) data from MySQL database to neo4j database. I primarily used Python to build scripts for ETL processes and used technologies such as MySQL and Neo4j for data storage and retrieval. I also worked on creating APIs to allow for seamless data access for clients and generated reports based on their specific needs. My role involved handling the complete ETL process, from data extraction to storage, and ensuring that the data was accurate, consistent, and accessible. The experience allowed me to strengthen my technical skills and understand the importance of data management in a business environment. I am grateful for the opportunity to work with a team of experienced data professionals and contribute to the successful implementation of data projects.

Education

Indian institute of information technology dharward

Aug 2018 - May 2022

Bachelor of Science in Computer Science

Grade: CPGA 8.34

I am proud to have accomplished several achievements in my personal and academic life. I participated in an inter-college pencil sketch competition and demonstrated my artistic abilities. Additionally, I was part of a group of students who went to assist those affected by the floods in Karnataka and helped distribute necessary requirements to those in need. I also had the opportunity to participate in a workshop held at IIIT Hyderabad, where I was able to expand my knowledge and skills. Furthermore, I took part in a hackathon held at IIT Dharwad and was able to collaborate with a team to develop innovative solutions. I am proud to have placed 8th in the hackathon and I am looking forward to participating in more such events in the future. These experiences have allowed me to showcase my talents, give back to my community, and grow both personally and professionally.

St. Anne's High School Patna

2016 - 2017

Science

Grade: 81.40%

Projects

RANDOM DECISION FOREST APPROACH FOR MITIGATING SQL INJECTION ATTACKS

Preventing SQL injection attacks is possible by anticipating and neutralizing potentially harmful SQL statements before they are executed. Our method of choice is the use of Random Decision Forest, which has demonstrated outstanding results in mitigating SQLi attacks, with a 95% accuracy and 97% precision rate.

View Project

PREDICTION OF OCCURRENCE OF VENTRICULAR ARRHYTHMIA USING QRS FEATURES

The purpose of this research is to explore the possibility of predicting Ventricular Tachycardia Arrhythmia (VTA) early through the use of features extracted from QRS complexes. We have analyzed two characteristics, the QRS signed area and R-peak amplitude, and evaluated the VTA prediction accuracy using various machine learning techniques.

View Project

WEDDING IMAGE CLASSIFICATION

This project aims to classify wedding images based on different regions of the world using a new approach. It utilizes UNet and a modified VGG network for segmentation and classification, with the VGG network being modified by adding Inception blocks and Residual networks. Transfer learning was used on the Cifar 10 dataset before fine-tuning for the wedding image dataset and various classifiers were added to the model, with the artificial neural network being the best choice for most classifications. However, for some classifications where the dataset was smaller, traditional machine learning models like SVM and Random Forest were used. The testing data showed an accuracy of 85.96% for Indian vs International images and 83.64% for North India vs South India classification with Random Forest as the best classifier. This project showcases the potential to use image information for classification and can be extended to other fields of research.

Certifications

AWS Cloud Technical Essentials

Issued on: Dec 2022

Neo4j Certified Professional

Issued on: Jan 2022

Machine Learning

Issued on: June 2021

HTML, CSS, and Javascript for Web Developers

Issued on: July 2020

MTA: Introduction to Programming Using Python - Certified 2020 Issued by Microsoft

Issued on: Feb 2020

Programming for Everybody (Getting Started with Python)

Issued on: Jan 2020

The Data Scientist’s Toolbox

Issued on: Oct 2019

Skills

Get in Touch