Header Background

Data Science Portfolio: Productivity Analysis

About Me

Passionate about technology, innovation, and data-driven decision-making within finance and business. Diverse experiences include logistics and project management, honing skills through leading roles in project event production, business intelligence and IT analysis.

Skilled in Python, SQL, Tableau, and machine learning, committed to driving excellence through problem-solving and adaptability. Eager to contribute expertise to impactful projects at the intersection of technology, finance, and innovation, leveraging entrepreneurial spirit for tangible results.

With a solid foundation in data analysis, I'm poised to apply insights and drive growth in tech companies, banks, financial management firms, and import/export companies.

Eliel Almeida

Click on my Picture to view my LinkedIn Profile

Profile Picture

Project Overview

This dataset includes important attributes of the garment manufacturing process and the productivity of the employees which had been collected manually and also been validated by the industry experts.

The project demonstrates advanced capabilities in machine learning and neural networks using SKLearn and TensorFlow, with OS and Dotenv for operational optimization.

The goal is to create an Algorithmic Machine Learning Model based on easily accessible business features. I built an in-depth Data Analysis with Interactive Graphs using Tableau to investigate productivity drivers.

I also formatted a Database where data can be stored, organized, related, secured and distributed using SQL and frameworks such as SQLAlchemy and MySQLConnector.

This dataset includes important attributes of the garment manufacturing process and employee productivity, collected manually and validated by industry experts. Productivity is defined as Total Output divided by Total Input (quantity per hour divided by total cost per hour).

The data was cleaned and formatted using Pandas, with Matplotlib and Seaborn for visualization. Click the GitHub logo to view the repository:

GitHub Repository

Interactive Data Analysis Dashboard

Machine Learning Productivity Prediction

This data was a obtained in the UC Irvine Machine Learning Repository, and there is a article published in International Journal of Business Intelligence and Data Mining on the mining process. Defining the metrics is essential, what productivity means here is: To tell you how efficiently and effectively your workforce is delivering results over a set period. Therefore having Total Output divided by Total Input we can find Productivity, since it is a garment manufacturing company we can assume that is quantity per hour divided by the total cost per hour.

The model is working without any overfitting issues, in full capacity outperforming the benchmark which is a MIT licensed study (see GibHub for links toIrvine Website, Mining Study and Benchmark). Please fill it up and test yourself, here is a description of the features:

Date: Date range is only in 2015, covering 3 months. (Example: 15/02/2015)
Week Quarter: A portion of the month, divided into four quarters. (Example: 2)
Department: Associated department with the instance. (Example: Finishing)
Day of Week: Set the day of the week, Friday is the only day off. (Example: Wednesday)
Targeted Productivity: Set by Authority for each team for each day. (Example: 0.70)
Team: Associated team number with the instance. (Example: 8)
Standard Minute Value: The allocated time for a task. (Example: 3.90)
Work in Progress: Number of unfinished items for products. (Example: 0.0)
Incentive: Amount of financial reward. (Example: 0.0)
Idle Time: Time production was interrupted. (Example: 0.0)
Idle Men: Workers idle due to production interruption. (Example: 0)
Style Change: Number of style changes for a product. (Example: 0)
Number of Workers: Workers on each team. (Example: 8)
Over Time: Overtime amount per team in minutes. (Example: 960.0)

Test the Machine Learning Model