Cisco

I've built data pipelines using Apache Spark, Hadoop, and scikit-learn, covering data munging (cleaning the data for processing), feature engineering, and building ML models on the cleaned and transformed data. I've solved both supervised and unsupervised ML problems using Python, Scala, and R. Additionally, I've done sentiment analysis, text analysis, and other ML projects. As the only engineer on my team, I picked up the required technologies, such as Hadoop and Apache Spark, on my own and evaluated the Python, R, and Scala programming languages. Automate the Sales Credit Allocation for sales transactions. Nearly 5% of X million sales transactions need to be manually allocated to the right Sales Account Team f...

collaborative computing, hadoop, networking, apache mahout, association rule, clustering, data mining

Frustration-Reduced PySpark: Data engineering with DataFrames

sparklyr - Jeff Allen

A lightweight browser start page - 3x3 Links

The Secret Sauce of Successful Teams

Web Services Testing

Network Intrusion Detection Analysis using Random Forest Algorithm on Apache Mahout

Clustering and Association Rule

Time Series Forecasting for Google Inc. and Break-even analysis for Google glass.