Personal Information
Organização/Local de trabalho
San Francisco Bay Area United States
Setor
Electronics / Computer Hardware
Sobre
I've built data pipeline using Apache Spark, Hadoop & scikit-learn & I've done data munging - cleaning up of the data for processing, feature engineering as well as creating ML model with the clean & transformed data. I've solved ML both supervised & unsupervised ML. using Python, Scala & R.
Additionally, I've done sentiment analysis, text analysis, & ML projects.
As I was the only engineer in my team & picked up required technology like Hadoop, Apache Spark on my own & evaluated Python, R & Scala programming language.
Automate the Sales Credit Allocation for the sales transaction. Nearly 5% of X million sales transactions need to be manually allocated to the right Sales Account Team f...
Marcadores
collaborative computing
hadoop
networking
apache mahout
association rule
clustering
data mining
Ver mais
Apresentações
(8)Gostaram
(8)Frustration-Reduced PySpark: Data engineering with DataFrames
Ilya Ganelin
•
Há 8 anos
sparklyr - Jeff Allen
Sri Ambati
•
Há 7 anos
A lightweight browser start page - 3x3 Links
Federico Elles
•
Há 15 anos
The Secret Sauce of Successful Teams
Sven Peters
•
Há 7 anos
Web Services Testing
Vladimir Soghoyan
•
Há 10 anos
Clustering and Association Rule
Cisco
•
Há 9 anos
Personal Information
Organização/Local de trabalho
San Francisco Bay Area United States
Setor
Electronics / Computer Hardware
Sobre
I've built data pipeline using Apache Spark, Hadoop & scikit-learn & I've done data munging - cleaning up of the data for processing, feature engineering as well as creating ML model with the clean & transformed data. I've solved ML both supervised & unsupervised ML. using Python, Scala & R.
Additionally, I've done sentiment analysis, text analysis, & ML projects.
As I was the only engineer in my team & picked up required technology like Hadoop, Apache Spark on my own & evaluated Python, R & Scala programming language.
Automate the Sales Credit Allocation for the sales transaction. Nearly 5% of X million sales transactions need to be manually allocated to the right Sales Account Team f...
Marcadores
collaborative computing
hadoop
networking
apache mahout
association rule
clustering
data mining
Ver mais