Personal Information
Organization / Workplace
San Francisco Bay Area, United States
Industry
Electronics / Computer Hardware
About
I've built data pipelines using Apache Spark, Hadoop, and scikit-learn, and I've done data munging (cleaning up data for processing), feature engineering, and building ML models on the cleaned and transformed data. I've solved both supervised and unsupervised ML problems using Python, Scala, and R.
Additionally, I've done sentiment analysis, text analysis, and ML projects.
As the only engineer on my team, I picked up the required technologies, such as Hadoop and Apache Spark, on my own and evaluated the Python, R, and Scala programming languages.
Automate the Sales Credit Allocation for the sales transaction. Nearly 5% of X million sales transactions need to be manually allocated to the right Sales Account Team f...
Tags
collaborative computing
hadoop
networking
apache mahout
association rule
clustering
data mining
- Presentations
- Documents
- Infographics
Frustration-Reduced PySpark: Data engineering with DataFrames
Ilya Ganelin
8 years ago
sparklyr - Jeff Allen
Sri Ambati
7 years ago
A lightweight browser start page - 3x3 Links
Federico Elles
15 years ago
The Secret Sauce of Successful Teams
Sven Peters
7 years ago
Web Services Testing
Vladimir Soghoyan
10 years ago
Clustering and Association Rule
Cisco
9 years ago