Arvind Kalyan

17 Seguidores

Objective: Engineer systems & algorithms to help users get to the content they need. Summary: Hands-on experience with distributed systems for both online and offline data processing. Designed and implemented low-latency high-throughput online retrieval systems from scratch, doing micro and millisecond latencies for few hundred QPS per node (without caching). Designed and implemented simple & extensible data-infrastructure for offline data processing pipelines on hadoop. These range from simple search-index building pipelines, to non-trivial pipelines to do machine learning algorithms. Using tools like plain java map/reduce, pig, hive, spark, scalding and so forth (ordered by familiari...

concurrency jvm stm multi-core lock-free transactional memory overview presto mapreduce hadoop hdfs spark big data

Ver mais

Atividades
Sobre

Ver mais

Arvind Kalyan

Apresentações

Big Data - An Overview

jvm/java - towards lock-free concurrency

Gostaram

Invokedynamic in 45 Minutes

Distributed Consensus A.K.A. "What do we eat for lunch?"

DocValues aka. Column Stride Fields in Lucene 4.0 - By Willnauer Simon