Personal Information
Organization / Workplace
Espoo Finland
Position
Big Data Analytics Architect
Industry
Technology / Software / Internet
Tags
deep web
web crawling
hidden web
web crawler
web databases
search interfaces
web forms
web size
collaborative crawling
intelligent crawling
web metrics
apache hadoop
hadoop tuning
image similarity search
hadoop
mapreduce
tutorial
web ecosystem
deep web characterization
review
adaptive web crawling
atlanta
crawler architecture
crawling strategies
hadoop smart deployment
russian web
image search
image retrieval
hadoop job execution
map waves
image indexing
big data
web
form classifier
web data
hadoop cluster
hadoop jobs
hadoop optimization
hadoop job history
hadoop summit
amsterdam
hadoop joins
hadoop monitoring
algorithms
web engineering
web frontier
web robots
web spiders
spiders
robots
web coverage
web link structure
distributed web crawling
url frontier
stratified random sampling
random sampling
finland
web intelligence
wi-iat
usa
web structure
adaptive crawling
incremental crawling
focused crawling
publicly indexable web
web database
search forms
denmark
google
russian deep web
interface crawlers
perl
non-html forms
javascript-rich
web form crawler
mysql
dequel
deque
form query language
invisible web
ip random sampling
deep web size
dissertation
turku
lectio praecursoria
thesis
phd
js-rich
web crawlers
search interface
decision tree
aalborg
crawling algorithms
hdfs block size
hdfs
grid5k
scalability
dns-load balancing
toulouse
ip address
web characterization
host-ip clustering
virtual hosting
stratified sampling
high-dimensional indexing
multimedia retrieval
multithreaded mapper
smart deployment
mapfile
best practice
Presentations (8)
Documents (3)
Likes (64)
Viimeinen keisari
Sophia Shestakova
•
5 years ago
How Will AI Change the Role of the Data Scientist?
Hugo Gävert
•
7 years ago
10 more lessons learned from building Machine Learning systems
Xavier Amatriain
•
8 years ago
Apache Hadoop at 10
Cloudera, Inc.
•
8 years ago
Enabling Python to be a Better Big Data Citizen
Wes McKinney
•
8 years ago
2016 Spark Summit East Keynote: Matei Zaharia
Databricks
•
8 years ago
Node Labels in YARN
DataWorks Summit
•
8 years ago
Nl HUG 2016 Feb Hadoop security from the trenches
Bolke de Bruin
•
8 years ago
Ibis: Scaling Python Analytics on Hadoop and Impala
Wes McKinney
•
8 years ago
Helsinki Spark Meetup Nov 20 2015
Chris Fregly
•
8 years ago
Kudu: New Hadoop Storage for Fast Analytics on Fast Data
Cloudera, Inc.
•
8 years ago
Hadoop Backup and Disaster Recovery
Cloudera, Inc.
•
11 years ago
Frontera-Open Source Large Scale Web Crawling Framework
sixtyone
•
8 years ago
Interactive Apache Spark in Your Browser
Cloudera, Inc.
•
8 years ago
PySpark Best Practices
Cloudera, Inc.
•
8 years ago
SQL-on-Hadoop Tutorial
Daniel Abadi
•
8 years ago
Talk given at Internet of Things Helsinki Meetup held at the premise of Zalando
Nissanka Wickremasinghe
•
8 years ago
Distro-independent Hadoop cluster management
DataWorks Summit
•
8 years ago
Apache HBase Performance Tuning
Lars Hofhansl
•
8 years ago
Sampling national deep Web
Denis Shestakov
•
12 years ago
Intelligent web crawling
Denis Shestakov
•
10 years ago
Examplar-based inpainting
Olivier Le Meur
•
9 years ago
Hortonworks Technical Workshop - Operational Best Practices Workshop
Hortonworks
•
9 years ago
Search Interfaces on the Web: Querying and Characterizing, PhD dissertation
Denis Shestakov
•
10 years ago
Terabyte-scale image similarity search: experience and best practice
Denis Shestakov
•
10 years ago
Current challenges in web crawling
Denis Shestakov
•
10 years ago
The Evolution of Hadoop at Spotify - Through Failures and Pain
Rafał Wojdyła
•
9 years ago
Improving Hadoop Cluster Performance via Linux Configuration
DataWorks Summit
•
9 years ago
Graph Structure in the Web - Revisited. WWW2014 Web Science Track
Chris Bizer
•
10 years ago