SlideShare uma empresa Scribd logo
1 de 36
Baixar para ler offline
You	Can’t	Search	Without	Data
Bryan	Bende	– Staff	Software	Engineer	@Hortonworks
NYC	Solr/Lucene	Meetup	– December	7th 2017
2 ©	Hortonworks	Inc.	2011	– 2016.	All	Rights	Reserved
Agenda
à The	Problem
à Apache	NiFi Overview
à Integration	between	NiFi &	Solr
à Recent	&	Future	Work
à Demo	Cool	Stuff!
à Q&A
3 ©	Hortonworks	Inc.	2011	– 2016.	All	Rights	Reserved
About	Me
à Staff	Software	Engineer	@	Hortonworks
à Apache	NiFi PMC	&	Committer
à Contributed	Solr processors	in	March	2015
– https://issues.apache.org/jira/browse/NIFI-461
à bbende@hortonworks.com /	Twitter	@bbende /	bryanbende.com
4 ©	Hortonworks	Inc.	2011	– 2016.	All	Rights	Reserved
The	Problem
5 ©	Hortonworks	Inc.	2011	– 2016.	All	Rights	Reserved
Team	2
It	starts	out	so	simple…
Hey!	We	have	some	
important	data	to	
send	you!	
Cool!	Your	data	is	
really	important	to	
us!
Team	1
This	should	be	easy	right?...
6 ©	Hortonworks	Inc.	2011	– 2016.	All	Rights	Reserved
But	what	about	formats	&	protocols?
Team	2
We	can	publish	
Avro	records	to	a	
Kafka	topic,	does	
that	work?
Oh,	well	we	have	
a	REST	service	
that	accepts	
JSON…
Team	1
7 ©	Hortonworks	Inc.	2011	– 2016.	All	Rights	Reserved
And	what	about	security	&	authentication?
Team	2
Hmm	what	about	
security?	We	can	
authenticate	via	
Kerberos
Sorry,	we	only	
support	2-Way	
TLS	with	
certificates
Team	1
8 ©	Hortonworks	Inc.	2011	– 2016.	All	Rights	Reserved
And	what	about	all	these	devices	at	the	edge?
We	also	need	to	
grab	data	from	all	
these	devices,	how	
are	we	going	to	do	
that?
Team	2
9 ©	Hortonworks	Inc.	2011	– 2016.	All	Rights	Reserved
Wouldn’t	it	be	nice	if	there	was	a	tool	that	could	
help	these	teams?
10 ©	Hortonworks	Inc.	2011	– 2016.	All	Rights	Reserved
Enter	Apache	NiFi…
11 ©	Hortonworks	Inc.	2011	– 2016.	All	Rights	Reserved
Apache NiFi
• Created to address the challenges of global enterprise dataflow
• Key features:
– Visual	Command	and	Control
– Data	Lineage	(Provenance)
– Data	Prioritization
– Data	Buffering/Back-Pressure
– Control	Latency	vs.	Throughput
– Secure	Control	Plane	/	Data	Plane
– Scale	Out	Clustering
– Extensibility
12 ©	Hortonworks	Inc.	2011	– 2016.	All	Rights	Reserved
NiFi Core Concepts
FBP	Term NiFi Term Description
Information	
Packet
FlowFile Each object	moving	through	the	system.
Black Box FlowFile	
Processor
Performs	the	work, doing	some	combination	of	data	routing,	transformation,	
or	mediation	between	systems.
Bounded	
Buffer
Connection The	linkage between	processors, acting	as	queues	and	allowing	various	
processes	to	interact	at	differing	rates.
Scheduler Flow	
Controller
Maintains	the	knowledge	of	how	processes	are	connected, and	manages	the	
threads	and	allocations	thereof	which	all	processes	use.
Subnet Process	
Group
A	set	of	processes	and	their	connections,	which	can	receive	and	send	data	via	
ports.	A	process group	allows	creation	of	entirely	new	component	simply	by	
composition	of	its components.
13 ©	Hortonworks	Inc.	2011	– 2016.	All	Rights	Reserved
Visual	Command	&	Control
• Drag	& drop	processors	to	build	a	flow
• Start,	stop,	&	configure	components	in
real-time
• View	errors	& corresponding	messages
• View	statistics	& health	of the
dataflow
• Create shareable templates	of	
common	flows
14 ©	Hortonworks	Inc.	2011	– 2016.	All	Rights	Reserved
Provenance/Lineage
• Tracks	data	at	each	point	as	it	flows	
through	the	system
• Records,	indexes,	and	makes	events	
available	for	display
• Handles	fan-in/fan-out,	i.e.	merging	
and	splitting	data
• View	attributes	and	content	at	given	
points	in	time
15 ©	Hortonworks	Inc.	2011	– 2016.	All	Rights	Reserved
Prioritization
• Configure	a	prioritizer per	connection
• Determine	what	is	important	for	your	
data	– time	based,	arrival	order,	
importance	of	a	data	set
• Funnel	many	connections	down	to	a	
single	connection	to	prioritize	across	
data	sets
16 ©	Hortonworks	Inc.	2011	– 2016.	All	Rights	Reserved
Back-Pressure
• Configure	back-pressure	per	
connection
• Based	on	number	of	FlowFiles or	
total	size	of	FlowFiles
• Upstream	processor	no	longer	
scheduled	to	run	until	below	
threshold
17 ©	Hortonworks	Inc.	2011	– 2016.	All	Rights	Reserved
Latency	vs.	Throughput
• Choose	between	lower	latency,	or	higher	throughput	on	each	processor
• Higher	throughput	allows	framework	to	batch	together	all	operations	for	the	selected	
amount	of	time	for	improved	performance
• Processor	developer	determines	whether	to	support	this	by	using	@SupportsBatching
annotation
18 ©	Hortonworks	Inc.	2011	– 2016.	All	Rights	Reserved
Architecture	- Standalone
OS/Host
JVM
Flow	Controller
Web	Server
Processor	1 Extension	N
FlowFile
Repository
Content
Repository
Provenance
Repository
Local	Storage
à FlowFile Repository
– Write	Ahead	Log	
– State	of	every	FlowFile
– Pointers	to	content	repository	
(pass-by-reference)
à Content	Repository
– FlowFile content
– Copy-on-write
à Provenance	Repository
– Write	Ahead	Log	+	Lucene Indexes
– Store	&	search	lineage	events
19 ©	Hortonworks	Inc.	2011	– 2016.	All	Rights	Reserved
OS/Host
JVM
Flow	Controller
Web	Server
Processor	1 Extension	N
FlowFile
Repository
Content
Repository
Provenance
Repository
Local	Storage
OS/Host
JVM
Flow	Controller
Web	Server
Processor	1 Extension	N
FlowFile
Repository
Content
Repository
Provenance
Repository
Local	Storage
Architecture	- Cluster
OS/Host
JVM
Flow	Controller
Web	Server
Processor	1 Extension	N
FlowFile
Repository
Content
Repository
Provenance
Repository
Local	Storage
ZooKeeper
à Same	dataflow	on	each	node,	
data	partitioned	across	cluster
à Access	the	UI	from	any	node
à ZooKeeper for	auto-election	of	
Cluster	Coordinator	&	Primary	
Node	
à Cluster	Coordinator	receives	
heartbeats	from	other	nodes,	
manages	joining/	disconnecting
à Primary	Node	for	scheduling	
processors	on	a	single	node
20 ©	Hortonworks	Inc.	2011	– 2016.	All	Rights	Reserved
NiFi &	Solr
21 ©	Hortonworks	Inc.	2011	– 2016.	All	Rights	Reserved
NiFi Solr Processors	
à Support	Solr Cloud	and	stand-alone	Solr instances
à Leverage	SolrJ (CloudSolrClient &	HttpSolrClient)
à GetSolr – Extract	new	documents
à PutSolrContentStream – Stream	flow	file	content	to	an	update	handler
22 ©	Hortonworks	Inc.	2011	– 2016.	All	Rights	Reserved
PutSolrContentStream
à Choose	Solr Type	– Cloud	or	
Standard
à Specify	ZooKeeper hosts,	or	the	
Solr URL	with	core
à Specify	the	Solr path	for	the	
update	handler
à Dynamic	Properties	sent	as	
key/value	pairs	on	request
à Relationships	for	success,	
failure,	and	connection	failure
23 ©	Hortonworks	Inc.	2011	– 2016.	All	Rights	Reserved
GetSolr
à Incrementally	extract	new	documents	
à Main	query	is	*:*,	Solr Query	is	
optional	filter	query
à Date	Field	used	as	filter	query,	from	
last	execution	or	initial	value
à Sorted	by	date	field	and	unique	key
à Cursor	mark	used	behind	the	scenes
à Specify	return	fields,	or	all	if	blank
à Output	Solr XML,	or	Records
24 ©	Hortonworks	Inc.	2011	– 2016.	All	Rights	Reserved
Interacting	With	a	Secure	Solr
à Basic	Auth
– Provider	username/password
à Kerberos
– Set	JAAS	system	property	in	bootstrap.conf
– Provide	name	of	JAAS	entry	for	processor	to	use
à TLS/SSL
– Provide	an	SSL	Context	Service
– One-way	TLS	with	Truststore only
– Two-way	TLS	with	Keystore +	Truststore
25 ©	Hortonworks	Inc.	2011	– 2016.	All	Rights	Reserved
Recent	&	Future	Work
26 ©	Hortonworks	Inc.	2011	– 2016.	All	Rights	Reserved
Problem	– Conversion	Between	Data	Formats
à Specialized	processors	to	operate	on	different	data	types
à Sometimes	missing	conversions
à Sometimes	missing	a	specific	function	for	a	data	type
à Sometimes	implemented	with	different	libraries	causing	inconsistencies
27 ©	Hortonworks	Inc.	2011	– 2016.	All	Rights	Reserved
Solution	– Record	Processing
à Introduce	the	concept	of	a	”record”
– Released	in	Apache	NiFi 1.2.0	(May	2017),	improvements	in	1.3.0	and	1.4.0
à Centralize	the	logic	for	reading/writing	records	into	controller	services
– Readers/Writers	for	CSV,	Json,	Avro,	etc.
à Provide	standard	processors	that	operate	on	records
– ConvertRecord,	QueryRecord,	PartitionRecord,	UpdateRecord,	etc.
à Provide	integration	with	schema	registries
– Local	Schema	Registry,	Hortonworks	Schema	Registry,	Confluent	Schema	Registry
à Can	still	handle	arbitrary	data,	but	process	records	when	appropriate
28 ©	Hortonworks	Inc.	2011	– 2016.	All	Rights	Reserved
Problem	– Variable	Handling
à Need	to	parametrize	values	in	the	flow	per	environment
– Connection	strings,	URLs,	File	System	paths,	etc.
à Can	set	variables	in	bootstrap.conf
– -Dmy.var=foo
à Can	set	a	properties	file	in	nifi.properties
– nifi.variable.registry.properties=production.properties
à Both	require	command	line	access
à Both	require	restart	to	pick	up	changes
29 ©	Hortonworks	Inc.	2011	– 2016.	All	Rights	Reserved
Solution	– First	Class	Variable	Registry	
à Variables	associated	with	a	process	group,	released	in	1.4.0
à Right-click	on	canvas	to	view	variables	for	current	group
à Hierarchical	order	of	precedence,	resolve	closest	reference	to	
component
à Editing	variables	automatically	restarts	any	components	
referencing	the	variables
30 ©	Hortonworks	Inc.	2011	– 2016.	All	Rights	Reserved
Problem	– How	do	I	deploy	my	flow?
à Most	organizations	want	the	classic	development	lifecycle	(dev	->	int ->	prod)
à Can	copy	flow.xml.gz between	environments
– Requires	copying	entire	data	flow
– Can’t	tell	what	changed,	hard	to	diff	if	you	put	in	version	control
– Requires	all	environments	use	the	same	encryption	key	for	sensitive	properties
à Can	make	templates	for	portions	of	the	flow
– Script	creation	of	template	and	deployment	to	next	environment
– Requires	stopping	flow	and	removing	components,	then	re-instantiating	template
– No	easy	way	to	see	changes,	hard	to	rollback
31 ©	Hortonworks	Inc.	2011	– 2016.	All	Rights	Reserved
Solution	– NiFi Registry
à DISCLAIMER	- UNDER	DEVELOPMENT	&	NOT	RELEASED	YET!
à Complimentary	application,	sub-project	of	Apache	NiFi
– https://github.com/apache/nifi-registry
– https://issues.apache.org/jira/projects/NIFIREG
à Central	location	for	storage/management	of	shared	resources	across	NiFi instances
à Initial	capability	to	store	and	retrieve	“versioned	flows”
à A	versioned	flow	is	a	snapshot	of	a	process	group	at	a	given	point	in	time
à Potentially	store	extensions,	shared	data	sets,	and	more	in	the	future
32 ©	Hortonworks	Inc.	2011	– 2016.	All	Rights	Reserved
DEMO!!
33 ©	Hortonworks	Inc.	2011	– 2016.	All	Rights	Reserved
Example	Scenario
à User	data
– https://randomuser.me
à Initially	in	CSV	format
– name.title,name.first,name.last,email,registered
– mr,dennis,reyes,dennis.reyes@example.com,2012-04-10 01:54:19
– miss,carole,gomez,carole.gomez@example.com,2002-12-17 22:15:49
à Requirements
– Convert	CSV	to	JSON
– Add	a	full_name field	with	first	name	+	last	name
– Add	a	gender	field	based	on	title	(i.e.	if	title	==	mr then	MALE)
– Ingest	to	different	Solr collections	depending	on	environment
34 ©	Hortonworks	Inc.	2011	– 2016.	All	Rights	Reserved
Questions?
35 ©	Hortonworks	Inc.	2011	– 2016.	All	Rights	Reserved
Learn	more	and	join	us!
Apache NiFi site
http://nifi.apache.org
Subscribe to and collaborate at
dev@nifi.apache.org
users@nifi.apache.org
Submit Ideas or Issues
https://issues.apache.org/jira/browse/NIFI
Follow us on Twitter
@apachenifi
36 ©	Hortonworks	Inc.	2011	– 2016.	All	Rights	Reserved
Thank	you!

Mais conteúdo relacionado

Mais procurados

Hortonworks Data in Motion Webinar Series Part 7 Apache Kafka Nifi Better Tog...
Hortonworks Data in Motion Webinar Series Part 7 Apache Kafka Nifi Better Tog...Hortonworks Data in Motion Webinar Series Part 7 Apache Kafka Nifi Better Tog...
Hortonworks Data in Motion Webinar Series Part 7 Apache Kafka Nifi Better Tog...Hortonworks
 
Integrating NiFi and Apex
Integrating NiFi and ApexIntegrating NiFi and Apex
Integrating NiFi and ApexBryan Bende
 
Dataflow Management From Edge to Core with Apache NiFi
Dataflow Management From Edge to Core with Apache NiFiDataflow Management From Edge to Core with Apache NiFi
Dataflow Management From Edge to Core with Apache NiFiDataWorks Summit
 
Dataflow with Apache NiFi - Apache NiFi Meetup - 2016 Hadoop Summit - San Jose
Dataflow with Apache NiFi - Apache NiFi Meetup - 2016 Hadoop Summit - San JoseDataflow with Apache NiFi - Apache NiFi Meetup - 2016 Hadoop Summit - San Jose
Dataflow with Apache NiFi - Apache NiFi Meetup - 2016 Hadoop Summit - San JoseAldrin Piri
 
Dataflow Management From Edge to Core with Apache NiFi
Dataflow Management From Edge to Core with Apache NiFiDataflow Management From Edge to Core with Apache NiFi
Dataflow Management From Edge to Core with Apache NiFiDataWorks Summit
 
Data ingestion and distribution with apache NiFi
Data ingestion and distribution with apache NiFiData ingestion and distribution with apache NiFi
Data ingestion and distribution with apache NiFiLev Brailovskiy
 
NiFi Best Practices for the Enterprise
NiFi Best Practices for the EnterpriseNiFi Best Practices for the Enterprise
NiFi Best Practices for the EnterpriseGregory Keys
 
Introduction to Apache NiFi 1.11.4
Introduction to Apache NiFi 1.11.4Introduction to Apache NiFi 1.11.4
Introduction to Apache NiFi 1.11.4Timothy Spann
 
Apache NiFi Crash Course - San Jose Hadoop Summit
Apache NiFi Crash Course - San Jose Hadoop SummitApache NiFi Crash Course - San Jose Hadoop Summit
Apache NiFi Crash Course - San Jose Hadoop SummitAldrin Piri
 
Integrating NiFi and Flink
Integrating NiFi and FlinkIntegrating NiFi and Flink
Integrating NiFi and FlinkBryan Bende
 
Running Apache NiFi with Apache Spark : Integration Options
Running Apache NiFi with Apache Spark : Integration OptionsRunning Apache NiFi with Apache Spark : Integration Options
Running Apache NiFi with Apache Spark : Integration OptionsTimothy Spann
 
Apache NiFi: latest developments for flow management at scale
Apache NiFi: latest developments for flow management at scaleApache NiFi: latest developments for flow management at scale
Apache NiFi: latest developments for flow management at scaleAbdelkrim Hadjidj
 
NiFi Developer Guide
NiFi Developer GuideNiFi Developer Guide
NiFi Developer GuideDeon Huang
 
Apache Zeppelin + LIvy: Bringing Multi Tenancy to Interactive Data Analysis
Apache Zeppelin + LIvy: Bringing Multi Tenancy to Interactive Data AnalysisApache Zeppelin + LIvy: Bringing Multi Tenancy to Interactive Data Analysis
Apache Zeppelin + LIvy: Bringing Multi Tenancy to Interactive Data AnalysisDataWorks Summit/Hadoop Summit
 
Meet HBase 2.0 and Phoenix 5.0
Meet HBase 2.0 and Phoenix 5.0Meet HBase 2.0 and Phoenix 5.0
Meet HBase 2.0 and Phoenix 5.0DataWorks Summit
 
Local Apache NiFi Processor Debug
Local Apache NiFi Processor DebugLocal Apache NiFi Processor Debug
Local Apache NiFi Processor DebugDeon Huang
 

Mais procurados (20)

Apache Nifi Crash Course
Apache Nifi Crash CourseApache Nifi Crash Course
Apache Nifi Crash Course
 
Hortonworks Data in Motion Webinar Series Part 7 Apache Kafka Nifi Better Tog...
Hortonworks Data in Motion Webinar Series Part 7 Apache Kafka Nifi Better Tog...Hortonworks Data in Motion Webinar Series Part 7 Apache Kafka Nifi Better Tog...
Hortonworks Data in Motion Webinar Series Part 7 Apache Kafka Nifi Better Tog...
 
Apache NiFi Crash Course Intro
Apache NiFi Crash Course IntroApache NiFi Crash Course Intro
Apache NiFi Crash Course Intro
 
Integrating NiFi and Apex
Integrating NiFi and ApexIntegrating NiFi and Apex
Integrating NiFi and Apex
 
Dataflow with Apache NiFi
Dataflow with Apache NiFiDataflow with Apache NiFi
Dataflow with Apache NiFi
 
Dataflow Management From Edge to Core with Apache NiFi
Dataflow Management From Edge to Core with Apache NiFiDataflow Management From Edge to Core with Apache NiFi
Dataflow Management From Edge to Core with Apache NiFi
 
Dataflow with Apache NiFi - Apache NiFi Meetup - 2016 Hadoop Summit - San Jose
Dataflow with Apache NiFi - Apache NiFi Meetup - 2016 Hadoop Summit - San JoseDataflow with Apache NiFi - Apache NiFi Meetup - 2016 Hadoop Summit - San Jose
Dataflow with Apache NiFi - Apache NiFi Meetup - 2016 Hadoop Summit - San Jose
 
Dataflow Management From Edge to Core with Apache NiFi
Dataflow Management From Edge to Core with Apache NiFiDataflow Management From Edge to Core with Apache NiFi
Dataflow Management From Edge to Core with Apache NiFi
 
Data ingestion and distribution with apache NiFi
Data ingestion and distribution with apache NiFiData ingestion and distribution with apache NiFi
Data ingestion and distribution with apache NiFi
 
NiFi Best Practices for the Enterprise
NiFi Best Practices for the EnterpriseNiFi Best Practices for the Enterprise
NiFi Best Practices for the Enterprise
 
Introduction to Apache NiFi 1.11.4
Introduction to Apache NiFi 1.11.4Introduction to Apache NiFi 1.11.4
Introduction to Apache NiFi 1.11.4
 
Apache NiFi Crash Course - San Jose Hadoop Summit
Apache NiFi Crash Course - San Jose Hadoop SummitApache NiFi Crash Course - San Jose Hadoop Summit
Apache NiFi Crash Course - San Jose Hadoop Summit
 
Integrating NiFi and Flink
Integrating NiFi and FlinkIntegrating NiFi and Flink
Integrating NiFi and Flink
 
Running Apache NiFi with Apache Spark : Integration Options
Running Apache NiFi with Apache Spark : Integration OptionsRunning Apache NiFi with Apache Spark : Integration Options
Running Apache NiFi with Apache Spark : Integration Options
 
Apache NiFi: latest developments for flow management at scale
Apache NiFi: latest developments for flow management at scaleApache NiFi: latest developments for flow management at scale
Apache NiFi: latest developments for flow management at scale
 
Building a Smarter Home with Apache NiFi and Spark
Building a Smarter Home with Apache NiFi and SparkBuilding a Smarter Home with Apache NiFi and Spark
Building a Smarter Home with Apache NiFi and Spark
 
NiFi Developer Guide
NiFi Developer GuideNiFi Developer Guide
NiFi Developer Guide
 
Apache Zeppelin + LIvy: Bringing Multi Tenancy to Interactive Data Analysis
Apache Zeppelin + LIvy: Bringing Multi Tenancy to Interactive Data AnalysisApache Zeppelin + LIvy: Bringing Multi Tenancy to Interactive Data Analysis
Apache Zeppelin + LIvy: Bringing Multi Tenancy to Interactive Data Analysis
 
Meet HBase 2.0 and Phoenix 5.0
Meet HBase 2.0 and Phoenix 5.0Meet HBase 2.0 and Phoenix 5.0
Meet HBase 2.0 and Phoenix 5.0
 
Local Apache NiFi Processor Debug
Local Apache NiFi Processor DebugLocal Apache NiFi Processor Debug
Local Apache NiFi Processor Debug
 

Semelhante a You Can't Search Without Data

Mission to NARs with Apache NiFi
Mission to NARs with Apache NiFiMission to NARs with Apache NiFi
Mission to NARs with Apache NiFiHortonworks
 
Future of Data New Jersey - HDF 3.0 Deep Dive
Future of Data New Jersey - HDF 3.0 Deep DiveFuture of Data New Jersey - HDF 3.0 Deep Dive
Future of Data New Jersey - HDF 3.0 Deep DiveAldrin Piri
 
State of the Apache NiFi Ecosystem & Community
State of the Apache NiFi Ecosystem & CommunityState of the Apache NiFi Ecosystem & Community
State of the Apache NiFi Ecosystem & CommunityAccumulo Summit
 
The Avant-garde of Apache NiFi
The Avant-garde of Apache NiFiThe Avant-garde of Apache NiFi
The Avant-garde of Apache NiFiJoe Percivall
 
Big Data Day LA 2016/ Big Data Track - Building scalable enterprise data flow...
Big Data Day LA 2016/ Big Data Track - Building scalable enterprise data flow...Big Data Day LA 2016/ Big Data Track - Building scalable enterprise data flow...
Big Data Day LA 2016/ Big Data Track - Building scalable enterprise data flow...Data Con LA
 
Integrating Apache NiFi and Apache Flink
Integrating Apache NiFi and Apache FlinkIntegrating Apache NiFi and Apache Flink
Integrating Apache NiFi and Apache FlinkIsheeta Sanghi
 
Integrating Apache NiFi and Apache Flink
Integrating Apache NiFi and Apache FlinkIntegrating Apache NiFi and Apache Flink
Integrating Apache NiFi and Apache FlinkIsheeta Sanghi
 
Integrating Apache NiFi and Apache Flink
Integrating Apache NiFi and Apache FlinkIntegrating Apache NiFi and Apache Flink
Integrating Apache NiFi and Apache FlinkHortonworks
 
Integrating Apache NiFi and Apache Flink
Integrating Apache NiFi and Apache FlinkIntegrating Apache NiFi and Apache Flink
Integrating Apache NiFi and Apache FlinkIsheeta Sanghi
 
Connecting the Drops with Apache NiFi & Apache MiNiFi
Connecting the Drops with Apache NiFi & Apache MiNiFiConnecting the Drops with Apache NiFi & Apache MiNiFi
Connecting the Drops with Apache NiFi & Apache MiNiFiDataWorks Summit
 
Data at Scales and the Values of Starting Small with Apache NiFi & MiNiFi
Data at Scales and the Values of Starting Small with Apache NiFi & MiNiFiData at Scales and the Values of Starting Small with Apache NiFi & MiNiFi
Data at Scales and the Values of Starting Small with Apache NiFi & MiNiFiAldrin Piri
 
Apache Zeppelin + Livy: Bringing Multi Tenancy to Interactive Data Analysis
Apache Zeppelin + Livy: Bringing Multi Tenancy to Interactive Data AnalysisApache Zeppelin + Livy: Bringing Multi Tenancy to Interactive Data Analysis
Apache Zeppelin + Livy: Bringing Multi Tenancy to Interactive Data AnalysisDataWorks Summit/Hadoop Summit
 
Running Zeppelin in Enterprise
Running Zeppelin in EnterpriseRunning Zeppelin in Enterprise
Running Zeppelin in EnterpriseDataWorks Summit
 
Hive acid and_2.x new_features
Hive acid and_2.x new_featuresHive acid and_2.x new_features
Hive acid and_2.x new_featuresAlberto Romero
 
Integrating Apache NiFi and Apache Apex
Integrating Apache NiFi and Apache Apex Integrating Apache NiFi and Apache Apex
Integrating Apache NiFi and Apache Apex Apache Apex
 
Using Apache® NiFi to Empower Self-Organising Teams
Using Apache® NiFi to Empower Self-Organising TeamsUsing Apache® NiFi to Empower Self-Organising Teams
Using Apache® NiFi to Empower Self-Organising TeamsSebastian Carroll
 

Semelhante a You Can't Search Without Data (20)

Mission to NARs with Apache NiFi
Mission to NARs with Apache NiFiMission to NARs with Apache NiFi
Mission to NARs with Apache NiFi
 
Future of Data New Jersey - HDF 3.0 Deep Dive
Future of Data New Jersey - HDF 3.0 Deep DiveFuture of Data New Jersey - HDF 3.0 Deep Dive
Future of Data New Jersey - HDF 3.0 Deep Dive
 
State of the Apache NiFi Ecosystem & Community
State of the Apache NiFi Ecosystem & CommunityState of the Apache NiFi Ecosystem & Community
State of the Apache NiFi Ecosystem & Community
 
Nifi workshop
Nifi workshopNifi workshop
Nifi workshop
 
The Avant-garde of Apache NiFi
The Avant-garde of Apache NiFiThe Avant-garde of Apache NiFi
The Avant-garde of Apache NiFi
 
The Avant-garde of Apache NiFi
The Avant-garde of Apache NiFiThe Avant-garde of Apache NiFi
The Avant-garde of Apache NiFi
 
Big Data Day LA 2016/ Big Data Track - Building scalable enterprise data flow...
Big Data Day LA 2016/ Big Data Track - Building scalable enterprise data flow...Big Data Day LA 2016/ Big Data Track - Building scalable enterprise data flow...
Big Data Day LA 2016/ Big Data Track - Building scalable enterprise data flow...
 
Integrating Apache NiFi and Apache Flink
Integrating Apache NiFi and Apache FlinkIntegrating Apache NiFi and Apache Flink
Integrating Apache NiFi and Apache Flink
 
Integrating Apache NiFi and Apache Flink
Integrating Apache NiFi and Apache FlinkIntegrating Apache NiFi and Apache Flink
Integrating Apache NiFi and Apache Flink
 
Integrating Apache NiFi and Apache Flink
Integrating Apache NiFi and Apache FlinkIntegrating Apache NiFi and Apache Flink
Integrating Apache NiFi and Apache Flink
 
Integrating Apache NiFi and Apache Flink
Integrating Apache NiFi and Apache FlinkIntegrating Apache NiFi and Apache Flink
Integrating Apache NiFi and Apache Flink
 
Apache NiFi in the Hadoop Ecosystem
Apache NiFi in the Hadoop Ecosystem Apache NiFi in the Hadoop Ecosystem
Apache NiFi in the Hadoop Ecosystem
 
Connecting the Drops with Apache NiFi & Apache MiNiFi
Connecting the Drops with Apache NiFi & Apache MiNiFiConnecting the Drops with Apache NiFi & Apache MiNiFi
Connecting the Drops with Apache NiFi & Apache MiNiFi
 
Data at Scales and the Values of Starting Small with Apache NiFi & MiNiFi
Data at Scales and the Values of Starting Small with Apache NiFi & MiNiFiData at Scales and the Values of Starting Small with Apache NiFi & MiNiFi
Data at Scales and the Values of Starting Small with Apache NiFi & MiNiFi
 
Apache Nifi Crash Course
Apache Nifi Crash CourseApache Nifi Crash Course
Apache Nifi Crash Course
 
Apache Zeppelin + Livy: Bringing Multi Tenancy to Interactive Data Analysis
Apache Zeppelin + Livy: Bringing Multi Tenancy to Interactive Data AnalysisApache Zeppelin + Livy: Bringing Multi Tenancy to Interactive Data Analysis
Apache Zeppelin + Livy: Bringing Multi Tenancy to Interactive Data Analysis
 
Running Zeppelin in Enterprise
Running Zeppelin in EnterpriseRunning Zeppelin in Enterprise
Running Zeppelin in Enterprise
 
Hive acid and_2.x new_features
Hive acid and_2.x new_featuresHive acid and_2.x new_features
Hive acid and_2.x new_features
 
Integrating Apache NiFi and Apache Apex
Integrating Apache NiFi and Apache Apex Integrating Apache NiFi and Apache Apex
Integrating Apache NiFi and Apache Apex
 
Using Apache® NiFi to Empower Self-Organising Teams
Using Apache® NiFi to Empower Self-Organising TeamsUsing Apache® NiFi to Empower Self-Organising Teams
Using Apache® NiFi to Empower Self-Organising Teams
 

Último

call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️Delhi Call girls
 
A Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docxA Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docxComplianceQuest1
 
HR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comHR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comFatema Valibhai
 
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...MyIntelliSource, Inc.
 
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...ICS
 
why an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdfwhy an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdfjoe51371421
 
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AISyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AIABDERRAOUF MEHENNI
 
Diamond Application Development Crafting Solutions with Precision
Diamond Application Development Crafting Solutions with PrecisionDiamond Application Development Crafting Solutions with Precision
Diamond Application Development Crafting Solutions with PrecisionSolGuruz
 
Unlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language ModelsUnlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language Modelsaagamshah0812
 
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...gurkirankumar98700
 
Right Money Management App For Your Financial Goals
Right Money Management App For Your Financial GoalsRight Money Management App For Your Financial Goals
Right Money Management App For Your Financial GoalsJhone kinadey
 
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...OnePlan Solutions
 
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...Steffen Staab
 
How To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected WorkerHow To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected WorkerThousandEyes
 
Salesforce Certified Field Service Consultant
Salesforce Certified Field Service ConsultantSalesforce Certified Field Service Consultant
Salesforce Certified Field Service ConsultantAxelRicardoTrocheRiq
 
5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdf5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdfWave PLM
 

Último (20)

call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
 
A Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docxA Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docx
 
HR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comHR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.com
 
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
 
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
 
Microsoft AI Transformation Partner Playbook.pdf
Microsoft AI Transformation Partner Playbook.pdfMicrosoft AI Transformation Partner Playbook.pdf
Microsoft AI Transformation Partner Playbook.pdf
 
why an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdfwhy an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdf
 
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AISyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
 
Diamond Application Development Crafting Solutions with Precision
Diamond Application Development Crafting Solutions with PrecisionDiamond Application Development Crafting Solutions with Precision
Diamond Application Development Crafting Solutions with Precision
 
Vip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS Live
Vip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS LiveVip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS Live
Vip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS Live
 
Unlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language ModelsUnlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language Models
 
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
 
Right Money Management App For Your Financial Goals
Right Money Management App For Your Financial GoalsRight Money Management App For Your Financial Goals
Right Money Management App For Your Financial Goals
 
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
 
Call Girls In Mukherjee Nagar 📱 9999965857 🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...
Call Girls In Mukherjee Nagar 📱  9999965857  🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...Call Girls In Mukherjee Nagar 📱  9999965857  🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...
Call Girls In Mukherjee Nagar 📱 9999965857 🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...
 
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
 
How To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected WorkerHow To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected Worker
 
Salesforce Certified Field Service Consultant
Salesforce Certified Field Service ConsultantSalesforce Certified Field Service Consultant
Salesforce Certified Field Service Consultant
 
5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdf5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdf
 

You Can't Search Without Data