SlideShare uma empresa Scribd logo
1 de 47
Baixar para ler offline
Copyright	
  ©	
  2013	
  Splunk	
  Inc.	
  

ArchitecAng	
  and	
  Sizing	
  	
  
your	
  Splunk	
  Deployment	
  
Simeon	
  Yep	
  

Sales	
  Engineering	
  Manager,	
  Splunk	
  

#splunkconf	
  
Legal	
  NoAces	
  
During	
  the	
  course	
  of	
  this	
  presentaAon,	
  we	
  may	
  make	
  forward-­‐looking	
  statements	
  regarding	
  future	
  events	
  or	
  the	
  
expected	
  performance	
  of	
  the	
  company.	
  We	
  cauAon	
  you	
  that	
  such	
  statements	
  reflect	
  our	
  current	
  
expectaAons	
  and	
  esAmates	
  based	
  on	
  factors	
  currently	
  known	
  to	
  us	
  and	
  that	
  actual	
  events	
  or	
  results	
  could	
  differ	
  
materially.	
  For	
  important	
  factors	
  that	
  may	
  cause	
  actual	
  results	
  to	
  differ	
  from	
  those	
  contained	
  in	
  our	
  forward-­‐
looking	
  statements,	
  please	
  review	
  our	
  filings	
  with	
  the	
  SEC.	
  	
  The	
  forward-­‐looking	
  statements	
  made	
  in	
  this	
  
presentaAon	
  are	
  being	
  made	
  as	
  of	
  the	
  Ame	
  and	
  date	
  of	
  its	
  live	
  presentaAon.	
  	
  If	
  reviewed	
  aTer	
  its	
  live	
  
presentaAon,	
  this	
  presentaAon	
  may	
  not	
  contain	
  current	
  or	
  accurate	
  informaAon.	
  	
  	
  We	
  do	
  not	
  assume	
  any	
  
obligaAon	
  to	
  update	
  any	
  forward-­‐looking	
  statements	
  we	
  may	
  make.	
  	
  In	
  addiAon,	
  any	
  informaAon	
  about	
  
our	
  roadmap	
  outlines	
  our	
  general	
  product	
  direcAon	
  and	
  is	
  subject	
  to	
  change	
  at	
  any	
  Ame	
  without	
  noAce.	
  	
  It	
  is	
  for	
  
informaAonal	
  purposes	
  only	
  and	
  shall	
  not,	
  be	
  incorporated	
  into	
  any	
  contract	
  or	
  other	
  commitment.	
  	
  Splunk	
  
undertakes	
  no	
  obligaAon	
  either	
  to	
  develop	
  the	
  features	
  or	
  funcAonality	
  described	
  or	
  to	
  include	
  any	
  such	
  feature	
  or	
  
funcAonality	
  in	
  a	
  future	
  release.	
  
	
  
Splunk,	
  Splunk>,	
  Splunk	
  Storm,	
  Listen	
  to	
  Your	
  Data,	
  SPL	
  and	
  The	
  Engine	
  for	
  Machine	
  Data	
  are	
  trademarks	
  and	
  registered	
  trademarks	
  of	
  
Splunk	
  Inc.	
  in	
  the	
  United	
  States	
  and	
  other	
  countries.	
  All	
  other	
  brand	
  names,	
  product	
  names,	
  or	
  trademarks	
  belong	
  to	
  their	
  respecCve	
  
owners.	
  	
  
©2013	
  Splunk	
  Inc.	
  All	
  rights	
  reserved.	
  

2	
  
IntroducAon	
  
About	
  Me	
  
! 
! 

5+	
  years	
  @	
  Splunk	
  
Experience:	
  	
  	
  

–  SupporAng,	
  administering,	
  and	
  architecAng	
  large	
  scale	
  deployments	
  
–  OEM	
  –	
  technical	
  sales	
  
–  Strategic	
  accounts	
  –	
  technical	
  sales	
  
! 
! 

Based	
  in	
  HQ	
  (San	
  Francisco	
  office)	
  
Currently:	
  	
  Business	
  Development,	
  Technical	
  Synergies	
  	
  

4	
  
Agenda	
  
! 

Sizing	
  Fundamentals	
  
ArchitecAng	
  Fundamentals	
  

! 

Deployment	
  Topologies	
  

! 

	
  

5	
  
Sizing	
  Fundamentals	
  
Sizing	
  Fundamentals	
  
! 
! 
! 
! 

Understand	
  the	
  sizing	
  factors	
  
Data	
  volume	
  	
  
Search	
  volume	
  
Sizing	
  sheet	
  

7	
  
Sizing	
  Factors	
  
! 

How	
  much	
  data	
  (raw	
  sizes)?	
  	
  

– 
– 
– 
– 
! 

Daily	
  volume	
  	
  
Peak	
  volume	
  
Retained	
  volume	
  (archive	
  size)	
  
Future	
  volume?	
  

How	
  much	
  searching?	
  

–  Use	
  cases	
  
–  How	
  many	
  people?	
  How	
  oTen?	
  
! 

Jobs	
  

–  SummarizaAon,	
  alerAng,	
  reporAng	
  
8	
  
Data	
  Volumes	
  
! 

EsAmate	
  input	
  volume	
  

–  Verify	
  raw	
  log	
  sizes	
  
–  Leverage	
  _internal	
  metrics	
  to	
  get	
  actual	
  input	
  volumes	
  
! 

Confirm	
  esAmates	
  with	
  actual	
  data	
  

–  Create	
  a	
  baseline	
  with	
  real	
  or	
  simulated	
  data	
  
–  Find	
  compression	
  rates	
  (range	
  from	
  30%-­‐120%,	
  typically	
  50%)	
  
–  Determine	
  retenAon	
  needs	
  
! 

Document	
  use	
  cases	
  

–  Use	
  case	
  determines	
  search	
  needs	
  
–  Plan	
  for	
  expansion	
  as	
  adopAon	
  grows	
  (search	
  and	
  volume)	
  
9	
  
Data	
  Sizing	
  Exercise	
  
! 

Via	
  Filesystem	
  
Use	
  the	
  Splunk	
  log	
  files:	
  	
  metrics.log	
  or	
  license_usage.log	
  

! 

OpAonally:	
  

! 

–  License	
  report	
  view	
  in	
  Splunk	
  Enterprise	
  6	
  
–  S.o.S	
  app	
  in	
  5.x	
  

10	
  
Search	
  Volumes	
  
! 

! 

! 

Gather	
  use	
  case	
  informaAon	
  

–  How	
  much	
  ad-­‐hoc	
  searching?	
  
–  How	
  much	
  background	
  searching?	
  

Ad-­‐hoc	
  searching	
  

–  Evaluate	
  the	
  data	
  being	
  searched	
  
–  Evaluate	
  the	
  Ame	
  duraAon	
  (real-­‐Ame	
  vs	
  historic)	
  
–  Real-­‐Ame	
  searches	
  are	
  typically	
  less	
  overhead	
  

Background	
  searching	
  

–  AlerAng	
  and	
  monitoring	
  
–  General	
  reports	
  	
  
–  Summary	
  indexing	
  
11	
  
Final	
  Sizing	
  Numbers	
  
! 

Data	
  capacity	
  	
  

–  Daily	
  and	
  peak	
  
! 

User	
  capacity	
  	
  

–  Concurrent	
  and	
  total	
  
! 

Search	
  capacity	
  	
  

–  Concurrent	
  and	
  total	
  
	
  
	
  
*Document	
  the	
  use	
  cases!!	
  

12	
  
Architecture	
  
Architecture	
  
! 
! 
! 
! 

Splunk	
  server	
  roles:	
  	
  distributed/clustered	
  deployments	
  
Reference	
  server	
  
Rules	
  of	
  thumb	
  
Hardware	
  factors	
  

14	
  
Splunk	
  Distributed	
  Roles	
  
Search	
  Head	
  (regular	
  and	
  job	
  server)	
  
search	
  head	
  

Indexer	
  
indexer	
  

Forwarder	
  (universal)	
  
forwarder	
  
15	
  
Splunk	
  Distributed	
  Roles	
  
Cluster	
  Master	
  (clustering/replicaAon	
  requirement)	
  

License	
  Master	
  

Deployment	
  Server	
  

16	
  
Recommended	
  ConfiguraAons	
  
Stand-­‐	
  
alone	
  

Indexer	
  
(distributed)	
  

Search	
  head	
  
(distributed)	
  

Indexer	
  
(clustered)	
  

Forwarding	
  

*	
  

*	
  

*	
  

*	
  

*	
  

Searching	
  

√	
  
√	
  

√	
  

*	
  

√	
  

√	
  

*	
  

√	
  

*	
  

*	
  

*	
  

√	
  

Indexing	
  
Deployment	
  
server	
  
License	
  
master	
  
Cluster	
  
master	
  

Search	
  
Cluster	
  
head	
  
master	
  	
  
(clustered)	
   (clustered)	
  

*	
  

*	
  

√	
  

√	
  -­‐	
  common	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  *	
  -­‐	
  uncommon	
  	
  
17	
  
What s	
  a	
   Reference 	
  Server?	
  
! 
! 
! 
! 
! 
! 

Sizing	
  based	
  on	
  commodity	
  x86	
  servers	
  
Dual	
  quad-­‐core	
  CPUs	
  at	
  3.0	
  GHz	
  (dual	
  six	
  core	
  is	
  common)	
  
8	
  GB	
  of	
  RAM	
  –	
  (16	
  GB	
  is	
  common)	
  
64-­‐bit	
  OS	
  
4x10k	
  RPM	
  local	
  SAS	
  drives	
  in	
  RAID	
  1+0	
  (800+	
  IOPs)	
  
VariaAons	
  cause	
  corresponding	
  changes	
  in	
  performance/
requirements	
  

18	
  
Rules	
  of	
  Thumb	
  
! 
! 
! 
! 
! 
! 

These	
  all	
  have	
  excepAons	
  and	
  qualificaAons	
  
1	
  reference	
  indexer	
  per	
  100	
  GB/day	
  
1	
  reference	
  search	
  head	
  per	
  8	
  to	
  12	
  users	
  
1	
  reference	
  job	
  server	
  per	
  20	
  concurrent	
  jobs	
  
1	
  deployment	
  server	
  per	
  3000	
  polls/min	
  
ReplicaAon	
  later…	
  

19	
  
How	
  Many	
  Indexers?	
  
! 
! 

Rule	
  of	
  thumb	
  says:	
  1	
  per	
  100	
  GB/day	
  
Leaves	
  room	
  for:	
  

–  Daily	
  peaks	
  
–  Light	
  searching	
  and	
  reporAng	
  for	
  about	
  3	
  concurrent	
  users	
  
! 

Need	
  more	
  indexers	
  for:	
  

–  Heavy	
  reporAng	
  
–  More	
  users	
  
–  Slower	
  disks,	
  slower	
  CPUs,	
  fewer	
  CPUs	
  

20	
  
How	
  Many	
  Search	
  Heads?	
  
! 
! 
! 
! 
! 
! 
! 

Rule	
  of	
  thumb	
  says:	
  1	
  per	
  8	
  to	
  12	
  concurrent	
  users	
  
Limit	
  is	
  concurrent	
  queries	
  
30-­‐50	
  web	
  sessions	
  
1:1	
  raAo	
  of	
  search	
  query	
  to	
  CPU	
  core	
  
Only	
  add	
  first	
  search	
  head	
  if	
  ≥3	
  indexers	
  
Don’t	
  add	
  search	
  heads;	
  add	
  indexers:	
  indexers	
  do	
  most	
  work	
  
But	
  you	
  need	
  more	
  if:	
  

–  Running	
  a	
  lot	
  of	
  scheduled	
  jobs	
  on	
  the	
  search	
  head	
  
21	
  
Search	
  Head	
  vs.	
  Job	
  Server	
  
! 

Search	
  Head	
  Pooling	
  (SHP):	
  	
  uses	
  NFS	
  to	
  manage	
  user	
  profiles/
configuraAons	
  and	
  job	
  queue	
  

! 
! 

Search	
  head	
  and	
  job	
  server	
  are	
  equivalent	
  with	
  SHP	
  
Use	
  job	
  servers	
  for	
  scheduled	
  searches	
  (summaries,	
  alerts,	
  	
  
and	
  reports)	
  

! 

Use	
  search	
  heads	
  for	
  ad-­‐hoc	
  searching	
  

22	
  
How	
  Many	
  Deployment	
  Servers?	
  
! 
! 
! 
! 
! 
! 

Rule	
  of	
  thumb	
  says:	
  1	
  per	
  3000	
  polls/minute	
  
Just	
  use	
  one	
  deployment	
  server,	
  and	
  adjust	
  the	
  polling	
  period	
  
Small	
  deployments	
  can	
  share	
  the	
  same	
  splunkd	
  
Low	
  requirement	
  for	
  disk	
  performance	
  (good	
  candidate	
  	
  
for	
  virtualizaAon)	
  
Windows	
  OS	
  –	
  1	
  per	
  500	
  polls/minute	
  
Or	
  use	
  something	
  other	
  than	
  deployment	
  server	
  
–  Puppet,	
  SCCM,	
  cfengine,	
  chef…	
  

23	
  
More	
  is	
  Bexer?	
  
! 

CPUs	
  

–  Search	
  process	
  uAlizes	
  up	
  to	
  1	
  CPU	
  core	
  (1:1)	
  
–  Indexers	
  sAll	
  need	
  to	
  do	
  the	
  heavy	
  liTing	
  (search	
  exists	
  on	
  indexer	
  AND	
  
search	
  head)	
  
–  Limited	
  benefit	
  for	
  indexing	
  (up	
  to	
  2	
  CPU	
  cores	
  for	
  indexing)	
  

! 

Memory	
  

–  Good	
  for	
  search	
  heads	
  and	
  indexers	
  (16+	
  GB)	
  

! 

Disks	
  

–  Faster	
  is	
  bexer	
  (15k	
  rpm)	
  
–  More	
  disks	
  in	
  RAID	
  1+0	
  =	
  faster	
  
24	
  
Performance	
  and	
  Sizing	
  Tips	
  
System	
  change	
  

Search	
  Speed	
  	
  

Indexing	
  Speed	
  

Faster	
  disks	
  

++	
  

++	
  

Add	
  an	
  indexer	
  

++	
  

++	
  

Add	
  a	
  search	
  head	
  

+	
  

Report	
  acceleraAon/
summaries	
  

++	
  
25	
  
Performance	
  and	
  Sizing	
  Tips	
  
System	
  change	
  

Search	
  Speed	
  	
  

OpAmize	
  searches	
  

+++	
  

OpAmize	
  field	
  extracAon	
  

Indexing	
  Speed	
  

+	
  
+	
  

OpAmize	
  input	
  parsing	
  

+	
  

Faster	
  CPU	
  
26	
  

+	
  
Capacity	
  à	
  Architecture	
  
! 

Sizing	
  recipe	
  

–  Capacity	
  
–  Rules	
  of	
  thumb	
  determines	
  number	
  of	
  servers	
  
! 

Building	
  blocks	
  for	
  architecture	
  

27	
  
Architecture	
  Factors	
  
! 
! 
! 
! 
! 
! 
! 

What	
  are	
  my	
  sizing	
  requirements?	
  
Where	
  is	
  the	
  data?	
  
Where	
  are	
  the	
  users?	
  
What	
  is	
  the	
  security	
  policy?	
  
What	
  are	
  the	
  retenAon	
  and	
  compliance	
  policies?	
  
What	
  is	
  the	
  availability	
  requirement?	
  
What	
  about	
  the	
  cloud?	
  

28	
  
Architecture	
  Factors	
  
! 

What	
  are	
  my	
  sizing	
  requirements?	
  	
  	
  

–  Data	
  capacity	
  
–  Search	
  capacity	
  
–  User	
  capacity	
  
! 

Obtained	
  from	
  the	
  sizing	
  process	
  

29	
  
Architecture	
  Factors	
  
! 

Where	
  is	
  the	
  data?	
  

– 
– 
– 
– 
– 
! 

Local	
  or	
  remote	
  to	
  the	
  indexing	
  machine	
  
If	
  remote	
  –	
  use	
  forwarders	
  when	
  possible	
  
Index	
  in	
  local	
  data	
  center	
  (zone)	
  or	
  index	
  centrally	
  
Persist	
  network	
  data	
  to	
  disk	
  as	
  a	
  best	
  pracAce	
  
Use	
  intermediate	
  forwarders	
  to	
  distribute	
  data	
  

Where	
  are	
  the	
  users?	
  

–  User	
  experience	
  affected	
  by	
  search	
  head	
  locaAon	
  
ê  Time	
  zone	
  tuning	
  (5.x	
  +)	
  
ê  Distributed	
  search	
  over	
  LAN	
  vs	
  WAN	
  

30	
  
Architecture	
  Factors	
  
! 

What	
  is	
  the	
  Security	
  Policy?	
  

–  Apply	
  user	
  security	
  policies	
  
ê  Auth	
  method	
  
ê  Roles	
  
ê  Filters	
  

–  Apply	
  physical	
  security	
  policies	
  
ê  Index	
  locaAon	
  

31	
  
Architecture	
  Factors	
  
! 

RetenAon,	
  compliance,	
  governance	
  

–  Where	
  is	
  the	
  data	
  allowed	
  to	
  be?	
  
–  Where	
  is	
  the	
  data	
  not	
  allowed	
  to	
  go?	
  
–  Where	
  must	
  the	
  data	
  go?	
  
! 

Availability	
  

–  Local	
  failover,	
  fault-­‐tolerance,	
  clustering	
  
–  Geographic	
  disaster	
  recovery/fault-­‐tolerance	
  
–  Index	
  replicaAon!	
  

32	
  
Architecture	
  Factors	
  
! 
! 

Same	
  old	
  story	
  
Cloud	
  consideraAons	
  

– 
– 
– 
– 

AuthenAcaAon	
  restricAons	
  
Data	
  transfer	
  costs	
  
Security	
  –	
  SSL	
  tunnel	
  
Zones	
  

33	
  
Topologies	
  
Architecture	
  Factors	
  à	
  Topology	
  
Topology	
  Examples	
  
Centralized	
   Decentralized	
  

35	
  

Hybrid	
  
Centralized	
  Topology	
  
Search Head Pooling
Indexers

Forwarders

Intermediate
Forwarder

Forwarders

Syslog Devices
36	
  
Decentralized	
  Topology	
  
Search Head Pooling

37	
  
Hybrid	
  Topology	
  

38	
  
Scaling	
  and	
  Expansion	
  
! 

Add	
  to	
  your	
  indexer	
  pool	
  for	
  more	
  performance	
  or	
  capacity	
  
–  Mixed	
  pla{orm	
  and	
  hardware	
  is	
  okay	
  

! 

Use	
  search	
  head	
  pooling	
  for	
  more	
  UI	
  capacity	
  
–  Requires	
  NFS	
  

! 

Create	
  new	
  indexes	
  for	
  new	
  data	
  types	
  
–  Follows	
  best	
  pracAces	
  

39	
  
Index	
  ReplicaAon	
  (aka	
  Clustering)	
  
! 

What	
  is	
  it?	
  

–  Indexes	
  are	
  replicated	
  to	
  1	
  or	
  more	
  indexers	
  (tunable)	
  
–  Splunk	
  controlled	
  

! 

Basics	
  

–  Master	
  node	
  (manages	
  indexing	
  and	
  searching	
  locaAon)	
  
–  Distributed	
  deployment	
  
–  NOT	
  =	
  “index	
  and	
  forward	
  “	
  

! 

HA	
  license	
  VS.	
  index	
  replicaAon	
  

–  HAL	
  –	
  Separate	
  fully	
  funcAoning	
  Splunk	
  deployments	
  
–  IR	
  –	
  Data	
  is	
  made	
  available	
  on	
  1	
  or	
  more	
  indexers	
  
40	
  
Index	
  ReplicaAon	
  	
  
! 

Rule	
  of	
  thumb:	
  	
  1	
  per	
  50	
  GB/day	
  

–  Assume	
  simple	
  replicaAon	
  (2	
  in	
  existence)	
  
–  Increase	
  in	
  I/O,	
  CPU,	
  and	
  disk	
  requirement	
  

! 

Need	
  more	
  indexers	
  if:	
  

–  Increase	
  in	
  replicaAon	
  factor	
  
–  Performance	
  or	
  capacity	
  needs	
  (search	
  and	
  index)	
  

41	
  
Index	
  ReplicaAon	
  (aka	
  –	
  Clustering)
Cluster Master

Search Head Pooling

Peer Nodes

Forwarders

42	
  
Index	
  ReplicaAon	
  
! 
! 
! 
! 

Data	
  is	
  replicated	
  and	
  made	
  available	
  
WAN	
  configuraAon	
  is	
  not	
  recommended	
  
Careful	
  consideraAon	
  when	
  inserted	
  into	
  standard	
  topologies	
  
Increases	
  	
  
–  Storage	
  requirement	
  
–  I/O	
  requirement	
  (disk,	
  network)	
  
–  Total	
  indexer	
  requirement	
  

! 

Disaster	
  recovery	
  and	
  high	
  availability	
  .conf	
  session	
  
43	
  
Final	
  Thoughts	
  
! 

Sizing	
  is	
  more	
  than	
  data	
  volume	
  –	
  it’s	
  also	
  search	
  load	
  
Centralized	
  architecture	
  is	
  the	
  baseline	
  

! 

VariaAons	
  on	
  architecture	
  are	
  driven	
  by	
  

! 

– 
– 
– 
– 
– 

Sizing	
  
Data	
  locaAon	
  
User	
  locaAon	
  
RetenAon/access/governance	
  
Availability	
  requirements	
  

44	
  
Next	
  Steps	
  
1	
   Download	
  the	
  .conf2013	
  Mobile	
  App	
  

If	
  not	
  iPhone,	
  iPad	
  or	
  Android,	
  use	
  the	
  Web	
  App	
  
	
  

2	
   Take	
  the	
  survey	
  &	
  WIN	
  A	
  PASS	
  FOR	
  .CONF2014…	
  Or	
  one	
  of	
  these	
  

bags!	
  
3	
   	
  
View	
  the	
  sessions	
  listed	
  on	
  the	
  next	
  slide	
  
Available	
  on	
  the	
  Mobile	
  App	
  
	
  

45	
  
More	
  InformaAon	
  
! 
! 
! 
! 

Contact:	
  	
  syep@splunk.com	
  	
  
DocumenaAon:	
  	
  hxp://docs.splunk.com	
  
Answers:	
  	
  hxp://answers.splunk.com	
  
Other	
  presentaAons	
  

–  Best	
  PracAces:	
  Deploying	
  Splunk	
  on	
  Physical,	
  Virtual,	
  and	
  Cloud	
  Environments	
  
–  ArchitecAng	
  Splunk	
  for	
  High	
  Availability	
  and	
  Disaster	
  Recovery	
  

46	
  
THANK	
  YOU	
  

Mais conteúdo relacionado

Mais procurados

GPU databases - How to use them and what the future holds
GPU databases - How to use them and what the future holdsGPU databases - How to use them and what the future holds
GPU databases - How to use them and what the future holds
Arnon Shimoni
 

Mais procurados (19)

L’approche Big Data en finance de marché 1/2
L’approche Big Data en finance de marché 1/2L’approche Big Data en finance de marché 1/2
L’approche Big Data en finance de marché 1/2
 
Spark at Airbnb
Spark at AirbnbSpark at Airbnb
Spark at Airbnb
 
WSO2Con ASIA 2016: Patterns for Deploying Analytics in the Real World
WSO2Con ASIA 2016: Patterns for Deploying Analytics in the Real WorldWSO2Con ASIA 2016: Patterns for Deploying Analytics in the Real World
WSO2Con ASIA 2016: Patterns for Deploying Analytics in the Real World
 
Real-time Analytics with Presto and Apache Pinot
Real-time Analytics with Presto and Apache PinotReal-time Analytics with Presto and Apache Pinot
Real-time Analytics with Presto and Apache Pinot
 
xPatterns - Spark Summit 2014
xPatterns - Spark Summit   2014xPatterns - Spark Summit   2014
xPatterns - Spark Summit 2014
 
Logging, Metrics, and APM: The Operations Trifecta (P)
Logging, Metrics, and APM: The Operations Trifecta (P)Logging, Metrics, and APM: The Operations Trifecta (P)
Logging, Metrics, and APM: The Operations Trifecta (P)
 
Pinot: Near Realtime Analytics @ Uber
Pinot: Near Realtime Analytics @ UberPinot: Near Realtime Analytics @ Uber
Pinot: Near Realtime Analytics @ Uber
 
The evolution of the big data platform @ Netflix (OSCON 2015)
The evolution of the big data platform @ Netflix (OSCON 2015)The evolution of the big data platform @ Netflix (OSCON 2015)
The evolution of the big data platform @ Netflix (OSCON 2015)
 
Real-Time, Geospatial, Maps by Neil Dahlke
Real-Time, Geospatial, Maps by Neil DahlkeReal-Time, Geospatial, Maps by Neil Dahlke
Real-Time, Geospatial, Maps by Neil Dahlke
 
Grab at Scale with Scylla
Grab at Scale with ScyllaGrab at Scale with Scylla
Grab at Scale with Scylla
 
GPU databases - How to use them and what the future holds
GPU databases - How to use them and what the future holdsGPU databases - How to use them and what the future holds
GPU databases - How to use them and what the future holds
 
Bridging the Gap Between Datasets and DataFrames
Bridging the Gap Between Datasets and DataFramesBridging the Gap Between Datasets and DataFrames
Bridging the Gap Between Datasets and DataFrames
 
Getting It Right Exactly Once: Principles for Streaming Architectures
Getting It Right Exactly Once: Principles for Streaming ArchitecturesGetting It Right Exactly Once: Principles for Streaming Architectures
Getting It Right Exactly Once: Principles for Streaming Architectures
 
The Little Warehouse That Couldn't Or: How We Learned to Stop Worrying and Mo...
The Little Warehouse That Couldn't Or: How We Learned to Stop Worrying and Mo...The Little Warehouse That Couldn't Or: How We Learned to Stop Worrying and Mo...
The Little Warehouse That Couldn't Or: How We Learned to Stop Worrying and Mo...
 
Streaming Analytics @ Uber
Streaming Analytics @ UberStreaming Analytics @ Uber
Streaming Analytics @ Uber
 
Druid Overview by Rachel Pedreschi
Druid Overview by Rachel PedreschiDruid Overview by Rachel Pedreschi
Druid Overview by Rachel Pedreschi
 
eBay Pulsar: Real-time analytics platform
eBay Pulsar: Real-time analytics platformeBay Pulsar: Real-time analytics platform
eBay Pulsar: Real-time analytics platform
 
Real-Time Geospatial Intelligence at Scale
Real-Time Geospatial Intelligence at Scale Real-Time Geospatial Intelligence at Scale
Real-Time Geospatial Intelligence at Scale
 
Pulsar - Real-time Analytics at Scale
Pulsar - Real-time Analytics at ScalePulsar - Real-time Analytics at Scale
Pulsar - Real-time Analytics at Scale
 

Destaque

Splunk live introdução
Splunk live introduçãoSplunk live introdução
Splunk live introdução
Splunk
 
SplunkLive! São Paulo 2014 - Overview by markus zirn
SplunkLive! São Paulo 2014 -  Overview by markus zirnSplunkLive! São Paulo 2014 -  Overview by markus zirn
SplunkLive! São Paulo 2014 - Overview by markus zirn
Splunk
 
Splunk live! Inteligência operacional em um mundo de bigdata
Splunk live! Inteligência operacional em um mundo de bigdataSplunk live! Inteligência operacional em um mundo de bigdata
Splunk live! Inteligência operacional em um mundo de bigdata
Splunk
 
SplunkLive! Zürich 2016 - Use Case Swisscom
SplunkLive! Zürich 2016 - Use Case SwisscomSplunkLive! Zürich 2016 - Use Case Swisscom
SplunkLive! Zürich 2016 - Use Case Swisscom
Splunk
 
SplunkLive 2011 Beginners Session
SplunkLive 2011 Beginners SessionSplunkLive 2011 Beginners Session
SplunkLive 2011 Beginners Session
Splunk
 

Destaque (20)

99 Taxi - SplunkLive! São Paulo 2015
99 Taxi - SplunkLive! São Paulo 201599 Taxi - SplunkLive! São Paulo 2015
99 Taxi - SplunkLive! São Paulo 2015
 
Vtex - SplunkLive! São Paulo 2015
Vtex - SplunkLive! São Paulo 2015Vtex - SplunkLive! São Paulo 2015
Vtex - SplunkLive! São Paulo 2015
 
Splunk live introdução
Splunk live introduçãoSplunk live introdução
Splunk live introdução
 
Caso de Sucesso Vodafone e Splunk
Caso de Sucesso Vodafone e SplunkCaso de Sucesso Vodafone e Splunk
Caso de Sucesso Vodafone e Splunk
 
SplunkLive! São Paulo 2014 - Overview by markus zirn
SplunkLive! São Paulo 2014 -  Overview by markus zirnSplunkLive! São Paulo 2014 -  Overview by markus zirn
SplunkLive! São Paulo 2014 - Overview by markus zirn
 
Exxon - SplunkLive! São Paulo 2015
Exxon - SplunkLive! São Paulo 2015Exxon - SplunkLive! São Paulo 2015
Exxon - SplunkLive! São Paulo 2015
 
Splunk live! Inteligência operacional em um mundo de bigdata
Splunk live! Inteligência operacional em um mundo de bigdataSplunk live! Inteligência operacional em um mundo de bigdata
Splunk live! Inteligência operacional em um mundo de bigdata
 
Exploring Splunk
Exploring SplunkExploring Splunk
Exploring Splunk
 
Cisco UCS and Splunk Workshop
Cisco UCS and Splunk WorkshopCisco UCS and Splunk Workshop
Cisco UCS and Splunk Workshop
 
SplunkLive! Zürich 2016 - Use Case Swisscom
SplunkLive! Zürich 2016 - Use Case SwisscomSplunkLive! Zürich 2016 - Use Case Swisscom
SplunkLive! Zürich 2016 - Use Case Swisscom
 
Splunk conf2014 - Splunk Monitoring - New Native Tools for Monitoring your Sp...
Splunk conf2014 - Splunk Monitoring - New Native Tools for Monitoring your Sp...Splunk conf2014 - Splunk Monitoring - New Native Tools for Monitoring your Sp...
Splunk conf2014 - Splunk Monitoring - New Native Tools for Monitoring your Sp...
 
Data Onboarding
Data Onboarding Data Onboarding
Data Onboarding
 
Splunk Distributed Management Console
Splunk Distributed Management Console                                         Splunk Distributed Management Console
Splunk Distributed Management Console
 
Taking Splunk to the Next Level – Architecture
Taking Splunk to the Next Level – ArchitectureTaking Splunk to the Next Level – Architecture
Taking Splunk to the Next Level – Architecture
 
SplunkLive 2011 Beginners Session
SplunkLive 2011 Beginners SessionSplunkLive 2011 Beginners Session
SplunkLive 2011 Beginners Session
 
Conf2014_SplunkSearchOptimization
Conf2014_SplunkSearchOptimizationConf2014_SplunkSearchOptimization
Conf2014_SplunkSearchOptimization
 
You Can't Protect What you Can't See. AWS Security Best Practices - Session S...
You Can't Protect What you Can't See. AWS Security Best Practices - Session S...You Can't Protect What you Can't See. AWS Security Best Practices - Session S...
You Can't Protect What you Can't See. AWS Security Best Practices - Session S...
 
Visibilidade de negócios em impressão de nota fiscal
Visibilidade de negócios em impressão de nota fiscalVisibilidade de negócios em impressão de nota fiscal
Visibilidade de negócios em impressão de nota fiscal
 
Clientes Splunk Brasil
Clientes Splunk BrasilClientes Splunk Brasil
Clientes Splunk Brasil
 
Splunklive! Universo Online
Splunklive! Universo OnlineSplunklive! Universo Online
Splunklive! Universo Online
 

Semelhante a Deploying Splunk. Arquitetura e dimensionamento do Splunk

SplunkLive! Detroit April 2013 - Domino's Pizza
SplunkLive! Detroit April 2013 - Domino's PizzaSplunkLive! Detroit April 2013 - Domino's Pizza
SplunkLive! Detroit April 2013 - Domino's Pizza
Splunk
 

Semelhante a Deploying Splunk. Arquitetura e dimensionamento do Splunk (20)

Taking Splunk to the Next Level - Technical
Taking Splunk to the Next Level - TechnicalTaking Splunk to the Next Level - Technical
Taking Splunk to the Next Level - Technical
 
Taking Splunk to the Next Level – Architecture
Taking Splunk to the Next Level – ArchitectureTaking Splunk to the Next Level – Architecture
Taking Splunk to the Next Level – Architecture
 
Taking Splunk to the Next Level - Architecture
Taking Splunk to the Next Level - ArchitectureTaking Splunk to the Next Level - Architecture
Taking Splunk to the Next Level - Architecture
 
Taking Splunk to the Next Level - Architecture Breakout Session
Taking Splunk to the Next Level - Architecture Breakout SessionTaking Splunk to the Next Level - Architecture Breakout Session
Taking Splunk to the Next Level - Architecture Breakout Session
 
Taking Splunk to the Next Level - Architecture
Taking Splunk to the Next Level - ArchitectureTaking Splunk to the Next Level - Architecture
Taking Splunk to the Next Level - Architecture
 
Taking Splunk to the Next Level - Architecture Breakout Session
Taking Splunk to the Next Level - Architecture Breakout SessionTaking Splunk to the Next Level - Architecture Breakout Session
Taking Splunk to the Next Level - Architecture Breakout Session
 
Splunk in Nordstrom: IT Operations
Splunk in Nordstrom: IT OperationsSplunk in Nordstrom: IT Operations
Splunk in Nordstrom: IT Operations
 
Conf2014_SearchHeadClustering
Conf2014_SearchHeadClusteringConf2014_SearchHeadClustering
Conf2014_SearchHeadClustering
 
Splunk Ninjas: New Features, Pivot and Search Dojo
Splunk Ninjas: New Features, Pivot and Search DojoSplunk Ninjas: New Features, Pivot and Search Dojo
Splunk Ninjas: New Features, Pivot and Search Dojo
 
Splunk Ninjas: New Features, Pivot and Search Dojo
Splunk Ninjas: New Features, Pivot and Search DojoSplunk Ninjas: New Features, Pivot and Search Dojo
Splunk Ninjas: New Features, Pivot and Search Dojo
 
Splunk Ninja: New Features, Pivot and Search Dojo
 Splunk Ninja: New Features, Pivot and Search Dojo Splunk Ninja: New Features, Pivot and Search Dojo
Splunk Ninja: New Features, Pivot and Search Dojo
 
Splunk Ninjas: New Features, Pivot and Search Dojo
Splunk Ninjas: New Features, Pivot and Search DojoSplunk Ninjas: New Features, Pivot and Search Dojo
Splunk Ninjas: New Features, Pivot and Search Dojo
 
MongoDB World 2019: Finding the Right MongoDB Atlas Cluster Size: Does This I...
MongoDB World 2019: Finding the Right MongoDB Atlas Cluster Size: Does This I...MongoDB World 2019: Finding the Right MongoDB Atlas Cluster Size: Does This I...
MongoDB World 2019: Finding the Right MongoDB Atlas Cluster Size: Does This I...
 
Taking Splunk to the Next Level - Architecture Breakout Session
Taking Splunk to the Next Level - Architecture Breakout SessionTaking Splunk to the Next Level - Architecture Breakout Session
Taking Splunk to the Next Level - Architecture Breakout Session
 
The Power of SPL
The Power of SPLThe Power of SPL
The Power of SPL
 
Understanding and Configuring an Effective SharePoint 2013 Search
Understanding and Configuring an Effective SharePoint 2013 SearchUnderstanding and Configuring an Effective SharePoint 2013 Search
Understanding and Configuring an Effective SharePoint 2013 Search
 
DoneDeal - AWS Data Analytics Platform
DoneDeal - AWS Data Analytics PlatformDoneDeal - AWS Data Analytics Platform
DoneDeal - AWS Data Analytics Platform
 
Webinar replay: MySQL Query Tuning Trilogy: Query tuning process and tools
Webinar replay: MySQL Query Tuning Trilogy: Query tuning process and toolsWebinar replay: MySQL Query Tuning Trilogy: Query tuning process and tools
Webinar replay: MySQL Query Tuning Trilogy: Query tuning process and tools
 
SplunkLive! Detroit April 2013 - Domino's Pizza
SplunkLive! Detroit April 2013 - Domino's PizzaSplunkLive! Detroit April 2013 - Domino's Pizza
SplunkLive! Detroit April 2013 - Domino's Pizza
 
Capacity Planning
Capacity PlanningCapacity Planning
Capacity Planning
 

Mais de Splunk

Splunk live produban
Splunk live produbanSplunk live produban
Splunk live produban
Splunk
 
Guia de referência Splunk
Guia de referência SplunkGuia de referência Splunk
Guia de referência Splunk
Splunk
 
BVMF and Splunk
BVMF and SplunkBVMF and Splunk
BVMF and Splunk
Splunk
 
Guia para inteligência operacional
Guia para inteligência operacionalGuia para inteligência operacional
Guia para inteligência operacional
Splunk
 
Gerenciamento de aplicativos
Gerenciamento de aplicativosGerenciamento de aplicativos
Gerenciamento de aplicativos
Splunk
 

Mais de Splunk (9)

Vtex - Splunk live! 2014 São Paulo
Vtex - Splunk live! 2014 São Paulo Vtex - Splunk live! 2014 São Paulo
Vtex - Splunk live! 2014 São Paulo
 
Splunk live! São Paulo 2014 - Edenred-Ticket
Splunk live! São Paulo 2014 - Edenred-TicketSplunk live! São Paulo 2014 - Edenred-Ticket
Splunk live! São Paulo 2014 - Edenred-Ticket
 
Splunk live produban
Splunk live produbanSplunk live produban
Splunk live produban
 
Introdução Splunk Brasil
Introdução Splunk BrasilIntrodução Splunk Brasil
Introdução Splunk Brasil
 
Guia de referência Splunk
Guia de referência SplunkGuia de referência Splunk
Guia de referência Splunk
 
BVMF and Splunk
BVMF and SplunkBVMF and Splunk
BVMF and Splunk
 
Visão geral da splunk
Visão geral da splunkVisão geral da splunk
Visão geral da splunk
 
Guia para inteligência operacional
Guia para inteligência operacionalGuia para inteligência operacional
Guia para inteligência operacional
 
Gerenciamento de aplicativos
Gerenciamento de aplicativosGerenciamento de aplicativos
Gerenciamento de aplicativos
 

Último

Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 

Último (20)

Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
A Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source MilvusA Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source Milvus
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 

Deploying Splunk. Arquitetura e dimensionamento do Splunk

  • 1. Copyright  ©  2013  Splunk  Inc.   ArchitecAng  and  Sizing     your  Splunk  Deployment   Simeon  Yep   Sales  Engineering  Manager,  Splunk   #splunkconf  
  • 2. Legal  NoAces   During  the  course  of  this  presentaAon,  we  may  make  forward-­‐looking  statements  regarding  future  events  or  the   expected  performance  of  the  company.  We  cauAon  you  that  such  statements  reflect  our  current   expectaAons  and  esAmates  based  on  factors  currently  known  to  us  and  that  actual  events  or  results  could  differ   materially.  For  important  factors  that  may  cause  actual  results  to  differ  from  those  contained  in  our  forward-­‐ looking  statements,  please  review  our  filings  with  the  SEC.    The  forward-­‐looking  statements  made  in  this   presentaAon  are  being  made  as  of  the  Ame  and  date  of  its  live  presentaAon.    If  reviewed  aTer  its  live   presentaAon,  this  presentaAon  may  not  contain  current  or  accurate  informaAon.      We  do  not  assume  any   obligaAon  to  update  any  forward-­‐looking  statements  we  may  make.    In  addiAon,  any  informaAon  about   our  roadmap  outlines  our  general  product  direcAon  and  is  subject  to  change  at  any  Ame  without  noAce.    It  is  for   informaAonal  purposes  only  and  shall  not,  be  incorporated  into  any  contract  or  other  commitment.    Splunk   undertakes  no  obligaAon  either  to  develop  the  features  or  funcAonality  described  or  to  include  any  such  feature  or   funcAonality  in  a  future  release.     Splunk,  Splunk>,  Splunk  Storm,  Listen  to  Your  Data,  SPL  and  The  Engine  for  Machine  Data  are  trademarks  and  registered  trademarks  of   Splunk  Inc.  in  the  United  States  and  other  countries.  All  other  brand  names,  product  names,  or  trademarks  belong  to  their  respecCve   owners.     ©2013  Splunk  Inc.  All  rights  reserved.   2  
  • 4. About  Me   !  !  5+  years  @  Splunk   Experience:       –  SupporAng,  administering,  and  architecAng  large  scale  deployments   –  OEM  –  technical  sales   –  Strategic  accounts  –  technical  sales   !  !  Based  in  HQ  (San  Francisco  office)   Currently:    Business  Development,  Technical  Synergies     4  
  • 5. Agenda   !  Sizing  Fundamentals   ArchitecAng  Fundamentals   !  Deployment  Topologies   !    5  
  • 7. Sizing  Fundamentals   !  !  !  !  Understand  the  sizing  factors   Data  volume     Search  volume   Sizing  sheet   7  
  • 8. Sizing  Factors   !  How  much  data  (raw  sizes)?     –  –  –  –  !  Daily  volume     Peak  volume   Retained  volume  (archive  size)   Future  volume?   How  much  searching?   –  Use  cases   –  How  many  people?  How  oTen?   !  Jobs   –  SummarizaAon,  alerAng,  reporAng   8  
  • 9. Data  Volumes   !  EsAmate  input  volume   –  Verify  raw  log  sizes   –  Leverage  _internal  metrics  to  get  actual  input  volumes   !  Confirm  esAmates  with  actual  data   –  Create  a  baseline  with  real  or  simulated  data   –  Find  compression  rates  (range  from  30%-­‐120%,  typically  50%)   –  Determine  retenAon  needs   !  Document  use  cases   –  Use  case  determines  search  needs   –  Plan  for  expansion  as  adopAon  grows  (search  and  volume)   9  
  • 10. Data  Sizing  Exercise   !  Via  Filesystem   Use  the  Splunk  log  files:    metrics.log  or  license_usage.log   !  OpAonally:   !  –  License  report  view  in  Splunk  Enterprise  6   –  S.o.S  app  in  5.x   10  
  • 11. Search  Volumes   !  !  !  Gather  use  case  informaAon   –  How  much  ad-­‐hoc  searching?   –  How  much  background  searching?   Ad-­‐hoc  searching   –  Evaluate  the  data  being  searched   –  Evaluate  the  Ame  duraAon  (real-­‐Ame  vs  historic)   –  Real-­‐Ame  searches  are  typically  less  overhead   Background  searching   –  AlerAng  and  monitoring   –  General  reports     –  Summary  indexing   11  
  • 12. Final  Sizing  Numbers   !  Data  capacity     –  Daily  and  peak   !  User  capacity     –  Concurrent  and  total   !  Search  capacity     –  Concurrent  and  total       *Document  the  use  cases!!   12  
  • 14. Architecture   !  !  !  !  Splunk  server  roles:    distributed/clustered  deployments   Reference  server   Rules  of  thumb   Hardware  factors   14  
  • 15. Splunk  Distributed  Roles   Search  Head  (regular  and  job  server)   search  head   Indexer   indexer   Forwarder  (universal)   forwarder   15  
  • 16. Splunk  Distributed  Roles   Cluster  Master  (clustering/replicaAon  requirement)   License  Master   Deployment  Server   16  
  • 17. Recommended  ConfiguraAons   Stand-­‐   alone   Indexer   (distributed)   Search  head   (distributed)   Indexer   (clustered)   Forwarding   *   *   *   *   *   Searching   √   √   √   *   √   √   *   √   *   *   *   √   Indexing   Deployment   server   License   master   Cluster   master   Search   Cluster   head   master     (clustered)   (clustered)   *   *   √   √  -­‐  common                    *  -­‐  uncommon     17  
  • 18. What s  a   Reference  Server?   !  !  !  !  !  !  Sizing  based  on  commodity  x86  servers   Dual  quad-­‐core  CPUs  at  3.0  GHz  (dual  six  core  is  common)   8  GB  of  RAM  –  (16  GB  is  common)   64-­‐bit  OS   4x10k  RPM  local  SAS  drives  in  RAID  1+0  (800+  IOPs)   VariaAons  cause  corresponding  changes  in  performance/ requirements   18  
  • 19. Rules  of  Thumb   !  !  !  !  !  !  These  all  have  excepAons  and  qualificaAons   1  reference  indexer  per  100  GB/day   1  reference  search  head  per  8  to  12  users   1  reference  job  server  per  20  concurrent  jobs   1  deployment  server  per  3000  polls/min   ReplicaAon  later…   19  
  • 20. How  Many  Indexers?   !  !  Rule  of  thumb  says:  1  per  100  GB/day   Leaves  room  for:   –  Daily  peaks   –  Light  searching  and  reporAng  for  about  3  concurrent  users   !  Need  more  indexers  for:   –  Heavy  reporAng   –  More  users   –  Slower  disks,  slower  CPUs,  fewer  CPUs   20  
  • 21. How  Many  Search  Heads?   !  !  !  !  !  !  !  Rule  of  thumb  says:  1  per  8  to  12  concurrent  users   Limit  is  concurrent  queries   30-­‐50  web  sessions   1:1  raAo  of  search  query  to  CPU  core   Only  add  first  search  head  if  ≥3  indexers   Don’t  add  search  heads;  add  indexers:  indexers  do  most  work   But  you  need  more  if:   –  Running  a  lot  of  scheduled  jobs  on  the  search  head   21  
  • 22. Search  Head  vs.  Job  Server   !  Search  Head  Pooling  (SHP):    uses  NFS  to  manage  user  profiles/ configuraAons  and  job  queue   !  !  Search  head  and  job  server  are  equivalent  with  SHP   Use  job  servers  for  scheduled  searches  (summaries,  alerts,     and  reports)   !  Use  search  heads  for  ad-­‐hoc  searching   22  
  • 23. How  Many  Deployment  Servers?   !  !  !  !  !  !  Rule  of  thumb  says:  1  per  3000  polls/minute   Just  use  one  deployment  server,  and  adjust  the  polling  period   Small  deployments  can  share  the  same  splunkd   Low  requirement  for  disk  performance  (good  candidate     for  virtualizaAon)   Windows  OS  –  1  per  500  polls/minute   Or  use  something  other  than  deployment  server   –  Puppet,  SCCM,  cfengine,  chef…   23  
  • 24. More  is  Bexer?   !  CPUs   –  Search  process  uAlizes  up  to  1  CPU  core  (1:1)   –  Indexers  sAll  need  to  do  the  heavy  liTing  (search  exists  on  indexer  AND   search  head)   –  Limited  benefit  for  indexing  (up  to  2  CPU  cores  for  indexing)   !  Memory   –  Good  for  search  heads  and  indexers  (16+  GB)   !  Disks   –  Faster  is  bexer  (15k  rpm)   –  More  disks  in  RAID  1+0  =  faster   24  
  • 25. Performance  and  Sizing  Tips   System  change   Search  Speed     Indexing  Speed   Faster  disks   ++   ++   Add  an  indexer   ++   ++   Add  a  search  head   +   Report  acceleraAon/ summaries   ++   25  
  • 26. Performance  and  Sizing  Tips   System  change   Search  Speed     OpAmize  searches   +++   OpAmize  field  extracAon   Indexing  Speed   +   +   OpAmize  input  parsing   +   Faster  CPU   26   +  
  • 27. Capacity  à  Architecture   !  Sizing  recipe   –  Capacity   –  Rules  of  thumb  determines  number  of  servers   !  Building  blocks  for  architecture   27  
  • 28. Architecture  Factors   !  !  !  !  !  !  !  What  are  my  sizing  requirements?   Where  is  the  data?   Where  are  the  users?   What  is  the  security  policy?   What  are  the  retenAon  and  compliance  policies?   What  is  the  availability  requirement?   What  about  the  cloud?   28  
  • 29. Architecture  Factors   !  What  are  my  sizing  requirements?       –  Data  capacity   –  Search  capacity   –  User  capacity   !  Obtained  from  the  sizing  process   29  
  • 30. Architecture  Factors   !  Where  is  the  data?   –  –  –  –  –  !  Local  or  remote  to  the  indexing  machine   If  remote  –  use  forwarders  when  possible   Index  in  local  data  center  (zone)  or  index  centrally   Persist  network  data  to  disk  as  a  best  pracAce   Use  intermediate  forwarders  to  distribute  data   Where  are  the  users?   –  User  experience  affected  by  search  head  locaAon   ê  Time  zone  tuning  (5.x  +)   ê  Distributed  search  over  LAN  vs  WAN   30  
  • 31. Architecture  Factors   !  What  is  the  Security  Policy?   –  Apply  user  security  policies   ê  Auth  method   ê  Roles   ê  Filters   –  Apply  physical  security  policies   ê  Index  locaAon   31  
  • 32. Architecture  Factors   !  RetenAon,  compliance,  governance   –  Where  is  the  data  allowed  to  be?   –  Where  is  the  data  not  allowed  to  go?   –  Where  must  the  data  go?   !  Availability   –  Local  failover,  fault-­‐tolerance,  clustering   –  Geographic  disaster  recovery/fault-­‐tolerance   –  Index  replicaAon!   32  
  • 33. Architecture  Factors   !  !  Same  old  story   Cloud  consideraAons   –  –  –  –  AuthenAcaAon  restricAons   Data  transfer  costs   Security  –  SSL  tunnel   Zones   33  
  • 35. Architecture  Factors  à  Topology   Topology  Examples   Centralized   Decentralized   35   Hybrid  
  • 36. Centralized  Topology   Search Head Pooling Indexers Forwarders Intermediate Forwarder Forwarders Syslog Devices 36  
  • 39. Scaling  and  Expansion   !  Add  to  your  indexer  pool  for  more  performance  or  capacity   –  Mixed  pla{orm  and  hardware  is  okay   !  Use  search  head  pooling  for  more  UI  capacity   –  Requires  NFS   !  Create  new  indexes  for  new  data  types   –  Follows  best  pracAces   39  
  • 40. Index  ReplicaAon  (aka  Clustering)   !  What  is  it?   –  Indexes  are  replicated  to  1  or  more  indexers  (tunable)   –  Splunk  controlled   !  Basics   –  Master  node  (manages  indexing  and  searching  locaAon)   –  Distributed  deployment   –  NOT  =  “index  and  forward  “   !  HA  license  VS.  index  replicaAon   –  HAL  –  Separate  fully  funcAoning  Splunk  deployments   –  IR  –  Data  is  made  available  on  1  or  more  indexers   40  
  • 41. Index  ReplicaAon     !  Rule  of  thumb:    1  per  50  GB/day   –  Assume  simple  replicaAon  (2  in  existence)   –  Increase  in  I/O,  CPU,  and  disk  requirement   !  Need  more  indexers  if:   –  Increase  in  replicaAon  factor   –  Performance  or  capacity  needs  (search  and  index)   41  
  • 42. Index  ReplicaAon  (aka  –  Clustering) Cluster Master Search Head Pooling Peer Nodes Forwarders 42  
  • 43. Index  ReplicaAon   !  !  !  !  Data  is  replicated  and  made  available   WAN  configuraAon  is  not  recommended   Careful  consideraAon  when  inserted  into  standard  topologies   Increases     –  Storage  requirement   –  I/O  requirement  (disk,  network)   –  Total  indexer  requirement   !  Disaster  recovery  and  high  availability  .conf  session   43  
  • 44. Final  Thoughts   !  Sizing  is  more  than  data  volume  –  it’s  also  search  load   Centralized  architecture  is  the  baseline   !  VariaAons  on  architecture  are  driven  by   !  –  –  –  –  –  Sizing   Data  locaAon   User  locaAon   RetenAon/access/governance   Availability  requirements   44  
  • 45. Next  Steps   1   Download  the  .conf2013  Mobile  App   If  not  iPhone,  iPad  or  Android,  use  the  Web  App     2   Take  the  survey  &  WIN  A  PASS  FOR  .CONF2014…  Or  one  of  these   bags!   3     View  the  sessions  listed  on  the  next  slide   Available  on  the  Mobile  App     45  
  • 46. More  InformaAon   !  !  !  !  Contact:    syep@splunk.com     DocumenaAon:    hxp://docs.splunk.com   Answers:    hxp://answers.splunk.com   Other  presentaAons   –  Best  PracAces:  Deploying  Splunk  on  Physical,  Virtual,  and  Cloud  Environments   –  ArchitecAng  Splunk  for  High  Availability  and  Disaster  Recovery   46