SlideShare uma empresa Scribd logo
1 de 25
A Cryptocurrency with DNA
Sequence Alignment as Proof-of-
work
Halil I. Ozercan*, Atalay M. Ileri*, Alper Gundogdu, A.
Kerim Senol, M. Yusuf Ozkaya, Can Alkan
Bilkent University, Ankara, Turkey
Atalay Mert Ileri
Idea, protocols
Introduction to Research course project
H. Ibrahim Ozercan, M. Yusuf Ozkaya,
A. Kerim Senol, Alper Gundogdu
Developers, Bitcoin enthusiasts
Senior Design Project
Undergrad power
No grad students were harmed during the making of this project
HTS read alignment
 Aligning HTS reads is a compute intensive
task
 ~35 CPU days per 30X genome using BWA
 ~18K human genomes / year can be sequenced
using HiSeqX Ten
 630K CPU days = ~1800 CPU years per HiSeqX Ten
 Estimated 1 million genomes by the end of 2017
 35 million CPU days = ~100K CPU years for alignment
only
HTS read alignment (2)
 Additionally, reference human genome gets an
update every 3-4 years
 Fixes minor alleles
 Fixes collapsed duplications
 Fixes contig orientation (i.e. incorrect inversions)
 Adds new sequence
 For better reliability it is best to remap existing
data to new reference
 All 1000 Genomes Project data are being remapped to
GRCh38
Remapping old, or mapping new?
 Large clusters are not infinite resources
 While remapping old data, more new data are
generated, which typically have higher priority
 Computational burden keeps increasing
Proposal: volunteer grid computing
Volunteer grid computing: BOINC
 Berkeley Open Infrastructure Network Computing
 Volunteers download “problem sets” from the server,
solve them in “spare time”, upload results back
 Made popular with the SETI@home project
 Some bioinformatics applications are ported
(Rosetta@home, RNAworld, DENIS@home)
 Total computational power of 8.68 PetaFLOPs
Read mapping w/BOINC
 Data privacy, making sure
the alignments are correct,
other potential problems
 Main Problem: HTS read
mapping uses more
compute resources on
CPU, RAM, and disk. More
unlikely for volunteers to
dedicate such resources
 Solution: Motivating
volunteers
Cryptocurrencies
 Digital “money” that uses cryptography to
ensure security in transactions and to control
creation of new units.
 Bitcoin, Dogecoin, Litecoin, etc.
 Two parts
 Mining: generation of new “block”s
 Transaction: money exchange between peers
Bitcoin
 Most popular cryptocurrency
 Invented in 2008, open-source software in
2009
 Block chain is the source of transactions
 Completely decentralized
 In 2013: 2,798,377 GH/s
 As of now: 353,633,397 GH/s
Useful in Amsterdam
Bitcoin blocks
Nonce: a number such that when the block content is hashed with the nonce, the
result is numerically smaller than the difficulty target.
Proof-of-work: finding the nonce.
• Hard to calculate
• Easy to verify
Coinami: BOINC/Bitcoin hybrid
 Calculating the nonce in Bitcoin is simply
burning up compute power.
 No practical use.
 Idea: replace the nonce calculation with
something useful, while keeping the rest of
the cryptocurrency intact
 Coinami: Coin-Application Mediator Interface
 “Application” can be anything that is hard to
compute, easy to verify
Coinami: Features
 Not decentralized, but many-centralized.
 Approved sequencing centers are signing authorities
 Root authority merely keeps track of the signing authorities
 Multiplexing reads from multiple samples prevent FASTQ file
reconstruction & enables data privacy
 BWA read aligner, but can be changed
 Uses decoy reads for verification: real reads with previously-known
alignment locations.
 Used to check whether the returned BAM is real BWA output, or forged.
 Read names are also encrypted, not possible to distinguish run IDs,
sample names, decoy vs. queries
 Demultiplexing samples and verification (decoy map checking) are
done simultaneously
 O(1) verification
Coinami: Mining
Coinami Workflow
Coinami Workflow
Coinami Workflow
Coinami Workflow
Sample Job
//These two reads are coming from SAMPLE 1
@SAMPLE1.Read425/1
CCTTNATACTTCCTGGACACCAACTGTTATACNNNGGNNNNNNNNNNNNAATGTCNNNNNCCTGGCCTTTCAAAAGCATAGGGGAATAAATTNNTCAATAA
+
CCCC#EEEEEHHHHHHHHHHHHHHGHHHHH@@###69############;>;<;=#####:9;;;HDHHHEDAEDEEEEEEHHHEEHGGH48##7:<=:<H
@ SAMPLE1.Read425356/1
ACCTAGAAGGCATGAAAAGATTAAGGAAATTTTTTAAAAAGATATTCAATGAAGAAAATATTTTGTTTTGGCTAGCATGTAAAGATTTCTTTTTTTAATGC
+
HHHHGHHHHHHHHGHHHHHHHHHHHHHHHHHHHHHHHHHHGHHDHHHGHHFGHHGHHHHFHHHHHB@DDEHHEEDGE8EDFFFIF@GGGGHHHHHHFGFHB
//These are from SAMPLE 2
@SAMPLE2.Read2340294/1
GGTAACGCTCTATGATCCAGTCGATTTTCAGAGAGACGATGGCCGAGAGATCCGGCTTACGACACTGCCCAAGGGATTAGTAGAACAACAGTGCCACAGGA
+
D@5EEGGGGBFFD8GBDDCDFEEBDADDD########################################################################
@ SAMPLE2.Read4983594/1
GGTGATGACCATGTTTTTGGTTTATCGGCGGCCCCCCCCGCTGGCGGGGGTTTTTTTGCTATCCACCATTTTGGCGGCGCACCACTCTTGAGGGTGGTGCA
+
>6,6/:@;;>BEFAGGGGE7FGDCD?E=CD#######################################################################
//DECOY read
@DECOY.1.156433.100M.MD:Z:35T64/1
AGACAAGGCAAATTAAAGGTTTAGTAAGCTAAGTGTTCATGAACACATGACAAAAACGTGCCTGCTACTATTGTTGGGTGGCATTCTATAAATGAAATTAA
+
HHHHHHHHHHHEDHHGHFHCHHFHHGHHHHFCFFHHHFHFHHHHFHHHHHHFHHHGHEFGHD@HCEGG@FFFHEDGFG<EGEEFG=GEEFEGGG=G@GEFF
Sample Job - Encrypted
@BF0C691315C8761672AEBD1F2A42ED43B4D0F9197BD3209B6CC13B27711CC946B21C6DAE1A008F75508C290B1C324EDB/1
TTGCTAAATATGCTGAAATATTCGGATTGACCTCTGCGGAAGCCAGTAAGGATATACGGCAGGGATTGAAGAGTTTCGCCGGGGAGGGAG
GGGGTTTTTAT
+
GGFFGGGFGGGGFEFFE?GGGGDFGGGGGGGBGGBFGGGGBFEEFGA?GG8DD=DFGGGFFFB##################################
####
@C480AC6C6D59F77BB873186F1A5E524039D3FFE6567A40559D9434D888FAF7239FF2ECEFD07C79B2762E777D2A074BB3/1
GCCCTCACCGACTGCCATTGTCCCTAATGCACCGTAACGGGTGTGGCTGTCTGAGCCGAGGCATATTTTTGCGCCGCCTGGCATTATCTC
CAGCACATATT
+
F@DCFB@ABBDB=CD>BDC8@4@@?<EFFDFFFBDEEAEEEEE=EDDBDA###################################################
@A78878C3BE292C0FE0F3E64D2AE9FB2640FFC6D006BC15CF107EA587DD6F0E0395E7F3ECA36A7A867C0DA19D16585146/1
GAAGAGAGCTTTATGAGTCTCATGGCTAAATCTACACTGATGAGGGCAGTGACCCGGAGGCTGGTTTATTAGTATGAAAAAGTACGTCCAC
TGATAAAACT
+
FEE=FF@EE8CDDCC>@@DD299@;+>:@<19<@>E;EEE2,@:=EEE=-7,7<:ADA@9B4B46<AA#################################
@FEAB1E450AF92466520964FD2B39E052AE07D3ECCE6C92460399749F597405B2FEB75F602573E255148F745AE88145BF/1
GTTCAGGGTGAGTCGAATGATCCCTTGCCCGCATTCAGCGGAACTGTTGAATATGGGCAAATTCAGGGAACAATAGACAACTTTCAGGAAC
TCAATGTGCA
+
HHHHFHHHDFE@FFFBGGEBCGEGGFGHHFHGHGCGGHGHGHGGHCC>=FDC?CDBEEBE+>A;5@AB;?0<<0@@C@ABEEE/.@:>::.7>>>
@:6?:A
Public key encryption + base64 encoding
Future directions
 Complete decoupling of read mapping as
proof-of-work
 Docker-based plugins to change the “work”
 Miners -> employees
 Authority servers -> employers
 Root authority -> central bank
 Web-based GUI for “job descriptions”
 A job bulletin board for different employers
Conclusions
 HTS data is monotonically increasing
 Computational analysis is the bottleneck
 Additional burden due to reference updates
 But (fortunately) embarrassingly parallel problem
 Voluntary grids may help
 “Market will decide”
 Coins give motivation to miners since alignment
is compute intensive
 Decentralized transaction with centralized
mining
Resources
 Coinami web page (created as part of senior
project)
 https://coinami.github.io/
 GitHub page (code not public yet)
 https://github.com/coinami
Acknowledgements
Bilkent
Atalay Mert İleri (now at MIT)
Halil İbrahim Özercan (now senior student)
Alper Gündoğdu (now at Facebook)
Ahmet Kerim Şenol (now at Google)
M. Yusuf Özkaya (now at Georgia Tech)
Travel fellowship to Halil I. Özercan
Minin’, minin’, minin’
Though the reads are mappin’
Keep them coins signing’
Rawhide!

Mais conteúdo relacionado

Semelhante a Coinami

Other distributed systems
Other distributed systemsOther distributed systems
Other distributed systems
Sri Prasanna
 
2014 manchester-reproducibility
2014 manchester-reproducibility2014 manchester-reproducibility
2014 manchester-reproducibility
c.titus.brown
 
MLconf - Distributed Deep Learning for Classification and Regression Problems...
MLconf - Distributed Deep Learning for Classification and Regression Problems...MLconf - Distributed Deep Learning for Classification and Regression Problems...
MLconf - Distributed Deep Learning for Classification and Regression Problems...
Sri Ambati
 
Parallelism Processor Design
Parallelism Processor DesignParallelism Processor Design
Parallelism Processor Design
Sri Prasanna
 
Cryptography for Penetration Testers (PDF version)
Cryptography for Penetration Testers (PDF version)Cryptography for Penetration Testers (PDF version)
Cryptography for Penetration Testers (PDF version)
ceng
 

Semelhante a Coinami (20)

Ethcon seoul 2019 presentation final
Ethcon seoul 2019 presentation finalEthcon seoul 2019 presentation final
Ethcon seoul 2019 presentation final
 
Other distributed systems
Other distributed systemsOther distributed systems
Other distributed systems
 
CodeQL a Powerful Binary Analysis Engine
CodeQL a Powerful Binary Analysis EngineCodeQL a Powerful Binary Analysis Engine
CodeQL a Powerful Binary Analysis Engine
 
2014 nicta-reproducibility
2014 nicta-reproducibility2014 nicta-reproducibility
2014 nicta-reproducibility
 
tezos_hands-on-training.pdf
tezos_hands-on-training.pdftezos_hands-on-training.pdf
tezos_hands-on-training.pdf
 
Encode x Tezos Hack: Hands-on dApp Training
Encode x Tezos Hack: Hands-on dApp Training Encode x Tezos Hack: Hands-on dApp Training
Encode x Tezos Hack: Hands-on dApp Training
 
Data Grids with Oracle Coherence
Data Grids with Oracle CoherenceData Grids with Oracle Coherence
Data Grids with Oracle Coherence
 
Need for Async: Hot pursuit for scalable applications
Need for Async: Hot pursuit for scalable applicationsNeed for Async: Hot pursuit for scalable applications
Need for Async: Hot pursuit for scalable applications
 
Measuring Your Code
Measuring Your CodeMeasuring Your Code
Measuring Your Code
 
Measuring Your Code 2.0
Measuring Your Code 2.0Measuring Your Code 2.0
Measuring Your Code 2.0
 
2014 manchester-reproducibility
2014 manchester-reproducibility2014 manchester-reproducibility
2014 manchester-reproducibility
 
Ethereum Devcon1 Report (summary writing)
Ethereum Devcon1 Report (summary writing)Ethereum Devcon1 Report (summary writing)
Ethereum Devcon1 Report (summary writing)
 
MLconf - Distributed Deep Learning for Classification and Regression Problems...
MLconf - Distributed Deep Learning for Classification and Regression Problems...MLconf - Distributed Deep Learning for Classification and Regression Problems...
MLconf - Distributed Deep Learning for Classification and Regression Problems...
 
Zero ETL analytics with LLAP in Azure HDInsight
Zero ETL analytics with LLAP in Azure HDInsightZero ETL analytics with LLAP in Azure HDInsight
Zero ETL analytics with LLAP in Azure HDInsight
 
Parallelism Processor Design
Parallelism Processor DesignParallelism Processor Design
Parallelism Processor Design
 
Clouds: All fluff and no substance?
Clouds: All fluff and no substance?Clouds: All fluff and no substance?
Clouds: All fluff and no substance?
 
Cryptography for Penetration Testers (PDF version)
Cryptography for Penetration Testers (PDF version)Cryptography for Penetration Testers (PDF version)
Cryptography for Penetration Testers (PDF version)
 
[ETHCon Korea 2019] Lee heungno 이흥노
[ETHCon Korea 2019] Lee heungno 이흥노[ETHCon Korea 2019] Lee heungno 이흥노
[ETHCon Korea 2019] Lee heungno 이흥노
 
Chronicle accelerate building a digital currency
Chronicle accelerate   building a digital currencyChronicle accelerate   building a digital currency
Chronicle accelerate building a digital currency
 
Integration Patterns for Big Data Applications
Integration Patterns for Big Data ApplicationsIntegration Patterns for Big Data Applications
Integration Patterns for Big Data Applications
 

Último

Bacterial Identification and Classifications
Bacterial Identification and ClassificationsBacterial Identification and Classifications
Bacterial Identification and Classifications
Areesha Ahmad
 
biology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGYbiology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGY
1301aanya
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
POGONATUM : morphology, anatomy, reproduction etc.
POGONATUM : morphology, anatomy, reproduction etc.POGONATUM : morphology, anatomy, reproduction etc.
POGONATUM : morphology, anatomy, reproduction etc.
Silpa
 
Digital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptxDigital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptx
MohamedFarag457087
 
The Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptxThe Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptx
seri bangash
 

Último (20)

Bacterial Identification and Classifications
Bacterial Identification and ClassificationsBacterial Identification and Classifications
Bacterial Identification and Classifications
 
Factory Acceptance Test( FAT).pptx .
Factory Acceptance Test( FAT).pptx       .Factory Acceptance Test( FAT).pptx       .
Factory Acceptance Test( FAT).pptx .
 
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticsPulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
 
biology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGYbiology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGY
 
Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
An introduction on sequence tagged site mapping
An introduction on sequence tagged site mappingAn introduction on sequence tagged site mapping
An introduction on sequence tagged site mapping
 
GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)
 
POGONATUM : morphology, anatomy, reproduction etc.
POGONATUM : morphology, anatomy, reproduction etc.POGONATUM : morphology, anatomy, reproduction etc.
POGONATUM : morphology, anatomy, reproduction etc.
 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)
 
Site Acceptance Test .
Site Acceptance Test                    .Site Acceptance Test                    .
Site Acceptance Test .
 
Digital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptxDigital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptx
 
300003-World Science Day For Peace And Development.pptx
300003-World Science Day For Peace And Development.pptx300003-World Science Day For Peace And Development.pptx
300003-World Science Day For Peace And Development.pptx
 
The Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptxThe Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptx
 
Zoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdfZoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdf
 
module for grade 9 for distance learning
module for grade 9 for distance learningmodule for grade 9 for distance learning
module for grade 9 for distance learning
 
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
 
Clean In Place(CIP).pptx .
Clean In Place(CIP).pptx                 .Clean In Place(CIP).pptx                 .
Clean In Place(CIP).pptx .
 
GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)
 
PATNA CALL GIRLS 8617370543 LOW PRICE ESCORT SERVICE
PATNA CALL GIRLS 8617370543 LOW PRICE ESCORT SERVICEPATNA CALL GIRLS 8617370543 LOW PRICE ESCORT SERVICE
PATNA CALL GIRLS 8617370543 LOW PRICE ESCORT SERVICE
 

Coinami

  • 1. A Cryptocurrency with DNA Sequence Alignment as Proof-of- work Halil I. Ozercan*, Atalay M. Ileri*, Alper Gundogdu, A. Kerim Senol, M. Yusuf Ozkaya, Can Alkan Bilkent University, Ankara, Turkey
  • 2. Atalay Mert Ileri Idea, protocols Introduction to Research course project H. Ibrahim Ozercan, M. Yusuf Ozkaya, A. Kerim Senol, Alper Gundogdu Developers, Bitcoin enthusiasts Senior Design Project Undergrad power No grad students were harmed during the making of this project
  • 3. HTS read alignment  Aligning HTS reads is a compute intensive task  ~35 CPU days per 30X genome using BWA  ~18K human genomes / year can be sequenced using HiSeqX Ten  630K CPU days = ~1800 CPU years per HiSeqX Ten  Estimated 1 million genomes by the end of 2017  35 million CPU days = ~100K CPU years for alignment only
  • 4. HTS read alignment (2)  Additionally, reference human genome gets an update every 3-4 years  Fixes minor alleles  Fixes collapsed duplications  Fixes contig orientation (i.e. incorrect inversions)  Adds new sequence  For better reliability it is best to remap existing data to new reference  All 1000 Genomes Project data are being remapped to GRCh38
  • 5. Remapping old, or mapping new?  Large clusters are not infinite resources  While remapping old data, more new data are generated, which typically have higher priority  Computational burden keeps increasing Proposal: volunteer grid computing
  • 6. Volunteer grid computing: BOINC  Berkeley Open Infrastructure Network Computing  Volunteers download “problem sets” from the server, solve them in “spare time”, upload results back  Made popular with the SETI@home project  Some bioinformatics applications are ported (Rosetta@home, RNAworld, DENIS@home)  Total computational power of 8.68 PetaFLOPs
  • 7. Read mapping w/BOINC  Data privacy, making sure the alignments are correct, other potential problems  Main Problem: HTS read mapping uses more compute resources on CPU, RAM, and disk. More unlikely for volunteers to dedicate such resources  Solution: Motivating volunteers
  • 8. Cryptocurrencies  Digital “money” that uses cryptography to ensure security in transactions and to control creation of new units.  Bitcoin, Dogecoin, Litecoin, etc.  Two parts  Mining: generation of new “block”s  Transaction: money exchange between peers
  • 9. Bitcoin  Most popular cryptocurrency  Invented in 2008, open-source software in 2009  Block chain is the source of transactions  Completely decentralized  In 2013: 2,798,377 GH/s  As of now: 353,633,397 GH/s
  • 11. Bitcoin blocks Nonce: a number such that when the block content is hashed with the nonce, the result is numerically smaller than the difficulty target. Proof-of-work: finding the nonce. • Hard to calculate • Easy to verify
  • 12. Coinami: BOINC/Bitcoin hybrid  Calculating the nonce in Bitcoin is simply burning up compute power.  No practical use.  Idea: replace the nonce calculation with something useful, while keeping the rest of the cryptocurrency intact  Coinami: Coin-Application Mediator Interface  “Application” can be anything that is hard to compute, easy to verify
  • 13. Coinami: Features  Not decentralized, but many-centralized.  Approved sequencing centers are signing authorities  Root authority merely keeps track of the signing authorities  Multiplexing reads from multiple samples prevent FASTQ file reconstruction & enables data privacy  BWA read aligner, but can be changed  Uses decoy reads for verification: real reads with previously-known alignment locations.  Used to check whether the returned BAM is real BWA output, or forged.  Read names are also encrypted, not possible to distinguish run IDs, sample names, decoy vs. queries  Demultiplexing samples and verification (decoy map checking) are done simultaneously  O(1) verification
  • 19. Sample Job //These two reads are coming from SAMPLE 1 @SAMPLE1.Read425/1 CCTTNATACTTCCTGGACACCAACTGTTATACNNNGGNNNNNNNNNNNNAATGTCNNNNNCCTGGCCTTTCAAAAGCATAGGGGAATAAATTNNTCAATAA + CCCC#EEEEEHHHHHHHHHHHHHHGHHHHH@@###69############;>;<;=#####:9;;;HDHHHEDAEDEEEEEEHHHEEHGGH48##7:<=:<H @ SAMPLE1.Read425356/1 ACCTAGAAGGCATGAAAAGATTAAGGAAATTTTTTAAAAAGATATTCAATGAAGAAAATATTTTGTTTTGGCTAGCATGTAAAGATTTCTTTTTTTAATGC + HHHHGHHHHHHHHGHHHHHHHHHHHHHHHHHHHHHHHHHHGHHDHHHGHHFGHHGHHHHFHHHHHB@DDEHHEEDGE8EDFFFIF@GGGGHHHHHHFGFHB //These are from SAMPLE 2 @SAMPLE2.Read2340294/1 GGTAACGCTCTATGATCCAGTCGATTTTCAGAGAGACGATGGCCGAGAGATCCGGCTTACGACACTGCCCAAGGGATTAGTAGAACAACAGTGCCACAGGA + D@5EEGGGGBFFD8GBDDCDFEEBDADDD######################################################################## @ SAMPLE2.Read4983594/1 GGTGATGACCATGTTTTTGGTTTATCGGCGGCCCCCCCCGCTGGCGGGGGTTTTTTTGCTATCCACCATTTTGGCGGCGCACCACTCTTGAGGGTGGTGCA + >6,6/:@;;>BEFAGGGGE7FGDCD?E=CD####################################################################### //DECOY read @DECOY.1.156433.100M.MD:Z:35T64/1 AGACAAGGCAAATTAAAGGTTTAGTAAGCTAAGTGTTCATGAACACATGACAAAAACGTGCCTGCTACTATTGTTGGGTGGCATTCTATAAATGAAATTAA + HHHHHHHHHHHEDHHGHFHCHHFHHGHHHHFCFFHHHFHFHHHHFHHHHHHFHHHGHEFGHD@HCEGG@FFFHEDGFG<EGEEFG=GEEFEGGG=G@GEFF
  • 20. Sample Job - Encrypted @BF0C691315C8761672AEBD1F2A42ED43B4D0F9197BD3209B6CC13B27711CC946B21C6DAE1A008F75508C290B1C324EDB/1 TTGCTAAATATGCTGAAATATTCGGATTGACCTCTGCGGAAGCCAGTAAGGATATACGGCAGGGATTGAAGAGTTTCGCCGGGGAGGGAG GGGGTTTTTAT + GGFFGGGFGGGGFEFFE?GGGGDFGGGGGGGBGGBFGGGGBFEEFGA?GG8DD=DFGGGFFFB################################## #### @C480AC6C6D59F77BB873186F1A5E524039D3FFE6567A40559D9434D888FAF7239FF2ECEFD07C79B2762E777D2A074BB3/1 GCCCTCACCGACTGCCATTGTCCCTAATGCACCGTAACGGGTGTGGCTGTCTGAGCCGAGGCATATTTTTGCGCCGCCTGGCATTATCTC CAGCACATATT + F@DCFB@ABBDB=CD>BDC8@4@@?<EFFDFFFBDEEAEEEEE=EDDBDA################################################### @A78878C3BE292C0FE0F3E64D2AE9FB2640FFC6D006BC15CF107EA587DD6F0E0395E7F3ECA36A7A867C0DA19D16585146/1 GAAGAGAGCTTTATGAGTCTCATGGCTAAATCTACACTGATGAGGGCAGTGACCCGGAGGCTGGTTTATTAGTATGAAAAAGTACGTCCAC TGATAAAACT + FEE=FF@EE8CDDCC>@@DD299@;+>:@<19<@>E;EEE2,@:=EEE=-7,7<:ADA@9B4B46<AA################################# @FEAB1E450AF92466520964FD2B39E052AE07D3ECCE6C92460399749F597405B2FEB75F602573E255148F745AE88145BF/1 GTTCAGGGTGAGTCGAATGATCCCTTGCCCGCATTCAGCGGAACTGTTGAATATGGGCAAATTCAGGGAACAATAGACAACTTTCAGGAAC TCAATGTGCA + HHHHFHHHDFE@FFFBGGEBCGEGGFGHHFHGHGCGGHGHGHGGHCC>=FDC?CDBEEBE+>A;5@AB;?0<<0@@C@ABEEE/.@:>::.7>>> @:6?:A Public key encryption + base64 encoding
  • 21. Future directions  Complete decoupling of read mapping as proof-of-work  Docker-based plugins to change the “work”  Miners -> employees  Authority servers -> employers  Root authority -> central bank  Web-based GUI for “job descriptions”  A job bulletin board for different employers
  • 22. Conclusions  HTS data is monotonically increasing  Computational analysis is the bottleneck  Additional burden due to reference updates  But (fortunately) embarrassingly parallel problem  Voluntary grids may help  “Market will decide”  Coins give motivation to miners since alignment is compute intensive  Decentralized transaction with centralized mining
  • 23. Resources  Coinami web page (created as part of senior project)  https://coinami.github.io/  GitHub page (code not public yet)  https://github.com/coinami
  • 24. Acknowledgements Bilkent Atalay Mert İleri (now at MIT) Halil İbrahim Özercan (now senior student) Alper Gündoğdu (now at Facebook) Ahmet Kerim Şenol (now at Google) M. Yusuf Özkaya (now at Georgia Tech) Travel fellowship to Halil I. Özercan
  • 25. Minin’, minin’, minin’ Though the reads are mappin’ Keep them coins signing’ Rawhide!

Notas do Editor

  1. Waiting transactions are added to block by the authority. However, the client is responsible for broadcasting the block. It lowers the networking cost on server.