Like many other scientific disciplines, astronomy has witnessed a tremendous growth over the past two decades. As a result, astronomers have become very efficient at creating massive datasets that describe the properties of our nearby universe. Given its primarily visual focus, and its potential to address fundamental questions about humanity, astronomy is in the unique position to be an ideal testbed for algorithms and techniques that address "big data" problems. As a result, the astronomical community, in coordination with all NASA data centers, is trying to cope with "big data" by making use of novel approaches to data mining, data visualization, and data distribution.
Here I will present two applications that showcase how astronomers are tackling the problems associated with the wealth of data at their disposal. First I will describe how GPUs, coupled with clever image processing and Mathematica, are helping the search for extraterrestrial planets. Then I will show how citizen science is changing the social fabric of astronomy and redefining what scientific questions can be addressed by large datasets.
25. planet-star separation. The effective sen- energy from the recently formed planets. During
Download
HR 8799bcd discov-
es after the light from
ht host star has been
d by ADI processing.
left) A Keck image
in July 2004. (Upper
emini discovery ADI
cquired in October
oth b and c are de-
at the two epochs.
m) A color image of
tary system produced
ining the J-, H-, and
images obtained at
telescope in July (H)
tember (J and Ks)
he inner part of the
image has been ro-
1° to compensate for
l motion of d between
September. The central
masked out in the up-
es but left unmasked
ower to clearly show
kle noise level near d.
26. Target Image Reference Image
20 40 60 80 20 40 60 80
80 80 80 80
60 60 60 60
40 40 40 40
20 20 20 20
20 40 60 80 20 40 60 80
Science target with its three Reference star with no planet
Planets alone
planets (invisible for now) (diffraction pattern)
CREDIT: STScI/Remi Soummer
27. Target ReferenceImage
Target Image Reference Image 20 40 60 80
20 40 60 80 20 40 60 80
80 80
80 80 80 80
60 60
60 60 60 60
40 40 40 40
40 40
20 20 20 20 20 20
20 40 60 80 20 40 60 80 20 40 60 80
Science target with its three Reference star with no planet PSF subtractions residuals
planets (invisible for now) (diffraction pattern)
This is why these planets were not found in 1998
CREDIT: STScI/Remi Soummer
29. Lafrenière 2009 Classical subtraction
Coronagraphic
data
1998 NICMOS orientation 1 1998 NICMOS orientation 2
Figure 1: Note: all images (excluding top middle) are on the same scale. Top left: NICMOS coronagraphic data from
CREDIT: STScI/Remi
HR8799 from program 7226 (cycle 7). Top Middle: Left: (black & white image) Detection of HR8799b in the same data Soummer
32. understood data
all data
mysteries
Tomorrow’s hope
Investing in Human-Computer relations
cool mysteries
understood data
CREDIT: Zooniverse/Pamela Gay
33. understood data
all data
mysteries
Tomorrow’s hope
Investing in Human-Computer relations
cool mysteries
understood data
CREDIT: Zooniverse/Pamela Gay
34. What is Citizen Science?
Science Problem
+ Volunteers from the
Public
New Knowledge
CREDIT: Zooniverse/Pamela Gay
45. Give peas a chance! Green compact galaxies
CREDIT: Zooniverse/Pamela Gay
46.
47. : Project Overview
Tasks:
Mark Craters
Indicate Boulderiness
(None, Some, Many)
Mark Spacecraft Debris
Mark Crater Features
(Bench / Mound / Flat,
Dark haloed, Fresh white,
elongate pits)
Mark Linear Features
(Boulder tracks, Crater chain,
Sinuous channels,
Other linear feature)
48. : Project Overview
Tasks:
Mark Craters
Indicate Boulderiness
(None, Some, Many)
Mark Spacecraft Debris
Mark Crater Features
(Bench / Mound / Flat,
Dark haloed, Fresh white,
elongate pits)
Mark Linear Features
(Boulder tracks, Crater chain,
Sinuous channels,
Other linear feature)
49.
50.
51.
52. *!!!!"
)!!!!" More than
(!!!!"
60,000 images
'!!!!"
classified by
&!!!!"
1022 people
between
%!!!!"
$!!!!"
#!!!!"
9/15 and 9/20
!"
'+&+#!"!,!!" '+$&+#!"!,!!" (+#%+#!"!,!!" )+%+#!"!,!!" )+$%+#!"!,!!" *+#$+#!"!,!!" -+#+#!"!,!!" -+$#+#!"!,!!" #!+##+#!"!,!!"
53. This research was funded funded under NASA ROSES grant NNX09AD34G and NSF DRL 0917608.
Additional support provided by the NASA Lunar Science Institute and the Sloan Digital Sky Survey.
54. HUBBLE MISSION
PROJECT
AN X-BOX GAME PROTOTYPE
DATE CLIENT
2010-08-19 KYLE HARRIS, MATT KAISER & ALBERTO CONTI
These outstanding results give us hope that as we move into tomorrows’ flood, citizen scientists will be able to play a meaningful role in looking for the new discoveries hiding in a sea of objects computers can’t quite handle. \n\nBut so far, I’ve only asked 1 of my two key questions. While I can say “Yes - they can contribute meaningful data” I haven’t yet answered, can they do it in the need amounts. \n
These outstanding results give us hope that as we move into tomorrows’ flood, citizen scientists will be able to play a meaningful role in looking for the new discoveries hiding in a sea of objects computers can’t quite handle. \n\nBut so far, I’ve only asked 1 of my two key questions. While I can say “Yes - they can contribute meaningful data” I haven’t yet answered, can they do it in the need amounts. \n
These outstanding results give us hope that as we move into tomorrows’ flood, citizen scientists will be able to play a meaningful role in looking for the new discoveries hiding in a sea of objects computers can’t quite handle. \n\nBut so far, I’ve only asked 1 of my two key questions. While I can say “Yes - they can contribute meaningful data” I haven’t yet answered, can they do it in the need amounts. \n
These outstanding results give us hope that as we move into tomorrows’ flood, citizen scientists will be able to play a meaningful role in looking for the new discoveries hiding in a sea of objects computers can’t quite handle. \n\nBut so far, I’ve only asked 1 of my two key questions. While I can say “Yes - they can contribute meaningful data” I haven’t yet answered, can they do it in the need amounts. \n
The idea of citizen science isn’t new, but it is having a modern day renaissance. If you’re like me, you may have heard this particular phrase - citizen science - for the first time only in the past couple of years. While you may not have heard this trendy meme before, you probably did know people doing it. You might have known bird watchers participating in the Great Backyard Bird Count or the Christmas Day Bird Count and reporting their results to the Audubon Society or the Cornell Lab of Ornithology. You might have friends who are amateur astronomers who chase novae or track asteroids and report their data to the American Association of Variable Star Astronomers or the Minor Planet Center. All these people who are acquiring data and sharing what they see are citizen scientists. This is what citizen science is: Everyday people making it possible for questions to be answered through their contributions of work, data, and time. \n
Solar storm watch builds on the Galaxy Zoo project, which is perhaps the largest citizen science project to date. \n
The Galaxy Zoo project is one shining example that yes, the public can make powerful contributions. Launched in 2007, Galaxy Zoo has asked the public to look at images from the Sloan Digital Sky Survey and to report what they see through a series of easy to understand buttons. In the original galaxy zoo project, users simply reported if what they were seeing was an elliptical galaxy, a spiral that was edge on, or clockwise or counterclockwise, a merger, or a star or image error. \n
\n
\n
\n
The current galaxy zoo project is a bit more complicated, asking users to answer a series of more detailed questions to allow us to better understand the details of the brightest and galaxies in the SDSS. \n\nThe classifications made by people for GZ1 and GZ2 have thus far lead to over 20 submitted and accepted journal articles. In Lintott et al 2008, it was shown that by asking 30 or more people to classify each of our galaxies, we can get an aggregate result that is statistically as good, and sometimes even better, than comes from a single professional classifying a collection of images. Beyond just allowing a better statistical understanding of the distribution of galaxies, this project has also made a series of unexpected discoveries and created statistically significant samples of rare objects.\n
We have have found roughly 1000 blue elliptical galaxies\n\nSFR 0.5-50 solar masses / year\n0.02<z <0.05, corresponding to luminosities of approximately L*&#x2217;and above\nBlue early-type galaxies tend to live in lower-density environments than &#x2018;normal&#x2019; red sequence early-types and makeup 5.7&#xB1;0.4% of the low-redshift early-type galaxy population.\n\n
We have built a collection of 3000 galaxies in various stages of a major merger\n
30\n3000 candidates\n1000 good\n
One of our users, Hanny Van Arkel - A dutch elementary school teacher - discovered the first known light echo of a now dormant quasar.\n
And a group of users in the forums, bent on being puny, found a novel class of small, star bursting galaxies, affectionately called &#x201C;peas.&#x201D;\n\n439\n