Watch the video at Bloomberg
Our group works on smarter healthcare project. Our task is to find all repetitive sequences in human and rat chromosomes. Those sequences are called ultra conserved elements (UCE) which highly preserved in genome, remaining unchanged for nearly 300 million years. By setting up our own Hadoop Cluster (one master node and seven data nodes), we can run Java MapReduce program on Large Scale Genome DataSet in short period of time. We designed and implemented our Java MapReduce program to obtain UCE and stored it to HBase.
Big data is the term for a collection of data sets so large and complex that it becomes difficult to process using on-hand database management tools or traditional data processing applications. It has been the buzzword of the last couple years, and many businesses today want to “Get Started with Big Data”.
When I start discussing with clients what they want to do with big data, more often than not I get puzzled looks. It is important to have the preparedness to “get started with big data” by having:
- Pinpointed a line of business in the company to get started with the use cases
- Identified use cases that serves a true business needs within that line of business
- Verified that there’s indeed large amounts of meaningful data available to support the use cases
Above is the snapshot of the mindmap showing some sample use cases. You can download the original mindmap from the links at the original post.
This is an interesting analysis on adoption of smarter planet solutions by various industries. It leverages a mindmap to organize the challenges and advantages for industries in embracing the smarter systems.
Here’s the snapshot of the mindmap from the link: