Week 6 (Big Data)
Part A (4 marks)
Exercise 1: Data Science (1 mark)
Read the following article at and answer these questions:
1. What is Data Science?
2. According to IBM's estimate, what percentage of the data in the world today was created in the past two years?
3. What is the value of petabyte storage?
Exercise 2: Characteristics of Big Data (2 marks)
Read the following research paper from IEEE Xplore Digital Library
Ali-ud-din Khan, M.; Uddin, M.F.; Gupta, N., "Seven V's of Big Data understanding Big Data to extract value," 2014 Zone 1 Conference of the American Society for Engineering Education (ASEE Zone 1), pp. 1-5, 3-5 April 2014
and answer the following questions:
Summarise the authors' motivation (in one paragraph).
What are the seven V's mentioned in the paper? Briefly describe each V in one paragraph.
Explore the authors' future work using reference [4] in the research paper. Summarise your understanding of how Big Data can improve the healthcare sector in 300 words.
Exercise 3: Big Data Platform (1 mark)
To build a big data platform, one has to obtain, organize and analyze the big data. Study the given links and answer the following questions based on them:
Please note: You are encouraged to watch all the videos in the series from Oracle.
How can enterprises acquire big data, and how can it be used?
How can big data be organized and handled?
What analyses can be done using big data?
Part B (4 marks)
Answers to Part B should be based on well-cited articles/videos; name the references used in your answer.
Exercise 4: Big Data Products (1 mark)
Google is an expert at generating data products. Here are a few examples from Google. Define the products listed below and explain how large-scale data is used efficiently in each of them.
a. Google’s PageRank
b. Google Trends
c. Google Flu Trends
d. Google's Spell Checker
e. Like Google, Facebook and LinkedIn also use large-scale data effectively. How?
Exercise 5: Big Data Tools (2 marks)
1. Briefly explain why a traditional relational database (RDBMS) is not effective for storing big data.
2. What is a NoSQL database?
3. Name and briefly describe at least 5 NoSQL databases.
4. What is MapReduce and how does it work?
5. Briefly describe some notable MapReduce products (at least 5).
6. Amazon's S3 service lets you store large chunks of data online. List 5 features of Amazon's S3 service.
7. Getting concise, valuable information from a set of data can be challenging. We need statistical analysis tools to deal with Big Data. Name and describe at least 3 statistical analysis tools.
Exercise 6: Big Data Application (1 mark)
Name 3 industries that should use Big Data. Justify your claim in 250 words for each industry, using proper references.