Big Data Practice College Quiz 2 Questions
Question 1
______ is a familiar example of a data product based on well-built predictive models that do not achieve an optimal objective.
Select one
- a.Recommendation engine
- b.Predictive Revenue Modeling
- c.Python data products
- d.Big data forecasting
The correct answer is: Recommendation engine
Question 2
A four-step approach which is used to build or design data products:
- a.Big Data Principle Approach
- b.Drivetrain Approach
- c.Predict-and-optimise framework
- d.CRISP-DM
The correct answer is: Drivetrain Approach
Question 3
You are presented with the following data structure:
1 2 3 4 5 6 7 8 |
|
What type of database allows this sort of structure to be stored and retrieved using a non-structured querying language?
- a.MongoDB
- b.RDBMS
- c.OLAP
- d.Oracle
The correct answer is: MongoDB
Question 4
Your boss asks you about Hadoop Distributed File System, wanting to know how you would expand the storage capacity of your current Hadoop environment. You tell him you can:
- a.Add more servers to expand the storage capabilities.
- b.Start a new instance of Hadoop to handle the overflow.
- c.You canʼt add more space given the limitations of the program
- d.Add an additional instance of Hadoop to your cluster configuration
The correct answer is: Add more servers to expand the storage capabilities.
Question 5
Name the default engine used in new deployments of MongoDB for versions 3.2 or higher.
- a.SQL DBMS
- b.Snowflake
- c.WiredTiger
- d.MMAPv1
The correct answer is: WiredTiger
Question 6
Hadoop was started with Doug Cutting and Mike Cafarella in the year __ when they both started to work on ___ project.
- a.2002, Apache Nutch
- b.2001, Google NFS
- c.2000, Hadoop MapReduce
- d.2003, Hadoop YARN
The correct answer is: 2002, Apache Nutch
Question 7
Key-Value Storage, Document Storage, Graph Storage are types of:
- a.NoSQL
- b.DB2
- c.Oracle
- d.SQL Server
The correct answer is: NoSQL
Question 8
You are working on a Big Data analytics team and need to select an application that will allow for faster processing of a large amount of data. You want that processing to happen mostly in memory using a computing engine that allows for running most of those computations in memory. Which Big data tool is best suited in this context?
Select one
- a.Apache Spark
- b.HADOOP
- c.MangoDB
- d.Apache Kafka
The correct answer is: Apache Spark
Question 9
The following are cluster mangers except:
Select one: * a.YARN * b.Standalone deployment * c.Groovy * d.Apache Mesos
The correct answer is: Groovy
Question 10
Of the alternatives to Hadoop noted below, which claims to be more flexible and removes the limits on the number of files processed?
- a.CEPH
- b.ZFS
- c.HBase
- d.Cassandra
The correct answer is: CEPH