EDA
1. Exploratory data analysis should be used to
A. help you search for patterns in your data.
B. spot serious defects in your data that may warrant taking corrective action.
C. help determine whether assumptions of the inferential tests you intend to use may have been violated.
D. all of the above
3. To show a functional relationship between your independent and dependent variables, the graph of choice
would be a
A. line graph. B. histogram. C. pie chart D. scatterplot.
5. In which of the following situations would you not want to use a Pearson correlation coefficient?
A. when the relationship between variables is nonlinear
B. when both of your variables are measured on at least an interval scale
C. when the variances of your distributions are very similar
D. all of the above
6. A curve showing a functional relationship that starts off flat, becomes progressively steeper, and shows a single
direction of change is
A. negatively accelerated. B. monotonic
C. positively accelerated. D. both B and C
7. A ________ distribution has most scores collected about the center and is symmetrical about its midpoint.
A. functional B. normal C. monotonic D. bimodal
9. A functional graph that shows a uniformly increasing or decreasing functional relationship is said to be
A. monotonic. B. negatively skewed. C. normal. D. positively skewed.
10. If you have discrete group data, such as months of the year, age groups, shoe sizes, or animals, which chart is
best for presenting it?
USM’s Shriram Mantri Vidyanidhi Info Tech Academy
PG DBDA Feb 19 Data Visualization Question Bank
11. Which graph is better used when data needs to be classified or categorized?
A. stack bar B. Pie chart C. histogram D. None of the above
14. Which plot shows you the distribution of the target variable?
A. histogram B. pie chart C. bar D. Pareto chart
15. True/False: The quantile-quantile (Q-Q) plot is a graphical technique for determining whether two data sets come from
populations with a common distribution.
A. True B. False
16. True/False: In a boxplot, the middle line inside the box displays the mean of the distribution.
A. True B. False
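Explanation (illustrative sketch): the line inside the box marks the median (the second quartile), not the mean. The short Python example below uses invented sample values and only the standard library to show how far the two can drift apart on skewed data.

```python
import statistics

# Hypothetical right-skewed sample (values invented for illustration).
scores = [1, 2, 2, 3, 3, 4, 50]

median = statistics.median(scores)  # what the middle line of a boxplot marks
mean = statistics.mean(scores)      # pulled upward by the outlier 50

# Quartiles give the edges of the box (Q1, Q3) and its middle line (Q2).
q1, q2, q3 = statistics.quantiles(scores, n=4)

print(f"median={median}, mean={mean:.2f}, Q1={q1}, Q2={q2}, Q3={q3}")
```

Here the mean lands above the upper edge of the box, while the median stays inside it.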
17. True/False: For numeric vs. numeric data, a scatterplot is the best representation.
A. True B. False
18. True/False: For bivariate data, a correlogram (corr plot) shows the correlation between each pair of variables.
A. True B. False
19. True/False: The height of the bar corresponds to the value of each category.
A. True B. False
20. True/False: The height of the resulting stacked bar shows the combined result of the groups.
A. True B. False
3) Pandas makes it easy to handle missing data in floating-point as well as non-floating-point data.
A. True B. False
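Explanation (illustrative sketch): the example below assumes pandas and NumPy are installed and uses a made-up DataFrame with one float column and one string column. It shows the same `isna` and `fillna` calls working on both kinds of data.

```python
import numpy as np
import pandas as pd

# Hypothetical frame: one float column and one string column.
df = pd.DataFrame({
    "price": [1.5, np.nan, 3.0],
    "label": ["a", None, "c"],
})

missing = df.isna()         # True wherever a value is missing
per_column = missing.sum()  # number of missing values in each column

# Fill each column with a replacement value suited to its type.
filled = df.fillna({"price": 0.0, "label": "unknown"})

print(per_column)
print(filled)
```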
7) Pivot table can aggregate the data and summarize it by grouping the columns
A. True B. False
8) _______ is a convenient method for combining the columns of two potentially differently indexed DataFrames
into a single result DataFrame.
A. Concatenate B. Merge C. Join D. Collaborate
9) Dimensions should match along the axis you are _______ on.
A. concatenating B. merging C. joining D. collaborating
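Explanation (illustrative sketch): the two operations in questions 8 and 9 can be tried on small frames. The frames, index labels, and column names below are invented for illustration, and pandas is assumed to be installed.

```python
import pandas as pd

left = pd.DataFrame({"a": [1, 2, 3]}, index=["x", "y", "z"])
right = pd.DataFrame({"b": [10, 20]}, index=["y", "w"])

# join aligns the two frames on their index labels. The default is a left
# join, so labels missing from `right` get NaN in column "b".
joined = left.join(right)

# concat along axis=0 stacks rows; columns are matched by name.
more_rows = pd.DataFrame({"a": [4, 5]}, index=["p", "q"])
stacked = pd.concat([left, more_rows], axis=0)

print(joined)
print(stacked)
```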
10) A Series can have axis labels and can be indexed by label.
A. True B. False
18) _______ is a visualisation library that provides a high-level interface to draw attractive statistical graphics.
A. Scrapy B. Seaborn C. Airborn D. Statistica
2. Point out the correct combination with regards to kind keyword for graph plotting:
A. ‘hist’ for histogram B. ‘box’ for boxplot
C. ‘area’ for area plots D. all of the Mentioned
Explanation: The kind keyword argument of plot() accepts a handful of values for plots other than the default Line
plot.
4. You can create a scatter plot matrix using the __________ method in pandas.tools.plotting (pandas.plotting in
current pandas releases).
A. sca_matrix B. scatter_matrix C. DataFrame.plot D. all of the Mentioned
Explanation: You can create density plots using the Series/DataFrame.plot.
5. Point out the wrong combination with regards to kind keyword for graph plotting:
A. ‘scatter’ for scatter plots B. ‘kde’ for hexagonal bin plots
C. ‘pie’ for pie plots D. none of the Mentioned
Explanation: kde is used for density plots.
6. Which of the following plots is used to check whether a data set or time series is random?
A. Lag B. Random C. Lead D. None of the Mentioned
Explanation: Random data should not exhibit any structure in the lag plot.
8. Which of the following plots is often used for checking randomness in a time series?
A. Autocausation B. Autorank C. Autocorrelation D. None of the Mentioned
Explanation: If a time series is random, such autocorrelations should be near zero for any and all time-lag
separations.
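The idea can be checked numerically. The sketch below is a minimal standard-library implementation of the sample autocorrelation (the function name and the test series are assumptions made for illustration). It compares a seeded random series with a steadily increasing one.

```python
import random


def autocorrelation(series, lag):
    """Sample autocorrelation of `series` at the given positive lag."""
    n = len(series)
    mean = sum(series) / n
    variance = sum((x - mean) ** 2 for x in series)
    covariance = sum(
        (series[t] - mean) * (series[t + lag] - mean) for t in range(n - lag)
    )
    return covariance / variance


rng = random.Random(0)  # fixed seed so the run is repeatable
noise = [rng.random() for _ in range(5000)]  # no structure over time
trend = [float(t) for t in range(100)]       # strong structure over time

noise_r1 = autocorrelation(noise, 1)
trend_r1 = autocorrelation(trend, 1)

print(f"random series, lag 1:   {noise_r1:.3f}")
print(f"trending series, lag 1: {trend_r1:.3f}")
```

The random series gives a value close to zero, while the trending series gives a value close to one.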
Tableau
1. Tableau treats date
A. Specially by defining hierarchy for user
B. Treats date as any other field
C. Converts date to number
D. None of the above
4. Tableau allows
A. Using data from disparate sources using blending as well as joining
B. Using data from disparate sources using only blending
C. Does not work with disparate sources
D. None of the above
8. Tableau
A. Allows storing metadata
B. Allows storing metadata, renaming fields, and pivoting the data
C. Does not allow storing metadata
D. None of the above
9. Tableau has
A. Stories which allow better communication
B. Dashboards and stories and together they can be used for communication
C. No good features to communicate data
D. None of the above
16. Is it possible to deploy a URL action on a dashboard object to open a Web Page within a dashboard rather than
opening the system’s web browser?
A. True, with the use of Tableau Server
B. True, with the use of a Web Page object
C. False, not possible
D. True, requires a plug-in
True, with the use of a Web Page object it is possible to deploy a URL action on a dashboard object to open a web
page within a dashboard rather than opening the system’s web browser.
17. The Highlighting action can be disabled for the entire workbook.
A. True B. False
From the toolbar the Highlighting action can be disabled for the entire workbook.
18. A sheet cannot be used within a story directly. Either sheets should be used within a dashboard, or a
dashboard should be used within a story.
A. True B. False
A sheet can be used within a story directly.
20. Is it possible to use measures in the same view multiple times (e.g. SUM of the measure and AVG of the
measure)?
A. No B. Yes
Yes, measures can be used multiple times in the same view.
24. The line shown in the image below is a Reference Line. True or False?
A. true B. false
The line shown in the image is a Trend Line.
27. The icon associated with the field that has been grouped is a ______________.
A. Paper Clip B. Set C. Hash D. Equal To
The icon associated with the field that has been grouped is a paper clip.
28. In the West region, which state’s sales fall within the Reference Band starting from average sales of that region
till median of sales? (Perform the below questions in Tableau 9.0 and connect to the Saved Sample – Superstore
dataset)
A. California B. Colorado C. Montana D. New Mexico
29. Create a simple bar chart with Region and Total Expenses from the Sample- Superstore dataset and Sample -
Coffee Chain dataset, respectively. (Establish the link on State). Identify the budgeted profit for the region having
the 2nd highest total expenditure. (Connect to the Sample- Coffee Chain access file using the CoffeeChain Query
table)
A. 84850 B. 87680 C. 80231 D. 84823
30. In 2012, what is the percent contribution of sales for Decaf in the East market? (Perform all the questions in
Tableau 9.0 and connect to the Saved Sample-Superstore dataset)
A. 48.942% B. 54.765% C. 51.231% D. 55.875%
48.942% is the percent contribution of sales of Decaf in 2012 in the East market.
31. In 2013, what is the percentage of total profit for Caffe Mocha falling under Major Market (Market
Size)?(Perform all the questions in Tableau 9.0 and connect to the Saved Sample-Superstore dataset)
A. 60% B. 45% C. 58% D. 55%
In 2013, the percentage of total profit for Caffe Mocha falling under Major Market is 55%.
32. Create a heat map for Product Type, State, and Profit. Which state in the East market has the lowest profit for
Espresso?(Use the Sample- Coffee Chain dataset for the following questions)
A. Florida B. Connecticut C. New York D. New Hampshire
New Hampshire has the lowest profit for Espresso, in the East market.
33. In 2012, what is the difference in budget profit, in Q3 from the previous quarter for major market (Market
Size)? (Use the Sample- Coffee Chain dataset for the following questions)
A. 630 B. -287 C. 667 D. 654
34. In which month did the running sales cross $30,000 for Decaf in Colorado and Florida? (Use the Sample- Coffee
Chain dataset for the following questions)
A. November 2013 B. September 2013 C. May 2013 D. December 2013
35. Create a bar chart with Product Type, Product, and Profit. Identify which of the following
products fall below the overall 99.9% Confidence Interval Distribution (Table across)? (Use the Sample- Coffee
Chain dataset for the following questions)
A. Decaf Espresso B. Green Tea C. Caffe Latte D. Regular Espresso
36. Using quartiles, identify which of the following Espresso product has the highest distribution of sales? (Use the
Sample- Coffee Chain dataset for the following questions)
37. In 2013, identify the state with the highest profit in the West market? (Use the Sample- Coffee Chain dataset
for the following questions)
A. Utah B. Nevada C. California D. Washington
38. Create a scatter plot with State, Sales, and Profit. Identify the Trend Line with ‘R-Squared’ value between 0.7 to
0.8? (Use the Sample- Coffee Chain dataset for the following questions)
A. Linear Trend Line B. Logarithmic Trend Line
C. Exponential Trend Line D. Polynomial Trend Line with Degree 2
The Trend Line with ‘R-Squared’ value between 0.7 to 0.8 is a Polynomial Trend Line with Degree 2.
39. Identify the total expenses to sales ratio of the state with the lowest profit. (Use the Sample- Coffee Chain
dataset for the following questions)
A. 47.31% B. 45.58% C. 41.98% D. 40.78%
40. Create a Combined Field with Product and State. Identify the highest selling product and its state. (Use the
Sample- Coffee Chain dataset for the following questions)
A. Colombian, California B. Colombian, Texas
C. Lemon, Nevada D. Darjeeling, Iowa
41. What is the contribution of tea to the overall Profit in 2012? (Use the Sample- Coffee Chain dataset for the
following questions)
A. 24.323% B. 22.664% C. 20.416% D. 21.765%
Tableau Multiple Choice Questions For Experienced
42. What is the average profit ratio for all the products starting with C? (Use the Sample- Coffee Chain dataset for
the following questions)
A. 30% B. 25% C. 33% D. 20%
43. What is the distinct count of area codes for the state with the lowest budget margin in small markets? (Use the
Sample- Coffee Chain dataset for the following questions)
A. 3 B. 1 C. 2 D. 6
44. Which product type does not have any of its product within the Top 5 Products by sales? (Use the Sample-
Coffee Chain dataset for the following questions)
A. Tea B. Espresso C. Coffee D. Herbal Tea
45. In the Central region, the Top 5 Products by sales contributed _____ % of the total expenditure. (Use the
Sample- Coffee Chain dataset for the following questions)
A. 48.54% B. 51.66% C. 69.21% D. 54.02%
In the Central region, the Top 5 Products by sales contributed 54.02 % of the total expenditure.
46. Trend Lines can only be used with numeric or date fields.
A. True B. False
47. The best trend model for your view would be the one with?
A. R-Squared value closest to 1 B. P-Value more than 1
C. R-Squared value greater than 1 D. R-Squared value equal to P-Value
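Explanation (illustrative sketch): R-squared measures the share of variance that the fitted line explains, so a least-squares fit cannot score above 1. The helper below is a minimal standard-library version for a straight-line fit. The function name and data points are invented for illustration and are not Tableau's own computation.

```python
def r_squared(xs, ys):
    """R-squared of an ordinary least-squares straight-line fit to (xs, ys)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    # Residual and total sums of squares.
    ss_res = sum((y - (intercept + slope * x)) ** 2 for x, y in zip(xs, ys))
    ss_tot = sum((y - mean_y) ** 2 for y in ys)
    return 1 - ss_res / ss_tot


xs = [1, 2, 3, 4, 5]
exact = [2 * x + 1 for x in xs]     # points lying exactly on a line
noisy = [3.0, 4.2, 8.1, 8.0, 12.5]  # invented values with some scatter

r2_exact = r_squared(xs, exact)
r2_noisy = r_squared(xs, noisy)

print(f"exact line: {r2_exact:.3f}")
print(f"noisy data: {r2_noisy:.3f}")
```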
52. The default join type in case of Blended data sources is?
A. Cross Join B. Inner Join C. Left outer Join D. Full outer Join
55. Using GROUP BY ............ has the effect of removing duplicates from the data.
A. without order by B. with aggregates C. with order by D. without aggregates
57. The JOIN which returns all the records from the right table in conjunction with the matching records from the
left table and if there are no matching values in the left table, it returns NULL. Which is this JOIN?
A. CROSS JOIN B. LEFT Join C. Full OUTER JOIN D. Right JOIN
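Explanation (illustrative sketch): the behaviour described in question 57 can be reproduced with Python's built-in sqlite3 module. This is an assumption made for convenience. The tables and rows are invented, and the query is written as a LEFT JOIN with the tables swapped, which returns the same rows as a RIGHT JOIN and runs on SQLite versions that lack RIGHT JOIN.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE emp  (emp_name TEXT, dept_id INTEGER);    -- left table
    CREATE TABLE dept (dept_id INTEGER, dept_name TEXT);   -- right table
    INSERT INTO emp  VALUES ('Asha', 1);
    INSERT INTO dept VALUES (1, 'Sales'), (2, 'Audit');
""")

# Same result as: SELECT ... FROM emp RIGHT JOIN dept ON emp.dept_id = dept.dept_id
# Every row of dept is kept; emp columns are NULL where nothing matches.
rows = conn.execute("""
    SELECT emp.emp_name, dept.dept_name
    FROM dept
    LEFT JOIN emp ON emp.dept_id = dept.dept_id
    ORDER BY dept.dept_id
""").fetchall()

print(rows)
conn.close()
```

The 'Audit' department has no employee, so its employee name comes back as NULL (None in Python).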
58. GROUP BY ALL generates all possible groups - even those that do not meet the query's search criteria.
A. True B. False
59. You can combine tables in a partitioned view by using a Union All statement that causes the data from the
separate tables to appear as if they were one table. These tables in a SELECT statement of the view must adhere
to some restrictions like: A table can appear . . . . . . as a part of Union All statement.
A. as many times as possible B. only twice C. only once D. only thrice
62. Having clause is processed after the GROUP BY clause and any aggregate functions.
A. True B. False
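Explanation (illustrative sketch): because HAVING runs after grouping, it can filter on an aggregate value. The example uses Python's built-in sqlite3 module rather than SQL Server, with an invented table.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE orders (region TEXT, amount INTEGER);
    INSERT INTO orders VALUES ('East', 10), ('East', 30), ('West', 5);
""")

# WHERE filters rows before grouping; HAVING filters the groups afterwards,
# so it can refer to aggregates such as SUM(amount).
rows = conn.execute("""
    SELECT region, SUM(amount) AS total
    FROM orders
    GROUP BY region
    HAVING SUM(amount) > 20
""").fetchall()

print(rows)
conn.close()
```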
64. Related to UNION ALL which one do you think is correct syntax: A, B or both
A. Select * from B
Union all
Select * from C
Order by ID desc
B. Select * from B
Order by ID desc
Union all
Select * from C
Order by ID desc
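Explanation (illustrative sketch): both forms can be tried against an in-memory SQLite database, used here as a stand-in for SQL Server, with invented tables. SQLite accepts a single ORDER BY after the last SELECT and rejects an ORDER BY placed before UNION ALL.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE b (id INTEGER);
    CREATE TABLE c (id INTEGER);
    INSERT INTO b VALUES (1), (3);
    INSERT INTO c VALUES (2);
""")

# Form A: one ORDER BY at the end sorts the combined result.
form_a = conn.execute(
    "SELECT * FROM b UNION ALL SELECT * FROM c ORDER BY id DESC"
).fetchall()

# Form B: an ORDER BY before UNION ALL raises an error in SQLite.
try:
    conn.execute(
        "SELECT * FROM b ORDER BY id DESC "
        "UNION ALL SELECT * FROM c ORDER BY id DESC"
    )
    form_b_error = None
except sqlite3.OperationalError as exc:
    form_b_error = str(exc)

print(form_a)
print(form_b_error)
conn.close()
```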
65. Which one is the correct syntax for a WHERE clause in SQL Server?
A. SELECT WHERE "Condition" Col1, Col2 FROM "Table" ;
B. SELECT Col1, Col2 FROM "Table" WHERE "condition";
C. SELECT "Condition" Col1, Col2 FROM "Table" WHERE;
D. None of these
68. If you SELECT attributes and use an aggregate function, you must GROUP BY the non-aggregate attributes.
A. True B. False
69. Which type of Inner Join fetches result with redundant data?
A. Left Outer B. Equi C. Cross D. IN
70. What will be the result of running the below UNION ALL query:
A. Select Null B. Select Null C. Union all
72. You want all dates when any employee was hired. Multiple employees were hired on the same date and you
want to see the date only once.
Query - 1
Select distinct hiredate
From hr.employee
Order by hiredate;
Query - 2
Select hiredate
From hr.employees
Group by hiredate
Order by hiredate;
Which of the above queries is valid?
A. Both B. Query – 2 C. Query – 1
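Explanation (illustrative sketch): the two query shapes can be compared with Python's built-in sqlite3 module. The table name `employees` and its rows are invented, and no `hr` schema is used because SQLite has none by default.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE employees (name TEXT, hiredate TEXT);
    INSERT INTO employees VALUES
        ('A', '2019-01-05'),
        ('B', '2019-01-05'),
        ('C', '2019-02-10');
""")

# Shape of Query 1: DISTINCT removes repeated dates.
distinct_rows = conn.execute(
    "SELECT DISTINCT hiredate FROM employees ORDER BY hiredate"
).fetchall()

# Shape of Query 2: GROUP BY without aggregates yields one row per date.
grouped_rows = conn.execute(
    "SELECT hiredate FROM employees GROUP BY hiredate ORDER BY hiredate"
).fetchall()

print(distinct_rows)
print(grouped_rows)
conn.close()
```

Both statements return each hire date once.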
74. For the purposes of ............, null values are considered equal to other nulls and are grouped together into a
single result row.
A. Having B. Group By C. None of these D. Both Having & Group By
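Explanation (illustrative sketch): the grouping of NULLs can be observed with Python's built-in sqlite3 module, used here in place of SQL Server, with an invented table. The rows whose `manager` is NULL collapse into a single group, even though a direct comparison `NULL = NULL` does not evaluate to true.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE staff (name TEXT, manager TEXT);
    INSERT INTO staff VALUES ('A', NULL), ('B', NULL), ('C', 'Dev');
""")

# GROUP BY puts all NULL managers into one result row.
groups = conn.execute("""
    SELECT manager, COUNT(*)
    FROM staff
    GROUP BY manager
    ORDER BY manager
""").fetchall()

# A plain equality test between NULLs yields NULL, not true.
null_equals_null = conn.execute("SELECT NULL = NULL").fetchone()[0]

print(groups)
print(null_equals_null)
conn.close()
```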
77. Is there any limit for number of predicates/conditions to be added in a Where clause?
A. False B. True
78. The query below is run in SQL Server 2012. Is it valid or invalid?
Select count(*) as X
from Table_Name
Group by ()
A. Valid B. Invalid
79. In the context of MS SQL SERVER, with the exception of ............ column(s), any column can participate in the
GROUP BY clause.
A. ntext B. bit C. text D. All of these E. image
82. The sequence of the columns in a GROUP BY clause has no effect in the ordering of the output.
A. True B. False