Sie sind auf Seite 1von 17

Q1. Perform the following operations on Sales.csv file.

(a)Consider the Sales.csv file and summarize the total number of customers in each country.

(b) Calculate the cumulative sales of all products after each month.

1|Page

Name:ARSALAN AHMED KHAN


Enrol. No. 00111104413

(c) Calculate the monthly % increase or % decrease of sales of all products in each country.

(d) Summarize the date to show which product has maximum sales over all the products in all
the countries

2|Page

Name:ARSALAN AHMED KHAN


Enrol. No. 00111104413

Q2. Perform the following operations on Sales.csv file


(a) Compare the Product which has maximum sales with each other product as a percentage

(b) Summarize the sales of all products over all years broken down by each quarter then by each
month.(subtotals).

3|Page

Name:ARSALAN AHMED KHAN


Enrol. No. 00111104413

(c) Compare the Product which has maximum sales with each other product as a percentage
Row Labels
Classic Cars
Motorcycles
Planes
Ships
Trains
Trucks and
Buses
Vintage Cars
Grand Total

Max of
SALES
100.00
%
99.05%
83.88%
58.00%
74.80%
73.69%
117.35
%

Q3. Perform the following operations on FoodSales.csv


(b) Summarize the data to show which sales person has shown maximum sales of all products in
each region.

(c) Given the cost and selling price of each product, find out the maximum profit gained by all
products over each region by each month.

4|Page

Name:ARSALAN AHMED KHAN


Enrol. No. 00111104413

Q6. Convert a .cvs file it into Relational database table after filtering records
Step 1: Open Pentaho Data Integration tool.
Step 2: Go to Tranformation on left side under view tab.
Step 3: Select input option.
Step 4: Select Text file Input, drag and drop it to the right blank side.
Step 5: Click on text input file.
Step 6: Go to browse on far right.
Step 7: Select data intrgration>>samples>>transformations>>files>>sales_data.
5|Page

Name:ARSALAN AHMED KHAN


Enrol. No. 00111104413

Step 8: click on add.

Step 9: Go to content tab.


Step 10: Change the separator from ; to ,
Step 11: go to fields tab.
Step 12: select get fields option (100 sample lines).

6|Page

Name:ARSALAN AHMED KHAN


Enrol. No. 00111104413

Step 13: select preview rows.


Step 14: click on ok.
Step 15: Select Flow from left in design view.
Step 16: Select filter rows, drag and drop it on the right side.
Step 17: Press shift key, click on text input file and then on the filter rows.
An arrow from text input file to filter rows comes up.
Step 18: Click on filter rows.
Step 19: Go to <field> under condition option.
Step 20: select country.
Step 21: Go to = option.
Step 22: select is not null.
Step 23: click ok.

7|Page

Name:ARSALAN AHMED KHAN


Enrol. No. 00111104413

Step 24: Select output on left of the design view.


Step 25: Select table output, drag and drop on right side.
Step 26: Press shift key, click on filter rows and then on table output.
Step 27: select result is true.
An arrow from filter rows to table output comes up

8|Page

Name:ARSALAN AHMED KHAN


Enrol. No. 00111104413

Step 28: Go to control panel>>administrative tools>>Data sources (ODBC).


Step 29: click on add.
Step 30: Select Oracle in MS SQL Server and then finish

Step 31: Give Data Source Name: VIBHOR-PC, TNS Service Name: MS Sql Server, User ID:
sa
Step 32: Click on test connection.
9|Page

Name:ARSALAN AHMED KHAN


Enrol. No. 00111104413

Give Service name: MS Sql Server, User name: vm, Password: vibhor
Step 33: click ok.
Step 34: Go to table output in pentaho and click new.
Step 35: Select oracle in connection type, ODBC in access.
Give DSN name: bciit, User name: vm, password: vibhor
Step 36: click test.
Step 37: Give any Target table and select Truncate table.
Step 38: Click on SQL.
Step 39: Click on execute.

Q9. Perform classification of weather data using Weka explorer and show the result in
graph. Show in what weather conditions match should be played
Step1: Go to weka tool and execute .jar file
A GUI chooser will open

10 | P a g e

Name:ARSALAN AHMED KHAN


Enrol. No. 00111104413

Step2: Click on Explorer


Weka explorer will open .
Step 3: click on open file > then open folder named DATA and select weather.arff file
This will open

Step 4: Now click on classify > choose> trees> select J48


Step5: click on Start

11 | P a g e

Name:ARSALAN AHMED KHAN


Enrol. No. 00111104413

Start will show the details of the file..


Step 6: Right click on tree J48 (part highlighted in blue) and select visualize tree.
This will show a tree format of data..

Step 7: Right click on tree J48 (part highlighted in blue) and select visualize classifier errors.
Move jitter towards right for better view
12 | P a g e

Name:ARSALAN AHMED KHAN


Enrol. No. 00111104413

Step8: select x: outlook


And y: play

Now there are 4 scenerios


Red square
Blue square
Red cross
Blue cross
Each of this will tell the weather conditions and will tell whether the match will be played or not
Step9: Click on eact of them one by one to know the status

The above are the result for 4 scenerios. Telling that match will be played or not.
Q10Perform Associations Rule mining on the given data in Weka Explorer.
13 | P a g e

Name:ARSALAN AHMED KHAN


Enrol. No. 00111104413

Step1: Go to weka tool and execute .jar file


A GUI chooser will open

Step2: Click on Explorer


Weka explorer will open .
Open a file from data
Step 3: Select associate tab.
Step 4: Select choose under associator.
Step 5: Select Associations>>filteredAssociator

Step 6: click on start


14 | P a g e

Name:ARSALAN AHMED KHAN


Enrol. No. 00111104413

Q11. Perform k-means clustering on the given data to show different clusters.
Step1: Go to weka tool and execute .jar file
A GUI chooser will open

Step2: Click on Explorer


Weka explorer will open .
Open a file from data..
Step 3: Select clustering tab.
Step 4: Click choose under cluster option.
Step 5: Select simpleKmeans
15 | P a g e

Name:ARSALAN AHMED KHAN


Enrol. No. 00111104413

Step 6: click on start.

Step 7: right click on 2:00:38 - SimpleKMeans under result list.


Step 8: select visualize cluster assignments.

16 | P a g e

Name:ARSALAN AHMED KHAN


Enrol. No. 00111104413

17 | P a g e

Name:ARSALAN AHMED KHAN


Enrol. No. 00111104413

Das könnte Ihnen auch gefallen