CS240B: ADVANCED DATA BASES and KNOWLEDGE BASES


CS240B

Tuesday and Thursday 14:00--15:50

Fall 2018


Syllabus Supporting Data Stream Mining Applications


Instructor: Carlo Zaniolo

Office Hours: Tuesdays and Thursdays, 16:00--17:00.

First Assignment (Due on Tuesday, Oct. 9). Some answers for 2.1, 2.2

Study the following papers:

B. Babcock, S. Babu, M. Datar, R. Motwani, and J. Widom: Models and Issues in Data Stream Systems. PODS 2002.

SQL-MR.

New SQL OLAP Functions for everyone, by K.M. Guion.

Task 1.1: Write a report of about four pages on SQL-MR, focusing on the things that you can do with SQL-MR that cannot be done in SQL-TS.

Task 1.2: Some of the new OLAP functions support windows; others do not. List the functions in each of the two groups and, for those in the second group, suggest a general implementation for (i) unlimited preceding windows, and (ii) physical windows.

-------------------------------------------------------------------------------------------------

Second Assignment (Due on Tuesday, Oct. 16). Answers for 2.1 and 2.2.

Study the following papers:

Y. Bai, H. Thakkar, C. Luo, H. Wang, and C. Zaniolo: A Data Stream Language and System Designed for Power and Extensibility. CIKM'06, Arlington, Virginia, USA.

H. Thakkar, N. Laptev, H. Mousavi, B. Mozafari, V. Russo, and C. Zaniolo: SMM: A Data Stream Management System for Knowledge Discovery. ICDE 2011.

Task 2.1: Using a syntax based on that of the notes and the two references above, express a user-defined aggregate d_count that performs an exact count of the distinct values in a window on a data stream. Your window aggregate could, e.g., be called as follows:

SELECT col_name1, d_count(col_name2) OVER (ROWS 99999 PRECEDING)
FROM my_stream
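
To make the computation behind such an aggregate concrete, here is a minimal Python sketch of an exact distinct count over a ROWS n PRECEDING window. It is not written in the UDA syntax that the task asks for; the class name WindowDistinctCount and the method names iterate and expire are illustrative assumptions. The idea is to keep one counter per value, so that a value stops being counted only when its last copy leaves the window.

```python
from collections import Counter, deque


class WindowDistinctCount:
    """Exact distinct count over the current row plus `preceding` earlier rows.

    Hypothetical illustration: names and structure are assumptions,
    not the syntax used in the course notes.
    """

    def __init__(self, preceding):
        self.size = preceding + 1   # ROWS n PRECEDING covers n + 1 rows
        self.window = deque()       # values currently inside the window
        self.counts = Counter()     # value -> number of copies in the window

    def iterate(self, value):
        """Add a new row and return the distinct count for its window."""
        self.window.append(value)
        self.counts[value] += 1
        if len(self.window) > self.size:
            self.expire(self.window.popleft())
        return len(self.counts)

    def expire(self, old_value):
        """Remove one copy of a value that slid out of the window."""
        self.counts[old_value] -= 1
        if self.counts[old_value] == 0:
            del self.counts[old_value]


if __name__ == "__main__":
    agg = WindowDistinctCount(preceding=2)
    print([agg.iterate(v) for v in [1, 1, 2, 3, 3, 3]])
```

With preceding=2 and the input 1, 1, 2, 3, 3, 3, the successive results are 1, 1, 2, 3, 2, 1. Memory grows with the window size, which is the price of an exact answer.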

Task 2.2: Using the same syntax, write a UDA that implements the RANK function (without a window or with an unlimited-preceding window). Then extend this definition to support windows.

------------------------------------------------------------------------------------------------

Third Assignment (Due on Tuesday, Oct. 23). Answers for 3.1, 3.2, and 3.3.

Yijian Bai and Carlo Zaniolo: Minimizing Latency and Memory in DSMS: A Unified Approach to Quasi-Optimal Scheduling. The Second International Workshop on Scalable Stream Processing Systems, March 29, 2008, Nantes, France.

Y. Bai, H. Thakkar, H. Wang, and C. Zaniolo: Time-stamp Management and Query Execution in Data Stream Management Systems. IEEE Internet Computing, 12(6): 2008.

Jure Leskovec, Anand Rajaraman, Jeff Ullman: Mining Data Streams. The Stanford University InfoLab. You only need to study Sections 4.3 and 4.4, but the rest is also interesting. http://infolab.stanford.edu/~ullman/mmds/ch4.pdf

Task 3.1: We showed that binary operators, such as unions and joins, are subject to the idle-waiting problem. But this problem can also occur in some unary operators. Suppose, for instance, that an aggregate is called on a time-stamped tumble window (or on a time-stamped window with slides). Discuss this situation, its idle-waiting problem, and how it can be solved (using concepts and notation from the paper by Bai and Zaniolo above).

Task 3.2: Propose a simple generalization of the chain algorithm to minimize memory for

the following situations:

(A) Two or more simple paths without forks,

Task 3.3: Write the window UDA to compute the standard deviation on a data stream. Explain the math formulas you use in the expire step of the window UDA; you might want to consult textbooks or the web to find the best formula.
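
One possible set of formulas can be sketched in Python as follows. This is only an illustration under stated assumptions, not the UDA the task asks for: it uses a Welford-style update of the running mean and of the sum of squared deviations (M2), and inverts that update when a value leaves the window. The names WindowStdDev, iterate, and expire are hypothetical, and the result is the population standard deviation (dividing by n).

```python
import math
from collections import deque


class WindowStdDev:
    """Population standard deviation over a ROWS n PRECEDING window.

    Hypothetical illustration using Welford-style incremental updates.
    """

    def __init__(self, preceding):
        self.size = preceding + 1   # rows covered by the window
        self.window = deque()       # values currently in the window
        self.n = 0                  # number of values in the window
        self.mean = 0.0             # running mean
        self.m2 = 0.0               # running sum of squared deviations

    def iterate(self, x):
        """Add x, expire the oldest value if needed, return the std. dev."""
        self.window.append(x)
        self.n += 1
        delta = x - self.mean
        self.mean += delta / self.n
        self.m2 += delta * (x - self.mean)
        if len(self.window) > self.size:
            self.expire(self.window.popleft())
        return math.sqrt(self.m2 / self.n)

    def expire(self, x):
        """Undo the contribution of x (the inverse of the update above).

        new_mean = old_mean - (x - old_mean) / (n - 1)
        new_m2   = old_m2   - (x - old_mean) * (x - new_mean)
        """
        if self.n <= 1:
            self.n, self.mean, self.m2 = 0, 0.0, 0.0
            return
        delta = x - self.mean
        self.n -= 1
        self.mean -= delta / self.n
        self.m2 -= delta * (x - self.mean)
        self.m2 = max(self.m2, 0.0)  # guard against tiny negative round-off


if __name__ == "__main__":
    agg = WindowStdDev(preceding=2)
    for value in [1, 2, 3, 4]:
        print(agg.iterate(value))
```

The simpler alternative of keeping the sum and the sum of squares also supports expiration by subtraction, but it can lose precision when the values are large relative to their spread, which is why the sketch above tracks deviations from the mean instead.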

-------------------------------------------------------------------------------------------------

Fourth Assignment (Due on Tuesday, Oct. 30). Answers for 4.1 and 4.2.

Study Chapter 3 (A Survey of Classification Methods in Data Streams) from Data Streams: Models and Algorithms, Charu C. Aggarwal, editor, Kluwer Academic Publishers.

Chapter 3 is at pages 39-60 of the following pdf document.

N. Laptev et al.: Extending Relational Query Languages for Data Streams. pp. 361--386 of Data Stream Management: Processing High-Speed Data Streams. Springer, 2017.

Task 4.1: Express the Flajolet-Martin distinct_count sketch as a user-defined aggregate named dcount_sketch, to be called in the same way as d_count. You can assume that you have available a function LmostbitH(X) that returns K, where position K contains a 1 and all the positions to its right are zeros, in the value returned by a randomizing hash function H(X).
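
The logic of such a sketch can be illustrated in Python as follows. This is a hedged sketch, not the UDA itself: it assumes positions are numbered from 0 starting at the rightmost bit, uses BLAKE2b from hashlib as a stand-in for the randomizing hash H, keeps a single bitmap with no expiration (so it corresponds to an unlimited-preceding window), and applies the correction factor 0.77351 from Flajolet and Martin's analysis. The names lowest_set_bit, lmostbit_h, and DCountSketch are hypothetical.

```python
import hashlib

PHI = 0.77351  # correction factor from Flajolet and Martin's analysis


def lowest_set_bit(h):
    """Position (0 = rightmost) of the lowest 1 bit of a positive integer."""
    return (h & -h).bit_length() - 1


def lmostbit_h(x, bits=32):
    """Stand-in for LmostbitH(X); BLAKE2b plays the role of H (assumption)."""
    digest = hashlib.blake2b(repr(x).encode(), digest_size=8).digest()
    h = int.from_bytes(digest, "big") & ((1 << bits) - 1)
    # An all-zero hash is rare; map it to the top position by convention.
    return lowest_set_bit(h) if h else bits - 1


class DCountSketch:
    """Single-bitmap Flajolet-Martin estimate of the number of distinct values."""

    def __init__(self):
        self.bitmap = 0

    def iterate(self, x):
        # Duplicates hash to the same position, so they never change the bitmap.
        self.bitmap |= 1 << lmostbit_h(x)

    def terminate(self):
        # r = position of the lowest 0 bit in the bitmap.
        r = 0
        while (self.bitmap >> r) & 1:
            r += 1
        return (2 ** r) / PHI


if __name__ == "__main__":
    sketch = DCountSketch()
    for value in range(1000):
        sketch.iterate(value)
        sketch.iterate(value)  # repeated values do not affect the estimate
    print(sketch.terminate())
```

A single bitmap gives only a coarse, power-of-two estimate. Practical versions combine many hash functions (for example, by averaging within groups and taking the median across groups) to reduce the variance.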

Task 4.2: Assume that you have a stream of temperature readings temperature(Celsius Integer) that starts every day at time 00:01 and ends at time 23:59. At the end of each day, we want to have 10,000 temperature samples stored in a table tenKsamples(Rowno Integer, Celsius Integer). We do not know how many temperature readings are going to arrive each day, except that their number is significantly larger than 10,000. Please write a UDA that uses the reservoir algorithm to populate tenKsamples(Rowno, Celsius) with 10,000 random samples taken from temperature(Celsius Integer); the table is then processed and reset to empty at midnight. You can assume that the system supports a function random(K) which, given a positive integer K, returns a random integer between 1 and K.
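
For reference, the reservoir algorithm itself can be sketched in Python as below. This is an illustration of the sampling logic only, not the requested UDA: a dictionary stands in for the table tenKsamples, random.Random.randint(1, K) stands in for random(K), and the names ReservoirSampler, iterate, and terminate are hypothetical.

```python
import random


class ReservoirSampler:
    """Keep a uniform random sample of fixed size from a stream of unknown length."""

    def __init__(self, capacity=10_000, rng=None):
        self.capacity = capacity
        self.rng = rng or random.Random()
        self.seen = 0      # readings received so far today
        self.table = {}    # Rowno -> Celsius, stand-in for tenKsamples

    def iterate(self, celsius):
        """Process one temperature reading."""
        self.seen += 1
        if self.seen <= self.capacity:
            # The first `capacity` readings fill the table directly.
            self.table[self.seen] = celsius
        else:
            # Reading number i replaces a random row with probability capacity / i.
            j = self.rng.randint(1, self.seen)   # plays the role of random(K)
            if j <= self.capacity:
                self.table[j] = celsius

    def terminate(self):
        """At midnight: hand over the day's samples and reset to empty."""
        rows = sorted(self.table.items())
        self.seen = 0
        self.table = {}
        return rows


if __name__ == "__main__":
    sampler = ReservoirSampler(capacity=5, rng=random.Random(0))
    for reading in range(100):
        sampler.iterate(reading)
    print(sampler.terminate())
```

Each reading ends up in the final sample with the same probability, capacity divided by the total number of readings, even though that total is not known in advance.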

-----------------------------------------------------------------------------------------------------


Research Report (as your take-home final). Due on or before Dec. 14.

Done individually or in pairs.
