
ZS Young Data Scientist Challenge

HARSHIT SAXENA
Overview of Data & Quality-checks

 The problem can be defined as sequence prediction.


 Training data: 3,000 patients, 6,472 unique Event IDs.
 Testing data is very small: we have to predict the next 10 entries.
 Plotting the data of various patients gave meaningful insights: quite a
few patients are active only in the last 6-8 months, whereas a few are
active in checkups only until July 2013. [Screenshots attached on the
next slide.]
 It therefore makes more sense to give more weight to recent data.
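One simple way to weight recent data more, sketched below with a hypothetical event log and an assumed exponential-decay rule (the half-life value and tuple layout are illustrative, not taken from the original submission):

```python
from collections import Counter
from datetime import date

# Hypothetical event log: (patient_id, event_id, visit_date) tuples.
events = [
    (1, "E100", date(2013, 7, 10)),
    (1, "E100", date(2014, 1, 5)),
    (1, "E200", date(2014, 2, 20)),
]

def weighted_counts(events, ref=date(2014, 3, 1), half_life_months=6):
    """Count events per patient, down-weighting older visits."""
    counts = {}
    for pid, eid, d in events:
        age_months = (ref.year - d.year) * 12 + (ref.month - d.month)
        weight = 0.5 ** (age_months / half_life_months)  # exponential decay
        counts.setdefault(pid, Counter())[eid] += weight
    return counts

print(weighted_counts(events))
```

With this decay, a visit 6 months ago counts half as much as one today, so events concentrated in recent months rank higher.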
Visualization of each Patient

This shows the variation in each patient's data.


Visualization of each event

This shows the variation in the frequency of events.


 There are many events with a frequency of 30+ in each patient's data,
which indicates that those checkups are done regularly. This gives insight
into repeating data when we switch months.
 The average frequency of events in a month is not very high, typically
around 10 per patient, so it makes more sense to repeat event IDs.
Model Selection

 First model approach: Hidden Markov Models performed very


badly on the leaderboard data.
 Second model approach: ALS (alternating least squares) for
correlating patients. It was not able to find good features in the data,
so accuracy was very low.
 Third model approach: top-k elements in each patient's data. This
method was used in my final submission.
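The top-k approach reduces to a per-patient frequency count. A minimal sketch with toy data (the patient histories below are invented for illustration):

```python
from collections import Counter

# Toy per-patient event histories; the real data has thousands of Event IDs.
history = {
    "P1": ["E7", "E7", "E3", "E7", "E3", "E9"],
    "P2": ["E1", "E1", "E1", "E2"],
}

def top_k_events(history, k=10):
    """Return the k most frequent Event IDs per patient, ordered by count."""
    return {pid: [e for e, _ in Counter(seq).most_common(k)]
            for pid, seq in history.items()}

print(top_k_events(history, k=2))
# → {'P1': ['E7', 'E3'], 'P2': ['E1', 'E2']}
```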
Feature Engineering

 I created two different datasets: a subset of the data for PatientIDs with high


frequency in just the last 6 months, and the full data for PatientIDs with no
presence in the last 6 months.
 Created features that compute the frequencies of the top 10 events for
each PatientID over the last 6 months. This helps capture 0s in the top k
(suppose an event occurred often in a patient's history, but he never
went for that event in the last 6 months). This removes wrong values.
 Based on this, replaced 0s with the most frequently occurring event (highest
probability of occurrence).
 The next important challenge is to predict the correct sequence of events.
 Created a copy of the first 3 most frequent events and replaced the last
columns with it. This gave a boost to my score on the leaderboard.
 This method may lead to overfitting.
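The zero-replacement step can be sketched as follows. This is a minimal reconstruction, assuming a simple fill rule (zero-count slots in the recent top k are filled with the patient's overall most frequent event); the exact rule in the submission may differ:

```python
from collections import Counter

def top_k_recent(full_seq, recent_seq, k=10):
    """Rank events by recent frequency; fill empty slots with the
    patient's overall most frequent event (assumed fill rule)."""
    ranked = [e for e, c in Counter(recent_seq).most_common(k) if c > 0]
    if len(ranked) < k:
        # Fall back to the single most frequent event in the full history.
        fallback = Counter(full_seq).most_common(1)[0][0]
        ranked += [fallback] * (k - len(ranked))
    return ranked

full = ["E1"] * 8 + ["E2"] * 5 + ["E3"] * 2
recent = ["E2", "E2", "E3"]
print(top_k_recent(full, recent, k=5))
# → ['E2', 'E3', 'E1', 'E1', 'E1']
```

Here "E1" dominates the full history but is absent from the last 6 months, so it only fills the trailing slots instead of leading the prediction.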
Accuracy Check

 Created an accuracy checker that computes the error based on the


occurrence of each predicted event in the actual top k.
 Created validation data of 10 events.
 Error = +1 if predicted_event is not in actual_events, else 0.
 My error decreased when I replaced the last columns with the most
frequent events.
 My model's error stayed consistent as I tuned the parameters, which
suggests that overfitting was avoided, at least for this test data.
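The error rule above can be sketched directly (event IDs below are illustrative):

```python
def top_k_error(predicted, actual):
    """+1 for every predicted event absent from the actual events, else 0."""
    actual_set = set(actual)
    return sum(1 for e in predicted if e not in actual_set)

pred = ["E1", "E2", "E9", "E1"]
act = ["E1", "E3", "E2"]
print(top_k_error(pred, act))  # → 1: only "E9" is missing from the actual events
```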
Thank You
