An Efficient Concept-Based Mining Model For Enhancing Text Clustering (Synopsis)

Uploaded by Mumbai Academics

Copyright: Attribution Non-Commercial (BY-NC)

ABSTRACT
Most common techniques in text mining are based on the statistical analysis of a term, either a word or a phrase. Statistical analysis of term frequency captures the importance of a term within a document only. However, two terms can have the same frequency in their documents while one contributes more to the meaning of its sentences than the other. Thus, the underlying text mining model should indicate terms that capture the semantics of the text. In this case, the mining model can capture terms that present the concept of the sentence, which leads to discovering the topic of the document. A new concept-based mining model that analyzes terms on the sentence, document, and corpus levels is introduced. The concept-based mining model can effectively discriminate between terms that are unimportant with respect to sentence semantics and terms that hold the concepts representing the sentence meaning. The proposed mining model consists of sentence-based concept analysis, document-based concept analysis, corpus-based concept analysis, and a concept-based similarity measure. A term that contributes to the sentence semantics is analyzed on the sentence, document, and corpus levels rather than through the traditional analysis of the document only. The proposed model can efficiently find significant matching concepts between documents according to the semantics of their sentences. The similarity between documents is calculated with a new concept-based similarity measure, which takes full advantage of the concept analysis measures on the sentence, document, and corpus levels.

Large sets of experiments using the proposed concept-based mining model on different datasets in text clustering are conducted. The experiments demonstrate extensive comparison between the concept-based analysis and the traditional analysis. Experimental results demonstrate the substantial enhancement of the clustering quality using the sentence-based, document-based, corpus-based, and combined approach concept analysis.

Index Terms: Concept-based mining model, sentence-based, document-based, corpus-based, concept-based, concept analysis, conceptual term frequency, concept-based similarity.
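As a rough illustration of the three analysis levels named in the abstract, the Java sketch below counts a concept within one sentence, within one document, and across a corpus. The synopsis does not define these measures, so the counting rules here are assumptions made for illustration: a sentence is modeled as the list of its labeled concepts, a document as a list of sentences, and the class and method names (`ConceptCounts`, `inSentence`, `inDocument`, `documentsContaining`) are hypothetical.

```java
import java.util.Collections;
import java.util.List;

/** Illustrative counts on three levels; the definitions are assumptions, not taken from the paper. */
final class ConceptCounts {

    /** Sentence level: how often the concept occurs among the labeled concepts of one sentence. */
    static int inSentence(List<String> sentenceConcepts, String concept) {
        return Collections.frequency(sentenceConcepts, concept);
    }

    /** Document level: sum of the sentence-level counts over all sentences of the document. */
    static int inDocument(List<List<String>> document, String concept) {
        int total = 0;
        for (List<String> sentence : document) {
            total += inSentence(sentence, concept);
        }
        return total;
    }

    /** Corpus level: number of documents in which the concept occurs at least once. */
    static int documentsContaining(List<List<List<String>>> corpus, String concept) {
        int docs = 0;
        for (List<List<String>> document : corpus) {
            if (inDocument(document, concept) > 0) {
                docs++;
            }
        }
        return docs;
    }
}
```

Keeping the three counts as separate functions mirrors the separation into sentence-based, document-based, and corpus-based analysis, so each level can be evaluated alone or in combination.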

PROPOSED SYSTEM:
In this paper, a novel concept-based mining model is proposed. The proposed model captures the semantic structure of each term within a sentence and a document, rather than only the frequency of the term within a document. In the proposed model, three measures for analyzing concepts on the sentence, document, and corpus levels are computed. Each sentence is labeled by a semantic role labeler that determines the terms which contribute to the sentence semantics. Each term that has a semantic role in the sentence is called a concept. Concepts can be either words or phrases and depend entirely on the semantic structure of the sentence. When a new document is introduced to the system, the proposed mining model can detect concept matches between this document and all previously processed documents in the data set by scanning the new document and extracting the matching concepts.
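The matching step described above can be sketched with an inverted index from each concept to the documents that contain it. This is a minimal sketch under stated assumptions, not the authors' implementation: the semantic role labeler is taken as given (its output is passed in as a collection of concept strings), and the class `ConceptIndex` and method `addAndMatch` are hypothetical names.

```java
import java.util.Collection;
import java.util.Collections;
import java.util.HashMap;
import java.util.HashSet;
import java.util.LinkedHashSet;
import java.util.Map;
import java.util.Set;
import java.util.TreeMap;
import java.util.TreeSet;

/** Hypothetical inverted index used to find concepts shared with earlier documents. */
final class ConceptIndex {
    // concept -> IDs of previously processed documents that contain it
    private final Map<String, Set<Integer>> postings = new HashMap<>();
    private int nextDocId = 0;

    /**
     * Scans the concepts of a new document, returns for each earlier document
     * the concepts it shares with the new one, then adds the new document to the index.
     */
    Map<Integer, Set<String>> addAndMatch(Collection<String> concepts) {
        Set<String> unique = new LinkedHashSet<>(concepts);
        Map<Integer, Set<String>> matches = new TreeMap<>();
        for (String concept : unique) {
            Set<Integer> docs = postings.getOrDefault(concept, Collections.<Integer>emptySet());
            for (int docId : docs) {
                matches.computeIfAbsent(docId, k -> new TreeSet<>()).add(concept);
            }
        }
        int id = nextDocId++;   // document IDs are assigned in arrival order, starting at 0
        for (String concept : unique) {
            postings.computeIfAbsent(concept, k -> new HashSet<>()).add(id);
        }
        return matches;
    }
}
```

With such an index, a new document is compared only against documents that share at least one concept with it, so the scan does not have to revisit the full text of earlier documents.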

A new concept-based similarity measure, which makes use of the concept analysis on the sentence, document, and corpus levels, is proposed. The important terms used in this paper are label, term, concept, and verb-argument structure.
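The synopsis does not give the formula of the proposed similarity measure, so the sketch below does not reproduce it. As a stand-in, it shows a plain cosine similarity over per-concept weights, where only concepts present in both documents contribute to the score. The weights are assumed to have been derived beforehand from the sentence, document, and corpus level analysis; the class name `ConceptSimilarity` and the weight maps are hypothetical.

```java
import java.util.Map;

/** Stand-in similarity: cosine over concept weights, not the measure proposed in the paper. */
final class ConceptSimilarity {

    static double cosine(Map<String, Double> a, Map<String, Double> b) {
        double dot = 0.0;
        for (Map.Entry<String, Double> entry : a.entrySet()) {
            Double other = b.get(entry.getKey());
            if (other != null) {
                dot += entry.getValue() * other;   // only matching concepts contribute
            }
        }
        double norm = Math.sqrt(sumOfSquares(a)) * Math.sqrt(sumOfSquares(b));
        return norm == 0.0 ? 0.0 : dot / norm;     // empty or all-zero vectors give 0
    }

    private static double sumOfSquares(Map<String, Double> weights) {
        double sum = 0.0;
        for (double w : weights.values()) {
            sum += w * w;
        }
        return sum;
    }
}
```

A pairwise score of this kind is what a clustering algorithm would consume; swapping in the paper's own measure would only change the body of `cosine`.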

SOFTWARE REQUIREMENTS:
Operating System : Win XP / Linux
Language : Java
Database : Oracle

HARDWARE REQUIREMENTS:
Processor : 1.0 GHz
RAM : 512 MB
Hard disk : 30 GB
