Written by students who passed Immediately available after payment Read online or as PDF Wrong document? Swap it for free 4.6 TrustPilot
logo-home
Exam (elaborations)

Data Mining Clustering

Rating
-
Sold
-
Pages
19
Grade
A
Uploaded on
05-10-2024
Written in
2023/2024

Here’s a similar structured description for **Clustering in Data Mining**: --- Clustering in Data Mining Clustering: is a foundational technique in data mining that focuses on grouping data points into clusters based on their similarities. This method, often used in exploratory data analysis, helps in identifying natural patterns within datasets. As an **unsupervised learning** approach, clustering does not rely on pre-labeled outcomes and is commonly applied in a wide range of fields like marketing, biology, and image processing. Purpose: The primary purpose of clustering is to discover hidden structures in large datasets. It helps in segmenting data, identifying trends, and grouping similar items for targeted analysis. Clustering is extensively used for tasks like market segmentation, anomaly detection, and image processing. Clustering relies heavily on mathematical and computational techniques, often using distance metrics to form clusters. Practical applications include a variety of algorithms, each suited to different types of data and clustering goals. Many clustering methods also allow for flexible use cases, such as detecting outliers or handling noise in datasets. Clustering is a widely-used technique for data scientists, machine learning practitioners, and analysts. It’s commonly included in courses on data mining, machine learning, and exploratory data analysis, making it an essential tool for professionals and students working with large datasets.

Show more Read less
Institution
Course

Content preview

Unit V - Clustering
• Cluster analysis, also known as clustering, is a method of data mining that groups similar data points to
goal of cluster analysis is to divide a dataset into groups (or clusters) such that the data points within each gro
similar to each other than to data points in other groups. This process is often used for exploratory data analysis
identify patterns or relationships within the data that may not be immediately obvious.


Clustering Methods:


1. Partitioning Method: It is used to make partitions on the data in order to form clusters. If “n” partition
on “p” objects of the database then each partition is represented by a cluster and n < p. The two conditions
to be satisfied with this Partitioning Clustering Method are:
• One objective should only belong to only one group.
• There should be no group without even a single purpose.


2. Hierarchical Method: In this method, a hierarchical decomposition of the given set of data objects are crea



Dr.Priya Govindarajan

,There are two types of approaches for the creation of hierarchical decomposition, they are:

•Agglomerative Approach: The agglomerative approach is also known as the bottom-up approach
given data is divided into which objects form separate groups. Thereafter it keeps on merging the o
groups that are close to one another which means that they exhibit similar properties. This mer
continues until the termination condition holds.

•Divisive Approach: The divisive approach is also known as the top-down approach. In this approac
start with the data objects that are in the same cluster. The group of individual clusters is divide
clusters by continuous iteration. The iteration continues until the condition of termination is met or unti
contains one object.


3. Density-Based Method: The density-based method mainly focuses on density. In this method, the
will keep on growing continuously as long as the density in the neighbourhood exceeds some threshold
data point within a given cluster. The radius of a given cluster has to contain at least a minimum numb

4.Grid-Based Method: In the Grid-Based method a grid is formed using the object together,i.e, the ob
quantized into a finite number of cells that form a grid structure.



Dr.Priya Govindarajan

, 5. Model-Based Method: In the model-based method, all the clusters are hypothesized in order to find the data
suited for the model. The clustering of the density function is used to locate the clusters for a given model. It
spatial distribution of data points and also provides a way to automatically determine the number of clust
standard statistics, taking outlier or noise into account. Therefore it yields robust clustering methods.


6.Constraint-Based Method: The constraint-based clustering method is performed by the incorporation of a
user-oriented constraints. A constraint refers to the user expectation or the properties of the desire
results. Constraints provide us with an interactive way of communication with the clustering process. The
application requirement can specify constraints.


Applications Of Cluster Analysis:


• It is widely used in image processing, data analysis, and pattern recognition.
• It helps marketers to find the distinct groups in their customer base and they can characterize their customer gro
purchasing patterns.
• It can be used in the field of biology, by deriving animal and plant taxonomies and identifying genes w
capabilities.
• It also helps in information discovery by classifying documents on the web.

Dr.Priya Govindarajan

Written for

Institution
Course

Document information

Uploaded on
October 5, 2024
Number of pages
19
Written in
2023/2024
Type
Exam (elaborations)
Contains
Questions & answers

Subjects

$3.99
Get access to the full document:

Wrong document? Swap it for free Within 14 days of purchase and before downloading, you can choose a different document. You can simply spend the amount again.
Written by students who passed
Immediately available after payment
Read online or as PDF

Get to know the seller
Seller avatar
mithun3

Also available in package deal

Get to know the seller

Seller avatar
mithun3 amrita
Follow You need to be logged in order to follow users or courses
Sold
-
Member since
1 year
Number of followers
0
Documents
25
Last sold
-

0.0

0 reviews

5
0
4
0
3
0
2
0
1
0

Recently viewed by you

Why students choose Stuvia

Created by fellow students, verified by reviews

Quality you can trust: written by students who passed their tests and reviewed by others who've used these notes.

Didn't get what you expected? Choose another document

No worries! You can instantly pick a different document that better fits what you're looking for.

Pay as you like, start learning right away

No subscription, no commitments. Pay the way you're used to via credit card and download your PDF document instantly.

Student with book image

“Bought, downloaded, and aced it. It really can be that simple.”

Alisha Student

Working on your references?

Create accurate citations in APA, MLA and Harvard with our free citation generator.

Working on your references?

Frequently asked questions