Birla Institute of Technology &
Science, Pilani
Work Integrated Learning Programmes
Division
First Semester 2017-18
Mid Semester Test (EC2 Regular)
Course No: IS ZC415
Course Title: Data Mining
Nature of Exam: Closed Book
Weightage: 30%
Duration: 2 Hours
Date of Exam: 23/Sep/2017 (AN)
No of pages: 2
No of questions: 4
Page: 1

Page: 2

Solutions:

Answer 1(A):
Binning Methods for Data Smoothing
Sorted data for price (in dollars): 4, 8,
9, 15, 21, 21, 24, 25, 26, 28, 29, 34
*
Partition into (equi-depth) bins:
- Bin 1: 4, 8, 9, 15
- Bin 2: 21, 21, 24, 25
- Bin 3: 26, 28, 29, 34
*
Smoothing by bin means:
- Bin 1: 9, 9, 9, 9
- Bin 2: 23, 23, 23, 23
- Bin 3: 29, 29, 29, 29
*
Smoothing by bin boundaries:
- Bin 1: 4, 4, 4, 15
- Bin 2: 21, 21, 25, 25
- Bin 3: 26, 26, 26, 34
Anwer 1(B)
Euclidean distance is widely used in the
Geometry where shortest distance between two points is often required to
calculated as in distances between two celestial objects in space.
Manhattan distance is used in the
navigation systems to calculate the distance between two points through the
obstacle that are there in the path. This is also known as ‘taxi cab’ distance.
Cosine distance is used in ‘web search,
information retrieval’ where two documents are represented as vectors with
terms as dimensions and similarity between two documents is calculated in the
form of cosine distance between them.
Answer 1(C):
...
A confusion matrix is a table that
is often used to describe the performance of a classification model (or
"classifier") on a set of test data for which the true values are
known.

...

…
In z-score normalization (or zero-mean normalization),
the values for an attribute, A, are normalized based on the mean and
standard deviation of A. A value, v, of A is normalized to
v’ by computing:

Marks
|
Z-score
|
|
Subject 1
|
70
|
(70 – 60)/15 = 0.666
|
Subject 2
|
65
|
(65 - 60)/6 = 0.833
|
Student did better subject 2.
Answer 2:
Use this:
…
x-mean = 1.5
y-mean = 3.5
W1 = ((1-1.5)*(2 – 3.5) + (2-1.5)*(5-3.5)) / ((1-1.5)^2 +
(2-1.5)^2) = 3
W0 = 3.5 – 3*(1.5) = -1
“What will be the class label for nodes with no training
samples?”
Answer 3(A)
Info(D) = -(2/6)(log (2/6) / log (2)) - (4/6)(log (4/6) /
log (2)) = 0.92
...
Intermediate calculation: [-(1/3)*(log(1/3) / log (2))
-(2/3)*(log(2/3) / log (2))] = 0.92
...
...
Tag: BITS WILP Data Mining Mid-Sem Exam 2017-H2
Question 3(B)
...
Answer 4:
Example:
•
Support
—
Usefulness of
discovered rules
•
Confidence
—
Certainty of
discovered rules
computer => antivirus
software [support = 2%, confidence = 60%]
·
A support of 2%
means that 2% of all the transactions under analysis show that computer and
a.v. are purchased together.
·
A confidence of
60% means that 60% of the customers who purchased a computer also bought the
software.

...
Answer 4(B)
From Stackoverflow.com
Ques: I
want to find out the maximal frequent item sets and the closed
frequent item sets.
Frequent
item set X∈F is maximal
if it does not have any frequent supersets.
Frequent
item set X ∈ F is closed if it has no superset
with the same frequency
So I
counted the occurrence of each item set.
{A}
= 4 ; {B} = 2 ; {C} = 5
; {D} = 4 ; {E} = 6
{A,B}
= 1; {A,C} = 3; {A,D} = 3; {A,E} = 4; {B,C} = 2;
{B,D}
= 0; {B,E} = 2; {C,D} = 3; {C,E} = 5; {D,E} = 3
{A,B,C}
= 1; {A,B,D} = 0; {A,B,E} = 1; {A,C,D} = 2; {A,C,E} = 3;
{A,D,E}
= 3; {B,C,D} = 0; {B,C,E} = 2; {C,D,E} = 3
{A,B,C,D}
= 0; {A,B,C,E} = 1; {B,C,D,E} = 0
Min_Support
set to 50%
Does maximal
= {A,B,C,E}?
Does closed
= {A,B,C,D} and {B,C,D,E}?
…
Ans:
Note:
{A} = 4 ; not
closed due to {A,E}
{B} = 2 ; not
frequent => ignore
{C} = 5 ; not
closed due to {C,E}
{D} = 4 ; closed,
but not maximal due to e.g. {A,D}
{E} = 6 ; closed,
but not maximal due to e.g. {D,E}
{A,B} = 1; not frequent => ignore
{A,C} = 3; not closed due to {A,C,E}
{A,D} = 3; not closed due to {A,D,E}
{A,E} = 4; closed, but not maximal due to {A,D,E}
{B,C} = 2; not frequent => ignore
{B,D} = 0; not frequent => ignore
{B,E} = 2; not frequent => ignore
{C,D} = 3; not closed due to {C,D,E}
{C,E} = 5; closed, but not maximal due to {C,D,E}
{D,E} = 4; closed, but not maximal due to {A,D,E}
{A,B,C} = 1; not frequent => ignore
{A,B,D} = 0; not frequent => ignore
{A,B,E} = 1; not frequent => ignore
{A,C,D} = 2; not frequent => ignore
{A,C,E} = 3; maximal frequent
{A,D,E} = 3; maximal frequent
{B,C,D} = 0; not frequent => ignore
{B,C,E} = 2; not frequent => ignore
{C,D,E} = 3; maximal frequent
{A,B,C,D} = 0; not frequent => ignore
{A,B,C,E} = 1; not frequent => ignore
{B,C,D,E} = 0; not frequent =>
ignore
|
Answer to problem:
Frequent
item set X∈F is maximal
if it does not have any frequent supersets.
Frequent
item set X ∈ F is closed if it has no superset with
the same frequency
Closed 2-itemsets: “ab, bc, bd”
Maximal 2-itemsets: “bd”
*****
Awesome...
ReplyDeleteSurvival8: Bits Wilp Software Development For Portable Devices 2017-H2 Handout >>>>> Download Now
Delete>>>>> Download Full
Survival8: Bits Wilp Software Development For Portable Devices 2017-H2 Handout >>>>> Download LINK
>>>>> Download Now
Survival8: Bits Wilp Software Development For Portable Devices 2017-H2 Handout >>>>> Download Full
>>>>> Download LINK Ye
i tried mailing you , but it is not going through. Can you please share the answers to this question paper if you have.
ReplyDeleteThanks
Mail me at ashishjainblogger@gmail.com
DeleteMention the subject(s) for which you want help.
Thanks for putting up the content in such organized manner.Great Help. Thumbs UP !!
ReplyDeleteRecorded lecture links are not working please help
ReplyDeleteIs there a group/whatsapp group for Mtech in computing systems and Infrastructure?
ReplyDeleteVery useful .. Thank you very much
ReplyDeleteAre you Looking for Smart Device Development? QuantaEd Solutions is the Best Smart Device Development Company, We design and develop smart devices that suit the healthcare requirements. For any inquiry call us we will provide all kind of assistance. For more details visit- https://quantaedsolutions.com
ReplyDeleteThis post is so interactive and informative.keep updating more information...
ReplyDeleteSoftware Testing Courses in Mumbai
Software Testing Training in Ahmedabad
Software Testing Courses in Kochi
Software Testing Courses in Trivandrum
Software Testing Courses in Kolkata
Thanks for the blog article.Thanks Again. Keep writing.
ReplyDeletejava online training hyderabad
java online training in india
Thanks for the blog article.Much thanks again. Fantastic.
ReplyDeleteonline training in java
online training on java
AI & ML in Dubai
ReplyDeletehttps://www.nsreem.com/ourservices/ai-ml/
Artificial intelligence is very widespread today. In at least certainly considered one among its various forms has had an impact on all major industries in the world today, NSREEM is #1 AI & ML Service Provider in Dubai
1634348519669-9
Thank you for giving valuable information about software for portable device, we can also develop custom software from pixabulous design.
ReplyDeleteNice Blog!!!
ReplyDeleteServiceNow Training
ServiceNow Online Training in Hyderabad
This article explains in a clear manner. Nice way of explaining. Thanks for sharing. cloud engineering services
ReplyDeleteI really liked your blog post.Much thanks again. Awesome.
ReplyDeletejava online training
java training
Data Science Training In Noida
ReplyDeleteData Science course In Noida
WILP is a set of educational programs designed in such a way that they can be easily integrated into your work life. Earlier, only highly developed nations like the US and Europe were indoctrinating WILPs but now the WILP in India have also gained a lot of popularity.
ReplyDeleteCandidates who wish to take the BITSAT should begin studying as soon as possible. Due to the high level of competition, it is critical to follow the best BITSAT 2022 preparation tips recommended by professionals. This blog post contains BITSAT 2022 study suggestions as well as exam pattern and syllabus information. Continue reading to get answers to all of your questions. To know more information visit @ SSSi Online Tutoring Services.
ReplyDeleteSurvival8: Bits Wilp Software Development For Portable Devices 2017-H2 Handout >>>>> Download Now
ReplyDelete>>>>> Download Full
Survival8: Bits Wilp Software Development For Portable Devices 2017-H2 Handout >>>>> Download LINK
>>>>> Download Now
Survival8: Bits Wilp Software Development For Portable Devices 2017-H2 Handout >>>>> Download Full
>>>>> Download LINK bO
"Thanks for sharing this informative blog on Best Software Development company in chennai,Software Development Company in chennai,
ReplyDeleteBest Software Development company in india,
Top software development company in chennai,
Software Development Company in india"
The BITS Pilani Admission Process is designed to select the brightest minds for its world-class programs. With its independent entrance exam, BITSAT, and direct admission opportunities for board toppers, the institute ensures that only the most deserving candidates secure a place.
ReplyDelete