Math Questions
Explore questions in the Math category that you can ask Spark.E!
Spreadsheet models are without question the most ___ business analytics tool
Compares the confidence of the association rule with the overall unconditional probability. - Lift = 1: level of association is the same as no rule at all (random guessing) - Higher the lift, the stronger the association - Lift > 1: strong (positive) association - Lift between 0 and 1: negative association - The closer the lift ratio is to 0, the stronger the negative association
Another useful Excel function not entertained in the next example is
With if-then association rules We need a way to evaluate the ______ of these rules, the strength of associations among products. • Only the strong associations that occur frequently have the potential to ______ consistently in the future.
______ of the association rule is the conditional probability that the consequent occurring given the antecedent occurs.
HCA can have reduced performance on larger datasets and is ____ to outliers.
in Two-Step Cluster Analysis We now extend the analysis with data consisting of both numerical and categorical variables, which is also referred to as
HCA seeks to build a hierarchy of clusters without having ____ number of clusters.
One inherent problem with searching for hidden relationships between items or item sets is dealing with the extremely _____ number of possible combinations.
Association rules are __-____ logical statements that represent relationships among different items or item sets.
Spreadsheet models are mathematical, and logic-based models often referred to as
There are several algorithms that can be used to perform association rule analysis in a more efficient manner. • They all focus on the ______ of item sets.
Association rules are Designed to identify ____ patterns and co-occurring events in data
Association rule analysis is Designed to identify events that tend to occur together - Also known as affinity analysis or
KCA requires advance knowledge of ''
Aiming to partition n observations into k clusters in which each observation belongs to the cluster with the nearest mean (cluster centers or cluster centroid), serving as a prototype of the cluster.
Observations are _____ within a cluster
is an unsupervised data mining technique that groups data into categories that share some similar characteristic or trait
Each observation in the data initially forms its own cluster. - Then, merges these clusters into larger clusters based on their similarity - Merging continues until all observations are merged into one final cluster - Referred to as a root
Useful ____ ____ by summarizing a large number of observations in a data set into a small number of clusters