
Data Mining - Questions to answer - Essay Example

Extract of sample
Data Mining - Questions to answer

A Back-Propagated Delta Rule Network (BP) is an example of a multilayer perceptron, which contains additional hidden layers. It can learn functions that a single-layer perceptron cannot, such as functions that are not linearly separable.
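The difference in power can be illustrated with XOR, a function that is not linearly separable. The sketch below is only an illustration under stated assumptions: it trains a single-layer perceptron with the perceptron learning rule and compares it with a two-layer network whose weights are hand-picked to keep the example deterministic (in practice back-propagation would be used to learn them). All function and variable names are hypothetical.

```python
def step(value):
    """Threshold activation: fire (1) only for strictly positive input."""
    return 1 if value > 0 else 0


def train_perceptron(data, epochs=100, rate=1.0):
    """Train a single-layer perceptron with the perceptron learning rule."""
    w1, w2, bias = 0.0, 0.0, 0.0
    for _ in range(epochs):
        mistakes = 0
        for (x1, x2), target in data:
            error = target - step(w1 * x1 + w2 * x2 + bias)
            if error != 0:
                w1 += rate * error * x1
                w2 += rate * error * x2
                bias += rate * error
                mistakes += 1
        if mistakes == 0:  # every training case is classified correctly
            break
    return w1, w2, bias


def perceptron_accuracy(weights, data):
    """Fraction of cases the trained perceptron classifies correctly."""
    w1, w2, bias = weights
    hits = sum(step(w1 * x1 + w2 * x2 + bias) == t for (x1, x2), t in data)
    return hits / len(data)


def two_layer_xor(x1, x2):
    """Hidden units compute OR and NAND; the output unit ANDs them.

    The weights are hand-picked for illustration, not learned.
    """
    h_or = step(x1 + x2 - 0.5)
    h_nand = step(-x1 - x2 + 1.5)
    return step(h_or + h_nand - 1.5)


AND_DATA = [((0, 0), 0), ((0, 1), 0), ((1, 0), 0), ((1, 1), 1)]
XOR_DATA = [((0, 0), 0), ((0, 1), 1), ((1, 0), 1), ((1, 1), 0)]

and_accuracy = perceptron_accuracy(train_perceptron(AND_DATA), AND_DATA)
xor_accuracy = perceptron_accuracy(train_perceptron(XOR_DATA), XOR_DATA)
hidden_layer_correct = all(two_layer_xor(x1, x2) == t for (x1, x2), t in XOR_DATA)

print("single layer on AND:", and_accuracy)
print("single layer on XOR:", xor_accuracy)
print("hidden layer solves XOR:", hidden_layer_correct)
```

The single-layer perceptron fits AND, which is linearly separable, but no choice of its weights can classify all four XOR cases, whereas one hidden layer is enough to represent XOR.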
In neural network prediction, fitting the network ever more closely to the training cases in pursuit of accuracy eventually leads to overfitting (George N. Karystinos, 2000). Overfitting occurs when the number of input variables is large compared to the number of training cases, or when the inputs are highly correlated with each other. Underfitting and overfitting are not unique to neural networks; they are also encountered in methods such as kernel regression and smoothing splines. Overfitting is more likely in more complex networks, and it leads to unpredictable or wild predictions.
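Overfitting can be seen in a few lines using kernel regression, one of the methods named above, in place of a neural network. This is a minimal sketch under assumptions: the sine signal, the noise level, the two bandwidths and all names are made up for illustration.

```python
import math
import random


def kernel_predict(x, train_x, train_y, bandwidth):
    """Nadaraya-Watson estimate: Gaussian-weighted average of training targets."""
    weights = [math.exp(-0.5 * ((x - xi) / bandwidth) ** 2) for xi in train_x]
    return sum(w * y for w, y in zip(weights, train_y)) / sum(weights)


def mean_squared_error(xs, ys, train_x, train_y, bandwidth):
    """Average squared gap between predictions at xs and the targets ys."""
    errors = [(kernel_predict(x, train_x, train_y, bandwidth) - y) ** 2
              for x, y in zip(xs, ys)]
    return sum(errors) / len(errors)


def true_signal(x):
    """Hypothetical noise-free relationship the model should recover."""
    return math.sin(2 * math.pi * x)


rng = random.Random(0)  # fixed seed so the run is repeatable
train_x = [i / 20 for i in range(21)]
train_y = [true_signal(x) + rng.gauss(0, 0.3) for x in train_x]
test_x = [(i + 0.5) / 20 for i in range(20)]  # unseen points between training cases
test_y = [true_signal(x) for x in test_x]

results = {}
for bandwidth in (0.01, 0.1):  # very flexible model vs. smoother model
    train_mse = mean_squared_error(train_x, train_y, train_x, train_y, bandwidth)
    test_mse = mean_squared_error(test_x, test_y, train_x, train_y, bandwidth)
    results[bandwidth] = (train_mse, test_mse)
    print(f"bandwidth={bandwidth}: train MSE={train_mse:.6f}, test MSE={test_mse:.6f}")
```

The very small bandwidth reproduces the noisy training targets almost exactly, so its training error is close to zero while its error on unseen points is not. That gap between training and test error is the signature of overfitting.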
Data cleansing is the process of removing inaccurate and inappropriate data records, and it is an integral part of data processing and maintenance. In large data sets, finding and correcting errors requires interaction with domain experts, which is expensive and time-consuming. Because it involves comprehensively identifying and rectifying errors, the task is complex. Initially these operations were carried out manually; computational means of data cleansing evolved later, but even these are time-consuming and error-prone (Heiko Müller et al.).
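A rule-based cleansing pass might look like the sketch below. It is an illustration only: the record fields (`name`, `age`), the validity rules and the function name are assumptions, not part of any particular tool. Rejected records are kept with a reason so that a domain expert can review them rather than having them silently discarded.

```python
def clean_records(records):
    """Return (clean, rejected) after applying simple validity rules."""
    clean, rejected, seen = [], [], set()
    for record in records:
        name = (record.get("name") or "").strip()
        age = record.get("age")
        key = (name.lower(), age)  # normalised form used to spot duplicates
        if not name:
            rejected.append((record, "missing name"))
        elif not isinstance(age, int) or not 0 <= age <= 120:
            rejected.append((record, "age out of range"))
        elif key in seen:
            rejected.append((record, "duplicate"))
        else:
            seen.add(key)
            clean.append({"name": name, "age": age})
    return clean, rejected


# Hypothetical raw records containing typical errors.
raw = [
    {"name": "Ann", "age": 34},
    {"name": "ann ", "age": 34},   # duplicate once case and spacing are normalised
    {"name": "", "age": 51},       # missing name
    {"name": "Bob", "age": 430},   # implausible age
    {"name": "Cara", "age": 27},
]

clean, rejected = clean_records(raw)
for record, reason in rejected:
    print("rejected:", record, "->", reason)
```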

3. What is the significance of Bayes' Theorem in Data Mining? Give an example of how statistical inference can be used for Data Mining.
Most of the statistical models currently available for data mining are prone to overfitting and are unstable (sensitive to minor changes in the data). Bayesian methods of statistical mining can overcome these difficulties. The reliability of these algorithms has been reviewed (J. Kolter and M. Maloof, 2003). The Bayesian approach facilitates the integration of clustering and yields scalable, powerful algorithms suited to data mining. It also makes it possible to capture correlations among a large number of variables.
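Bayes' theorem itself, P(H | E) = P(E | H) P(H) / P(E), can be shown with a short calculation. The sketch below uses made-up figures for a hypothetical fraud-screening rule; the probabilities and names are assumptions chosen only to illustrate how a prior belief is updated by evidence.

```python
def posterior(prior, likelihood, false_positive_rate):
    """Bayes' theorem: P(H | E) = P(E | H) * P(H) / P(E)."""
    evidence = likelihood * prior + false_positive_rate * (1 - prior)
    return likelihood * prior / evidence


# Hypothetical figures for a fraud-screening rule.
p_fraud = 0.01             # P(fraud): prior probability of fraud
p_flag_given_fraud = 0.90  # P(flag | fraud)
p_flag_given_legit = 0.05  # P(flag | not fraud)

p_fraud_given_flag = posterior(p_fraud, p_flag_given_fraud, p_flag_given_legit)
print(f"P(fraud | flag) = {p_fraud_given_flag:.3f}")
```

Even with a rule that catches 90% of fraud, most flagged transactions are legitimate under these figures, because fraud is rare to begin with. Making the prior explicit in this way is what Bayesian methods contribute.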
Example:
When searching a sequence database for sequences (gene or protein) similar to a query, the data mining algorithm ranks matches by a statistical measure, the expected value (E-value). The lower the E-value, the stronger the relationship between the query and the retrieved result. Since the data are merely strings of characters, only statistical measures provide a sound basis for comparing the data sets.
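The text does not say which search tool is meant. Assuming BLAST-style statistics, the E-value of a match with raw score S is commonly estimated with the Karlin-Altschul formula E = K m n e^(-λS), where m and n are the query and database lengths. The sketch below uses that formula with placeholder values for K and λ; the scores, lengths and names are hypothetical.

```python
import math


def expected_value(score, query_length, database_length, k=0.041, lam=0.267):
    """Karlin-Altschul estimate: E = K * m * n * exp(-lambda * S).

    k and lam are illustrative placeholders; real values depend on the
    scoring matrix and gap penalties in use.
    """
    return k * query_length * database_length * math.exp(-lam * score)


# Hypothetical raw alignment scores for one query against one database.
hits = {"hit_a": 45, "hit_b": 80, "hit_c": 120}
e_values = {
    name: expected_value(score, query_length=250, database_length=5_000_000)
    for name, score in hits.items()
}

ranked = sorted(e_values, key=e_values.get)  # lowest E-value (most significant) first
for name in ranked:
    print(f"{name}: score={hits[name]}, E-value={e_values[name]:.3g}")
```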
4. Explain the concept of a Maximum Likelihood Estimator with an example.
Maximum likelihood estimation chooses the parameter values under which the observed data are most probable. It is applied in practice to predict phylogenetic relationships among protein sequences using tree algorithms, and the maximum likelihood estimator forms the basis of evolutionary prediction algorithms. The likelihood function scores how probable the given data sets (protein sequences) are under each candidate relationship. Using the maximum likelihood estimator, the algorithm eventually finds the most likely relative of each sequence among the others in the data set, and hence it is easy to predict the ancestral route as well as how ...
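A much simpler case than a full phylogenetic tree shows the idea of a maximum likelihood estimator. The sketch below assumes each position of two aligned sequences differs independently with probability p, so the estimator of p is the observed fraction of mismatches; a grid search over the log-likelihood confirms it. The sequences and names are hypothetical, and this is not the tree algorithm described above.

```python
import math


def count_mismatches(seq_a, seq_b):
    """Number of aligned positions at which the two sequences differ."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    return sum(a != b for a, b in zip(seq_a, seq_b))


def log_likelihood(p, mismatches, length):
    """Log-likelihood of mismatch probability p under independent positions."""
    matches = length - mismatches
    return mismatches * math.log(p) + matches * math.log(1 - p)


# Hypothetical aligned sequences of equal length.
seq_a = "ACGTACGTAC"
seq_b = "ACGTTCGTAA"

mismatches = count_mismatches(seq_a, seq_b)
length = len(seq_a)

# Closed-form maximum likelihood estimate: the observed mismatch fraction.
p_hat = mismatches / length

# Numerical check: the candidate with the highest log-likelihood on a grid.
grid = [i / 100 for i in range(1, 100)]
p_best = max(grid, key=lambda p: log_likelihood(p, mismatches, length))

print("closed-form estimate:", p_hat)
print("grid-search estimate:", p_best)
```

Both routes pick the same value because the likelihood peaks at the observed mismatch fraction. Tree-building methods apply the same principle to far richer models of sequence evolution.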

Summary

1. Is a Neural Network with one or more hidden layers more powerful than a single-layer perceptron? Explain. (Hint: in terms of learning, can a neural network with one or more hidden layers learn functions more complex than the perceptron?)
An artificial neural network contains networked neurons working together to solve complex problems…
Author : sipesaudreanne

Related Essays

Data Mining Techniques
Introduction Data Mining is an iterative and interactive discovery process required by organizations that generate huge volumes of data daily and require analysis on the fly. The decision support system is required to provide information to queries such as finding all cases of fraud, finding the customers that are likely to buy a particular car, etc.
10 pages (2500 words) Essay
Data mining
Therefore it is scientific that a true data mining software application or technique must be able to change data presentation criterion and also discover the previously unknown relationships amongst the data types. Data mining tools allow for possible prediction of the future trends and behaviors, hence enabling for formation of proactive, knowledge-driven decisions.
12 pages (3000 words) Essay
Data mining
The major organizational element, in this case, is the customers. The predictive scores inform the business about the most probable action by the customer. The production of predictive scores occurs when the subject organization design a predictive model.
4 pages (1000 words) Essay
Data Mining
Data mining is not just a collection of data; it is a combination of three technologies primarily contributed by the increase in the computing power, combined with improved data collection and management in addition to the improvements happening in statistical and learning algorithms.
10 pages (2500 words) Essay
Data Mining Questions
The items retrieved in answer to the inquiries become information that can be helpful in making decisions. Information retrieval on the other hand refers to the search of varying information and data in databases, either by detached databases or hypertext networked databases like the World Wide Web.
4 pages (1000 words) Essay
Data Mining and Web Personalization
This system allows companies to provide users with the content and products that they need without having to specifically request the information from the site (Eirinaki & Vazirgiannis, 2003). One of the major business problems in the twenty-first century is the increase in e-commerce and a move away from traditional brick-and-mortar stores.
8 pages (2000 words) Essay
Data mining
1). Depending on the exact nature of the tasks that organization performs, these could be anything. It is not possible to give specific advice, but there are four general principles
2 pages (500 words) Essay
Data Mining
The first is Discovery, or the practice of examining data without a pre-determined hypothesis, in order to discover patterns in it. The discovery stage may occur by classification of data on the basis of clusters, association rules among sets of data, sequential
1 page (250 words) Essay
Data mining
The heart of customer relationship management (CRM) and personalized marketing programs is data-mining. Spikes can understand the customer’s behavior and preferences by using CRM technologies such as database
8 pages (2000 words) Essay
Data Mining
Examples of data mining software are oracle, Microsoft SQL Server 2012 and SAS. KXEN provide an automated data mining for high productivity model building. It focuses on expanding the use of data mining within analysts, making them more
1 page (250 words) Essay