
Thursday, April 25, 2019

Data Mining - Questions to answer Essay Example | Topics and Well Written Essays - 1000 words

Back-propagated delta rule networks (BP) are an example of a multilayer perceptron, which contains additional hidden layers. Such a network can function more effectively than a single-layer network.

In the prediction process of neural networks, the training cases are increased in order to obtain accurate predictions, which can eventually lead to overfitting (George N. Karystinos, 2000). Overfitting occurs when the number of input variables is large compared to the number of training cases, or when the input variables are highly correlated with each other. Underfitting and overfitting are also commonly encountered in methods such as kernel regression and smoothing splines. Overfitting occurs in more complex networks and leads to erratic or absurd predictions.

Data cleansing is the process of removing inaccurate and inappropriate data records, and it is an integral part of data processing and maintenance. In large data sets, finding and correcting errors requires interaction with domain experts, which is expensive and time consuming. Because it involves the comprehensive task of identifying and rectifying errors, the work is complex. Initially these operations were carried out manually. Computational means of data cleansing evolved later, but even these are time consuming and error prone (Heiko Müller et al.).

3. What is the significance of Bayes' Theorem in Data Mining? Give an example of how statistical inference can be used for Data Mining.

Most of the presently available statistical models in data mining are prone to overfitting and are also unstable (sensitive to minor changes in the data). These difficulties can be overcome by Bayesian methods of statistical mining. The reliability of these algorithms has been reviewed (J. Kolter and M. Maloof, 2003).
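As a minimal sketch of the inference step that Bayesian methods rest on, the following applies Bayes' theorem to update a prior probability after observing evidence. The scenario (a validation check that flags possibly erroneous records) and all of the numbers are hypothetical, chosen only to illustrate the calculation.

```python
def posterior(prior: float, likelihood: float, false_positive_rate: float) -> float:
    """Return P(H | E) using Bayes' theorem.

    prior               -- P(H), probability of the hypothesis before seeing evidence
    likelihood          -- P(E | H), probability of the evidence if H is true
    false_positive_rate -- P(E | not H), probability of the evidence if H is false
    """
    # Total probability of the evidence, P(E), summed over both cases.
    evidence = likelihood * prior + false_positive_rate * (1.0 - prior)
    return likelihood * prior / evidence


# Hypothetical figures: 2% of records are erroneous, the check flags 90% of
# erroneous records and 5% of clean ones.
p_error_given_flag = posterior(prior=0.02, likelihood=0.90, false_positive_rate=0.05)
print(f"P(error | flagged) = {p_error_given_flag:.3f}")
```

Under these assumed figures the posterior is well below the 90% detection rate, because clean records vastly outnumber erroneous ones. This is the kind of correction that a prior contributes to an inference.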
Bayesian algorithms facilitate the integration of clustering and produce scalable, powerful algorithms suited to data mining. The Bayesian method also makes it possible to capture correlations among a large number of variables.

Example: When a sequence database is searched for similar sequences (gene or protein sequences), the data mining algorithm looks for close matches based on a statistical measure, the expected value (E-value). The lower the expected value, the stronger the relationship between the query and the retrieved results. Since the data involved are merely strings of characters, only statistical measures allow a comparative study of the data sets.

4. Explain the concept of a Maximum Likelihood Estimator with an example.

This is applied in practice to the prediction of phylogenetic relationships among protein sequences by tree algorithms. The maximum likelihood estimator forms the basis of the evolutionary prediction algorithms. The likelihood function predicts the relative function of all the given datasets (protein sequences). The algorithm eventually finds the most probable relative of the other sequences in the datasets by means of the maximum likelihood estimator, and hence it is easy to predict the ancestral route as well as how
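The phylogenetic case is involved, so the following is a deliberately simpler sketch of maximum likelihood estimation, not the tree algorithm itself. It estimates the success probability of a Bernoulli process from 0/1 observations. The data and the function names are hypothetical. The closed-form estimate is the sample mean, and a coarse grid search over the log-likelihood is included only to show that the same value maximizes the likelihood.

```python
import math
from typing import Sequence


def log_likelihood(p: float, data: Sequence[int]) -> float:
    """Bernoulli log-likelihood of 0/1 observations for success probability p."""
    return sum(math.log(p) if x == 1 else math.log(1.0 - p) for x in data)


def mle_closed_form(data: Sequence[int]) -> float:
    """Closed-form maximum likelihood estimate: the fraction of successes."""
    return sum(data) / len(data)


def mle_grid(data: Sequence[int], steps: int = 999) -> float:
    """Grid search over p in (0, 1) for the value with the highest log-likelihood."""
    candidates = [i / (steps + 1) for i in range(1, steps + 1)]
    return max(candidates, key=lambda p: log_likelihood(p, data))


# Hypothetical observations: 7 successes in 10 trials.
observations = [1, 0, 1, 1, 0, 1, 1, 0, 1, 1]
p_hat = mle_closed_form(observations)
p_grid = mle_grid(observations)
print(p_hat, p_grid)
```

The same principle carries over to tree algorithms, where the parameter being chosen is a tree with its branch lengths rather than a single probability, and the likelihood is computed from the aligned sequences.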
