Computing power has been increasing at roughly the rate described by Moore's law, commonly quoted as a doubling every eighteen months. The move to parallel processing has contributed substantially to more powerful machines. A number of statistical applications and algorithms had been waiting for this computing power to arrive, and data mining puts those algorithms to work. In addition, data is being collected on a very large scale at all levels. "The more data, the better the data mining exercise" has been the watchword of most of the work carried out. Together, these three factors (computing power, established statistical algorithms and abundant data) make data mining possible. Applying appropriate models to the data yields the data mining results. These enable businesses to identify buying-behaviour patterns among customers, identify customer demographic characteristics and predict customer response to mailings.
In most cases, both commercial and scientific establishments report that a large quantity of data is collected and stored, yet hardly any of it is available as information that people can use. At its most basic, a data mining effort starts by employing appropriate data models that help in understanding the system and its behaviour (Hand D J, 2001). This in turn helps to improve the work carried out and makes the future of the object being modelled more predictable. That is possible only if the object is well understood and the model is as accurate as possible. A number of modelling tools help in data mining, typically decision trees, rule induction, regression models and neural networks. All of these contribute to extracting the needed information from databases using data mining tools. They are not simple, straightforward SQL statements.
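To illustrate how a modelling tool differs from a fixed SQL query, the following is a minimal sketch of a one-level decision tree (a "decision stump") in plain Python. It is not taken from any particular data mining product. The records, the field meanings and the function names `fit_stump` and `predict` are all hypothetical and chosen purely for illustration. The point is that the threshold in the rule is learned from the data rather than written by hand in a query.

```python
from typing import List, Optional, Tuple

# Hypothetical toy records: (card spend in the last 24 hours, known fraud flag).
# The values are invented for illustration only.
TRANSACTIONS: List[Tuple[float, bool]] = [
    (20.0, False), (35.0, False), (50.0, False), (80.0, False),
    (400.0, True), (650.0, True), (900.0, True),
]


def fit_stump(rows: List[Tuple[float, bool]]) -> Tuple[Optional[float], int]:
    """Find the threshold for the rule 'fraud if spend > threshold'
    that misclassifies the fewest rows. Returns (threshold, errors).
    The threshold is None if all spend values are identical."""
    values = sorted({spend for spend, _ in rows})
    # Candidate thresholds are midpoints between consecutive distinct values.
    candidates = [(a + b) / 2 for a, b in zip(values, values[1:])]
    best_threshold: Optional[float] = None
    best_errors = len(rows) + 1
    for t in candidates:
        # Count rows where the rule's prediction disagrees with the label.
        errors = sum((spend > t) != label for spend, label in rows)
        if errors < best_errors:
            best_threshold, best_errors = t, errors
    return best_threshold, best_errors


def predict(threshold: float, spend: float) -> bool:
    """Apply the learned rule to a new transaction."""
    return spend > threshold


threshold, errors = fit_stump(TRANSACTIONS)
print(f"learned rule: fraud if spend > {threshold} ({errors} training errors)")
```

A full decision tree repeats this kind of split search recursively over many attributes, and real tools typically use measures such as information gain rather than a raw error count. The sketch only shows the core idea of inducing a rule from stored data.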
Qualitative analysis is possible with the predicate data, which is used to identify the object being modelled and to obtain an objective visualisation of it. In a quantitative analysis, by contrast, the data is processed automatically, driven by specific input data or by time. Based on the model, the information and data available in the system are extracted to meet the requirements. For banks, this helps in identifying and detecting patterns of fraudulent credit card usage. Banks may also want to identify loyal customers, as well as those who might switch loyalty over even a minor issue. It also helps in identifying credit card spending by customer group and in finding correlations between different financial indicators.
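The last two uses mentioned above, spending by customer group and correlation between financial indicators, can be sketched in a few lines of plain Python. This is an illustrative sketch only: the records, the group names and the choice of income and card spend as the two indicators are assumptions, and the helper names `spend_by_group` and `pearson` are hypothetical. The Pearson coefficient is used here as one common measure of linear correlation, and the text does not say which measure a given tool applies.

```python
import math
from collections import defaultdict
from typing import Dict, List, Sequence, Tuple

# Hypothetical records: (customer group, monthly income, monthly card spend).
# All values are invented for illustration only.
RECORDS: List[Tuple[str, float, float]] = [
    ("student", 1000.0, 320.0),
    ("student", 1200.0, 350.0),
    ("salaried", 3000.0, 900.0),
    ("salaried", 4000.0, 1150.0),
    ("retired", 2000.0, 640.0),
]


def spend_by_group(records: List[Tuple[str, float, float]]) -> Dict[str, float]:
    """Total card spend for each customer group."""
    totals: Dict[str, float] = defaultdict(float)
    for group, _income, spend in records:
        totals[group] += spend
    return dict(totals)


def pearson(xs: Sequence[float], ys: Sequence[float]) -> float:
    """Pearson correlation coefficient between two equal-length series."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two series of equal length with at least 2 points")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    # Sums of cross-products and squared deviations from the means.
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for a constant series")
    return sxy / math.sqrt(sxx * syy)


totals = spend_by_group(RECORDS)
r = pearson([rec[1] for rec in RECORDS], [rec[2] for rec in RECORDS])
print(totals)
print(f"income vs. spend correlation: {r:.3f}")
```

A coefficient close to 1 or -1 suggests a strong linear relationship between the two indicators, while a value near 0 suggests little linear relationship. It does not by itself establish cause.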
Issues with using and administrating data mining products
Most data mining work is done using tools that carry out the job required by the users of the system. These tools build whatever statistical model the user requires. Data mining exercises generally help industries ascertain the trends, patterns and relationships present in the data. This helps companies with, for instance, market segmentation, fraud detection, direct marketing and customer churn. All of this helps companies recognise where the market is moving and organise their internal plans accordingly. This would also let the businesses