Thank you for subscribing!

It's great to feel loved.

What Is Data Dredging?

Data dredging alternatively referred to as data mining is a practices which involves the analyzation of large volumes of data without seeking any possible relationships between the analyzed data.

In contrast, the traditional or conventional scientific method of data dredging begins with the hypothesis and then extends through the stages of data examination.

Alternatively conducted for unethical purposes, data dredging is a data mining process that possibly circumvents the traditional techniques of data mining which may then results in premature conclusions.

In the simplest sense, data dredging is described as the act of seeking information from a set of data than it actually contains.

What Does Data Dredging Means in Your Business

Let’s imagine that you want to evaluate the fact that some subsets of your prospects or customers are more likely to upgrade than others. You then tested a specific variable or characteristics of one of your customers and the results of the test revealed that there seems to be a statistical significance.

As a modern data-savvy individual, you then begin to dredge through all the available data, while you are conserving a huge leap of information and characteristics about your customers.

Then, you proceed to develop a hypothesis from each one based on some certain criteria’s such as revenue brackets, geographic region and so forth. You continued to do this until you are able to attain a jackpot and discover that one of the hypotheses is significant. For instance, customers in North London are more likely to upgrade.

The truth is, that conclusion isn’t entirely positive. If you run that experiments continuously, chances are that you’ll discover a correlation that would seem statistically significant but it would still be false positive. Saying that particular correlation is statistically significant might be a fallacy because you are actually data dredging.

The next step is to subject sufficient number of hypothesis to tests. Averagely, you can test up to 20 hypothesis if you are utilizing the standard significance threshold of P=0.95.

Certainly, you’ll discover that some of these tested hypotheses will be statistically significant but misleading. In reality, virtually all data set with any degree of randomness has the possibility of containing some forms of false correlations.

Wendy Gooseberry
whatagraph
Applications and Examples of Big Data from Real Life
Big Data and our ability to effectively capture it has had a significant impact on the corporate landscape. There are multiple big data use cases across all sorts of sectors and industries as it provides useful inputs for any growth-centric culture.
whatagraph
Business Intelligence vs. Data Analytics
Business intelligence and data analytics are not the same thing and using them interchangeably can cause confusion.
whatagraph
5 Strategic Steps on How to Analyse Data
The volume of data you can source from different sources determines the insights you can gain about how effective your business processes are working. It can also position your team to collaborate in alignment with future trends.
In this PPC audit guide, we’ll walk you through the methods to set up your PPC audit while also providing a handy step-by-step PPC audit checklist. With these audit checklist, you’ll have an insight into the areas that need improvements, expansion, and advanced optimization.
Read more...
Mike Bennet
May 27, 2020 4 min read
If you can analyze your search term reports regularly or periodically, you’ll be sure that you aren’t wasting your advertising dollars on irrelevant search queries. Apart from that, you’ll also discover new keyword data and opportunities that you can incorporate into your campaigns.
Read more...
Wendy Gooseberry
May 27, 2020 2 min read
Sending one message to your entire subscribers consistently is an act that has the potentials of diminishing your brand reputation. Apart from that, it depletes an email list of your subscribers as well as halt conversions.
Read more...
Gintaras Baltusevicius
May 26, 2020 3 min read