Big Data

Hamish Burke | 2025-02-20

Related to: #bigData

Course Info


What is big data?

Data so large that it cannot be stored or processed on a single computer.

The V's of Big Data (the usual 5, plus 3 extras)

  1. Volume
  2. Variety
  3. Velocity
  4. Value
  5. Veracity
  6. Viscosity
  7. Variability
  8. Visualisation

Using Big data

Feature Manipulation

A feature is relevant if it makes the different classes easier to separate.

Diagnostic vs non-diagnostic features

For a fixed number of samples, data density decreases exponentially with dimensionality (the curse of dimensionality).
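A rough way to see the density drop: split each axis of the feature space into a fixed number of bins and count how many samples land in each grid cell on average. The sketch below is only an illustration; the function name `samples_per_cell` and the choice of 10 bins per axis are assumptions, not part of the course material.

```python
def samples_per_cell(n_samples: int, bins_per_axis: int, n_dims: int) -> float:
    """Average samples per grid cell when every axis is split into equal bins."""
    n_cells = bins_per_axis ** n_dims  # cell count grows exponentially with n_dims
    return n_samples / n_cells


# Same dataset size, increasing dimensionality (illustrative numbers only).
for d in (1, 2, 3, 6):
    print(d, samples_per_cell(1_000_000, 10, d))
```

With a million samples and 10 bins per axis, one dimension gives 100,000 samples per cell, while six dimensions gives one sample per cell on average.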

Feature Selection



Single Feature Ranking

Filter Approach

Pearson's Correlation
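A minimal sketch of single feature ranking with Pearson's correlation as the filter score: each feature is scored on its own against the target, then the features are sorted by the absolute value of that score. The helper names (`pearson`, `rank_features`) and the toy data are made up for illustration. Treating a constant feature as score 0 is an assumption too, since the correlation is undefined in that case.

```python
import math


def pearson(x, y):
    """Pearson's correlation coefficient between two equal-length sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        return 0.0  # assumption: constant input is scored as 0 (undefined otherwise)
    return cov / math.sqrt(var_x * var_y)


def rank_features(features, target):
    """Score each feature independently and sort by |r|, best first."""
    scores = {name: abs(pearson(col, target)) for name, col in features.items()}
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


# Hypothetical toy data: three features and a binary class label.
features = {
    "f_linear": [1, 2, 3, 4, 5, 6],
    "f_noisy": [2, 1, 4, 3, 6, 5],
    "f_constant": [7, 7, 7, 7, 7, 7],
}
target = [0, 0, 0, 1, 1, 1]

ranking = rank_features(features, target)
for name, score in ranking:
    print(f"{name}: {score:.3f}")
```

Pearson's correlation only measures linear association, so a feature with a strong non-linear relationship to the target can still score close to 0.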

Mutual Information
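Mutual information can be used as a filter score in the same way. For discrete variables it is I(X; Y) = sum over (x, y) of p(x, y) * log2( p(x, y) / (p(x) p(y)) ), estimated here from simple counts. This is a sketch under the assumption that both the feature and the class label are discrete; continuous features would need binning or a different estimator first. The function name and example data are hypothetical.

```python
import math
from collections import Counter


def mutual_information(x, y):
    """Mutual information in bits between two discrete sequences, from counts."""
    n = len(x)
    count_x = Counter(x)
    count_y = Counter(y)
    count_xy = Counter(zip(x, y))
    mi = 0.0
    for (a, b), c in count_xy.items():
        p_xy = c / n
        # p(x, y) / (p(x) * p(y)) written in terms of raw counts
        ratio = (c * n) / (count_x[a] * count_y[b])
        mi += p_xy * math.log2(ratio)
    return mi


# Hypothetical toy data: one feature that determines the label, one unrelated.
labels = [0, 0, 1, 1]
informative = ["a", "a", "b", "b"]
irrelevant = ["a", "b", "a", "b"]

print(mutual_information(informative, labels))  # feature fully determines the label
print(mutual_information(irrelevant, labels))   # feature is independent of the label
```

Unlike Pearson's correlation, mutual information is not limited to linear relationships, because it is zero only when the feature and the label are statistically independent.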