Big Data

Hamish Burke | 2025-02-20

Related to: #bigData


Course Info

People


What is big data?

Data so large that it can't be stored or processed on a single computer

The V's of big data (the classic 5 V's are items 1 to 5; items 6 to 8 are later additions)

  1. Volume
  2. Variety
  3. Velocity
  4. Value
  5. Veracity
  6. Viscosity
  7. Variability
  8. Visualisation

Using Big data

Feature Manipulation

A feature is relevant if it makes the different classes easier to separate

Diagnostic vs non-diagnostic features

For a fixed number of samples, data density decreases exponentially with dimensionality (the curse of dimensionality)
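
A rough sketch of why density drops so fast, assuming each feature axis is split into the same number of bins (the helper name and the numbers 1000 and 10 are illustrative, not from the course):

```python
def density_per_cell(n_samples: int, bins_per_axis: int, n_dims: int) -> float:
    """Average number of samples per grid cell for a fixed sample count."""
    # The number of cells grows as bins_per_axis ** n_dims,
    # so the average density shrinks exponentially with n_dims.
    n_cells = bins_per_axis ** n_dims
    return n_samples / n_cells


# 1000 samples, 10 bins per axis, increasing dimensionality.
densities = {d: density_per_cell(1000, 10, d) for d in (1, 2, 3, 5)}

for d, rho in densities.items():
    print(f"{d} dims: {rho:g} samples per cell")
```

With the sample count held fixed, every extra dimension divides the average density by the number of bins per axis, which is why more features usually need far more data.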

Feature Selection

Terms

Why?

Single Feature Ranking

Filter Approach

Pearson's Correlation
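
A minimal sketch of single feature ranking with Pearson's correlation as the filter score. The feature names, the toy data, and the choice to score a constant feature as 0 are assumptions made for illustration only:

```python
import math


def pearson(x, y):
    """Pearson's correlation coefficient between two equal-length sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        # Correlation is undefined for a constant sequence;
        # scoring it as 0 (uninformative) is an assumption of this sketch.
        return 0.0
    return cov / math.sqrt(var_x * var_y)


def rank_features(features, target):
    """Score each feature on its own by |r| with the target, highest first."""
    scores = {name: abs(pearson(values, target)) for name, values in features.items()}
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)


# Hypothetical toy data: three features and a binary class label.
features = {
    "f_linear": [1.0, 2.0, 3.0, 4.0, 5.0, 6.0],
    "f_noise": [3.0, 1.0, 2.0, 3.0, 1.0, 2.0],
    "f_constant": [7.0, 7.0, 7.0, 7.0, 7.0, 7.0],
}
target = [0, 0, 0, 1, 1, 1]

ranking = rank_features(features, target)
for name, score in ranking:
    print(f"{name}: {score:.3f}")
```

Pearson's correlation only measures linear dependence, so a feature that relates to the class in a non-linear way can still score close to 0.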

Mutual Information
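
A minimal sketch of mutual information between a discrete feature and a class label, estimated from counts. It assumes both variables are already discrete (continuous features would need binning first), and the toy data is made up for illustration:

```python
import math
from collections import Counter


def mutual_information(x, y):
    """Mutual information I(X; Y) in bits, estimated from observed frequencies."""
    n = len(x)
    count_x = Counter(x)
    count_y = Counter(y)
    count_xy = Counter(zip(x, y))
    mi = 0.0
    for (a, b), c in count_xy.items():
        p_ab = c / n
        p_a = count_x[a] / n
        p_b = count_y[b] / n
        # Only observed (a, b) pairs contribute; unobserved pairs add 0.
        mi += p_ab * math.log2(p_ab / (p_a * p_b))
    return mi


label = [0, 0, 1, 1]
informative = [0, 0, 1, 1]    # identical to the label
uninformative = [0, 1, 0, 1]  # independent of the label

mi_informative = mutual_information(informative, label)
mi_uninformative = mutual_information(uninformative, label)
print(mi_informative, mi_uninformative)
```

Unlike Pearson's correlation, mutual information does not assume a linear relationship, so it can also pick up non-linear dependence between a feature and the class.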