Getting ready

We will use the census income dataset available at https://archive.ics.uci.edu/ml/datasets/Census+Income

The dataset has the following characteristics:

  • Number of instances: 48,842
  • Number of attributes: 14

The following is a list of attributes:

  • Age: continuous
  • Workclass: text
  • fnlwgt: continuous
  • Education: text
  • Education-num: continuous
  • Marital-status: text
  • Occupation: text
  • Relationship: text
  • Race: text
  • Sex: female or male
  • Capital-gain: continuous
  • Capital-loss: continuous
  • Hours-per-week: continuous
  • Native-country: text