Our project used the following datasets:

  • Airline Origin and Destination Survey (DB1B) by the Bureau of Transportation Statistics

    • The Airline Origin and Destination Survey (DB1B) is a 10% sample of airline tickets from reporting carriers collected by the Office of Airline Information of the Bureau of Transportation Statistics. Data includes origin, destination and other itinerary details of passengers transported. This database is used to determine air traffic patterns, air carrier market shares and passenger flows.

    We use this dataset for the main charcteristics of a given flight ticket.

  • Air Carrier Statistics (Form 41 Traffic)- All Carriers (T-100) (US only segment) by the Bureau of Transportation Statistics

    • The Air Carrier Statistics database, also known as the T-100 data bank, contains domestic and international airline market and segment data. Certificated U.S. air carriers report monthly air carrier traffic information using Form T-100. The data is collected by the Office of Airline Information, Bureau of Transportation Statistics.

    We use this dataset for getting a orgin and destination pair statistics, for example: flight counts, passengers count, seat counts and seat ratios.

  • Median Household Income and Demographics per Metropolitian Area by the US Census Bureau

    • The United States Census Bureau, officially the Bureau of the Census, is a principal agency of the U.S. Federal Statistical System, responsible for producing data about the American people and economy.

    We use the median household income and demographics of a metropolitain area provided by the Bureau to determine which airports are in the protected groups.