About

Some bits and pieces about this project...

Our Goal:

Australia's international education activities generate over $15 billion of export income annually. The education industry is important for Australian and the well being of the international students is the key point to tighten the relationship. Our hack aims to help tackle the seven key issues which identified by the NSW Government for enhancing the well being of the international students. 

Novel Data Sets Generated:

1. One-to-one travel zone (TZ) to LGA mapping.
2. TZ-to-TZ travel time via the mode of public transportation.
3. TZ-to-School travel time via the mode of public transportation.
4. TZ-to-School travel time via the mode of push bike.

Methodology

We first gathered data for every university location and geocoded each address to a lat/lng point. In the meantime we imported TfNSW's newly released GTFS files, describing public transport time tables, into the open source tool OpenTripPlanner. We then used OTP's Analyst function to create origin/destination pairs between every travel zone (to every travel zone), as well as from every university to every travel zone. In addition, we also computed the bicycle time from every travel zone to each university, placing priority on cycling on 'safe' roads. Because everything is spatial, we are able to display only travel zones located within ~45m of each campus, by public transport

After the O/D pairs were generated we imported ABS' geographic information (ASGS 2011) into PostGIS. We also imported the travel zone shapefiles from the Bureau of Transport Statistics. Because all of the information was in a spatial database we could do neat queries like finding out where the centre of each travel zone intersects an LGA, which allowed us to build a relation between the different metrics. Every hospital, supermarket and clinic was imported into PostGIS from OpenStreetMap, and we ran spatial queries from the centre of every travel zone to count how many services were within walking distance.

With less than 6 hours to go we put a frontend to all the generated data, including building an API to access information about each travel zone when clicked. Visualising the data with Leaflet was a trivial activity.

Our database is about 4GB large, which is almost entirely spatial data. We are quite proud of the amount of spatial information (from over five sources).

How do we calculate the liveability metrics?

1.Travel time by public transportation (or bike) from the designated school were computed for each travel zone and then given a score from 1 to 100. Higher score for shorter commute.
2.Crime incidents in 2011 were matched with population data by LGA from ABS and recalculated as the population-based crime rate by LGA. A 1 to 100 score was then assigned to each LGA and populated to match each travel zone. Higher score for safer neighborhoods. 
3.Accessibility of service facilities (i.e. public library, health facility and supermarket) were calculated based on a 2 km radius distance for each travel zone. A score of 0, 33, 66 or 100 was then assigned for each travel zone based on the total number of the facilities near by. 
4.The average rent cost (house and unit combined) by LGA were calculated based on the report for the 1st quater in 2013. The data then populated for each travel zone and given a score from 1 to 100. Note: Public data were only availibly for 57 LGA, therefore, we did not factor in the rental cost for the area lack of the data. 
5.The overall score then calculated by averaging the scores from the above four(or three for some area) criterions. 

Tools we used:

1.Python 
2.Postgreaql 
3.SQLAlchemy 
4.GEOAlchemy 
5.PostGIS 
6.jQuery 
7.QGIS  8.OpenTripPlanner 
9.Leaflet 
10.Twitter Bootstrap 
11.Django 
12.Apache 
13.Ubuntu (AWS cheers!) 
14.And Excel. 

Major Achievements​:

1. Public transportation time.
2. Meshed multiple data sets from 8 agencies (and potential to include more!).
3. Create the liveability metric for international students.
4. Our website is designed to tackle the seven key issues which are identified by the NSW Government to impact the well-being of international students.

List of Data Sets Were Meshed:

1. Travel zone spatial data from Australian Bureau of Statistics.
2. GTFS Timetable data from Transportation of NSW.
3. ASGS 2011 spatial and population data from Australian Bureau of Statistics.
4. Rent & sale report from NSW Department of Housing.
5. Record crime data base - NSW from Department of Justice & Attorney General.
6. Public libraries in NSW directory from NSW State Library.
7. Public health care facilities from NSW Ministry of Health.
8. Clinics, doctors, grocery and supermarkets location from Openstreet map.

Future Potentials:

There are a couple features that we'd like to add (had we have just a few more extra hours.....)

1. The frame work of our hack is easy to adjust for different target users (e.g. add childcare, primary education information for families of overseas talents)
2. Add the community diversity profile and other service facilities.
3. Flicker integration with travel zones.
4. Better events integration with travel zones.
5. Multiple languages support. My better half has already translated the templates.
6. Add more information when the TZ is clicked. Have data, ran out of time...