This repository contains a dataset of thousands of iOS and Android apps whose code is available on GitHub. It also includes a script to fetch the latest data, if required. Important note: this dataset only contains basic apps that are developed purely on the native frameworks.
To access the current dataset:-
- For Android, look at `Android/androidApps.json`.
- For iOS, look at `iOS/iosApps.json`.
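As a minimal sketch of how the dataset files might be inspected: the README does not describe the JSON schema, so the snippet below only parses a file and reports its top-level shape. The helper names `load_apps` and `summarize` are hypothetical and not part of the repository.

```python
import json
from pathlib import Path


def load_apps(path):
    """Parse one of the dataset files and return the decoded JSON value."""
    with Path(path).open(encoding="utf-8") as handle:
        return json.load(handle)


def summarize(data):
    """Describe the top-level shape without assuming any particular schema."""
    if isinstance(data, (list, dict)):
        return f"{type(data).__name__} with {len(data)} top-level entries"
    return type(data).__name__


if __name__ == "__main__":
    # Paths are taken from the README; run this from the repository root.
    for dataset in ("Android/androidApps.json", "iOS/iosApps.json"):
        if Path(dataset).exists():
            print(dataset, "->", summarize(load_apps(dataset)))
        else:
            print(dataset, "not found; run this from the repository root")
```

Checking the top-level shape first is a cheap way to learn whether the file is a list of apps or a keyed object before writing any code that depends on specific fields.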
To fetch a new dataset:-
- Create a personal access token (PAT) for authentication.
- Copy the token into `Android/query.py` or `iOS/query.py`, whichever you need. To find where to paste it, search for the method `runQuery(query)` and put your PAT in `'Authorization': 'Token PASTE_HERE'`.
- If you need a customized query, search for the method `query*Apps(outputFile*Path)` and replace the value of the `query` attribute with your own query.
- Execute `query.py` by invoking `python3 query.py -o your_output_path` in a terminal.
- If you "really" wish to have all commit hashes for all repositories, uncomment the `queryCommits` method call in `main()` and run `query.py`.
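The two pieces the steps above touch, the `Authorization` header and the `-o` option, can be sketched as follows. This is not the code in `query.py`: the endpoint `API_URL` (GitHub's GraphQL API), the JSON payload shape, and the helper names `build_request` and `parse_args` are assumptions made for illustration. Only the header format `'Token PASTE_HERE'` and the `-o` flag come from the README.

```python
import argparse
import json
import urllib.request

# Assumption: the script talks to GitHub's GraphQL endpoint; the README does not name the URL.
API_URL = "https://api.github.com/graphql"


def build_request(query, token):
    """Build (but do not send) a POST request carrying the PAT in the Authorization header."""
    payload = json.dumps({"query": query}).encode("utf-8")
    return urllib.request.Request(
        API_URL,
        data=payload,
        headers={
            "Authorization": f"Token {token}",  # same format as 'Token PASTE_HERE'
            "Content-Type": "application/json",
        },
        method="POST",
    )


def parse_args(argv=None):
    """Parse the -o option used in `python3 query.py -o your_output_path`."""
    parser = argparse.ArgumentParser(description="Fetch a fresh app dataset.")
    parser.add_argument("-o", dest="output_path", required=True,
                        help="where to write the results")
    return parser.parse_args(argv)


if __name__ == "__main__":
    args = parse_args()
    request = build_request("{ viewer { login } }", "PASTE_HERE")
    print("Would POST to", request.full_url, "and write results to", args.output_path)
```

Building the request separately from sending it makes it easy to check the header before any network call is made. Reading the token from an environment variable instead of pasting it into the file would also keep it out of version control.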
Some facts:-
- To fetch iOS apps, we extract all iOS apps that have the `AppDelegate` object initialized in their code. Hence the query https://github.com/search?q=appdelegate&type=Code is executed.
- To fetch Android apps, we extract all Android apps that have the `onCreate()` method implemented in their code. Hence the query https://github.com/search?l=Java&q=oncreate&type=Code is executed.
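To show how the two search URLs above are composed from their parameters, here is a small sketch. The helper name `search_url` is hypothetical; the parameter names `l`, `q`, and `type` are read directly off the URLs quoted above.

```python
from urllib.parse import urlencode

SEARCH_URL = "https://github.com/search"


def search_url(term, language=None):
    """Compose a GitHub code-search URL for a term, optionally limited to one language."""
    params = {}
    if language is not None:
        params["l"] = language  # language filter, as in the Android query
    params["q"] = term          # the search term itself
    params["type"] = "Code"     # search inside code rather than repository names
    return f"{SEARCH_URL}?{urlencode(params)}"


if __name__ == "__main__":
    print(search_url("appdelegate"))
    print(search_url("oncreate", language="Java"))
```

Swapping the term or the language in this way is one route to adapting the search to other entry points, for example a different lifecycle method.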