Site Data
The data sets here are used to populate the website. It's all public info, just conveniently wrapped in single JSON files.
Streaming Sites
This data is collected on the date specified in each JSON file and is based on the USA versions of the sites. Do note that it is updated irregularly; I usually do it monthly, but there is no set schedule, so keep that in mind if you use it.
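Since there is no set update schedule, it may be worth checking how old a file is before relying on it. The following is only a minimal sketch in Python: the key name `date`, the `YYYY-MM-DD` format, and the sample contents are all assumptions made for illustration, so check the actual JSON file and adjust accordingly.

```python
import json
from datetime import date, datetime


def days_since_collected(raw_json: str, today: date, date_key: str = "date") -> int:
    """Return the age of a data set in days, based on its collection date.

    Assumes the collection date is stored under `date_key` as YYYY-MM-DD.
    Both the key name and the format are hypothetical; adapt them to the real file.
    """
    data = json.loads(raw_json)
    collected = datetime.strptime(data[date_key], "%Y-%m-%d").date()
    return (today - collected).days


# Made-up sample standing in for one of the JSON files.
sample = '{"date": "2024-01-15", "sites": []}'

age = days_since_collected(sample, today=date(2024, 3, 1))
if age > 45:
    print(f"Data is {age} days old; it may be out of date.")
```

In real use you would pass the file's text and `date.today()`; the 45-day threshold is arbitrary and only reflects the roughly monthly updates mentioned above.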