Masterdata
PyHPO ships with a full version of the HP ontology, including gene and disease associations. I try to keep this data up to date, but will frequently fall behind on the release schedule.
To build the ontology, the following 3 files are needed from the HPO masterdata:
http://purl.obolibrary.org/obo/hp.obo
http://purl.obolibrary.org/obo/hp/hpoa/phenotype_to_genes.txt
http://purl.obolibrary.org/obo/hp/hpoa/phenotype.hpoa
Updating to the most recent version
To update to the most recent version of the masterdata, you can use the following Python script:
from pyhpo.update_data import download_data
download_data()
Manually updating the masterdata
By default, the HPO masterdata is stored along the installed library. You could manually update the files in there, but that is not recommended. Instead, you can download the HPO masterdata into a local folder and specify that folder during instantiation of the Ontology:
from pyhpo import Ontology
_ = Ontology("/path/to/folder/with/masterdata/")