Fix HPO parsing for OBO 1.4 - round 2 by EddieLF · Pull Request #289 · populationgenomics/seqr

EddieLF · 2026-06-15T06:28:46Z

Follow up to #288

The problem

The regex assumed {modifier} comes after ! comment, but OBO 1.4 spec puts the modifier before the comment, and the real HPO file uses the standard ordering. With a line like is_a: HP:0032162 {xref="PMID:31677808"} ! Phenotypic abnormality, the anchored-to-end regex didn't match, split(' ! ')[0] returned HP:0032162 {xref="..."}, and that string was stored as the parent_id, leading to errors.

The fix

re.search(r'HP:\d{7}', value) extracts the parent id directly, regardless of where (or whether) modifiers/comments appear around it.
Raises with the offending line if no HPO id is found, so a malformed is_a: line surfaces a clear error instead of silently storing garbage as parent_id (which is what bit us in the last PR).
update_hpo_tests.py — added a new fixture term HP:9999003 using the OBO 1.4 standard ordering ({modifier} ! comment) so this case is covered going forward; bumped record counts to 8.

MattWellie

Oof, so it was just bad luck that the original failure we spotted was ordered a certain way?

Fix HPO parsing for OBO 1.4 - round 2

f5d4275

EddieLF requested a review from MattWellie June 15, 2026 07:14

EddieLF mentioned this pull request Jun 15, 2026

Hotfix HPO terms .obo file parsing #290

Merged

MattWellie approved these changes Jun 15, 2026

View reviewed changes

EddieLF merged commit 3f68e73 into staging Jun 15, 2026
6 checks passed

EddieLF deleted the fix_hpo_parsing2 branch June 15, 2026 08:46

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix HPO parsing for OBO 1.4 - round 2#289

Fix HPO parsing for OBO 1.4 - round 2#289
EddieLF merged 1 commit into
stagingfrom
fix_hpo_parsing2

EddieLF commented Jun 15, 2026

Uh oh!

MattWellie left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

EddieLF commented Jun 15, 2026

Follow up to #288

The problem

The fix

Uh oh!

MattWellie left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants