Skip to content

ENSGENOMIO-18 Chunking modules#34

Open
markquintontulloch wants to merge 38 commits into
mainfrom
ENSGENOMIO-18
Open

ENSGENOMIO-18 Chunking modules#34
markquintontulloch wants to merge 38 commits into
mainfrom
ENSGENOMIO-18

Conversation

@markquintontulloch
Copy link
Copy Markdown

@markquintontulloch markquintontulloch commented Mar 11, 2026

Creating draft pull request as following changes required:

Container version for Genomio needs to be updated once python code approved, merged, and new release container created.

Tests need to be updated in accordance with group's new policy. Tests now updated to avoid use of stored files

@markquintontulloch markquintontulloch marked this pull request as ready for review May 12, 2026 08:41
Copy link
Copy Markdown
Collaborator

@Dishalodha Dishalodha left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Partial review, will test the code when cluster is back.

Looks good overall, just minor updates.

- conda-forge
- bioconda
dependencies:
- ensembl-genomio=1.6.1 No newline at end of file
Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think we are in version

Suggested change
- ensembl-genomio=1.6.1
- ensembl-genomio=1.6.2

Copy link
Copy Markdown
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Will update container version once python PR is approved/merged and we can generate new package of GenomIO

Comment thread modules/ensembl/fasta/recombine/main.nf Outdated
label 'process_medium'

conda "${moduleDir}/environment.yml"
container "ensemblorg/ensembl-genomio:v1.6.1"
Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
container "ensemblorg/ensembl-genomio:v1.6.1"
container "ensemblorg/ensembl-genomio:v1.6.2-docker"

Copy link
Copy Markdown
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

As above

Comment thread tests/config/nextflow.config Outdated

singularity {
enabled = true
enabled = false
Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

why are we not enabling singularity?

Copy link
Copy Markdown
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Hangover from local testing, reverted

Comment thread modules/assets/NO_FILE Outdated
Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Do we need this folder?

Copy link
Copy Markdown
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

No, I'd initially implemented non-stub tests before we made the decision to only have stub tests - this was a hanger from that. Removed now.

Comment on lines +23 to +24
tag "modules"
tag "modules_ensembl"
Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I dont think these tags are required

Suggested change
tag "modules"
tag "modules_ensembl"

Copy link
Copy Markdown
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Removed

Copy link
Copy Markdown
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actually, reinstating as required by nf-core linting

label 'process_medium'

conda "${moduleDir}/environment.yml"
container "ensemblorg/ensembl-genomio:v1.6.1"
Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
container "ensemblorg/ensembl-genomio:v1.6.1"
container "ensemblorg/ensembl-genomio:v1.6.2-docker"

Copy link
Copy Markdown
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

As above

Comment thread tests/conftest.py Outdated
Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I might be wrong but do we need to add this to genomio?

Copy link
Copy Markdown
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Hangover from non-stub tests - removed

- conda-forge
- bioconda
dependencies:
- ensembl-genomio=1.6.1
Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
- ensembl-genomio=1.6.1
- ensembl-genomio=1.6.2

Copy link
Copy Markdown
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

As above

Comment thread modules/ensembl/fasta/recombine/main.nf Outdated
}

def out_fasta = "${meta.id}.fa"

Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change

Copy link
Copy Markdown
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Removed

Comment thread modules/ensembl/fasta/split/main.nf Outdated
Comment on lines +22 to +24
container "${workflow.containerEngine in ['singularity', 'apptainer'] && !task.ext.singularity_pull_docker_container
? 'https://depot.galaxyproject.org/singularity/ensembl-genomio:1.6.1--pyhdfd78af_0'
: 'biocontainers/ensembl-genomio:1.6.1--pyhdfd78af_0'}"
Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
container "${workflow.containerEngine in ['singularity', 'apptainer'] && !task.ext.singularity_pull_docker_container
? 'https://depot.galaxyproject.org/singularity/ensembl-genomio:1.6.1--pyhdfd78af_0'
: 'biocontainers/ensembl-genomio:1.6.1--pyhdfd78af_0'}"
container "ensemblorg/ensembl-genomio:v1.6.2-docker"

Copy link
Copy Markdown
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

As above

- conda-forge
- bioconda
dependencies:
- ensembl-genomio=1.6.1
Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
- ensembl-genomio=1.6.1
- ensembl-genomio=1.6.2

Copy link
Copy Markdown
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

As above

Comment thread modules/ensembl/fasta/recombine/main.nf Outdated
Comment on lines +22 to +24
container "${workflow.containerEngine in ['singularity', 'apptainer'] && !task.ext.singularity_pull_docker_container
? 'https://depot.galaxyproject.org/singularity/ensembl-genomio:1.6.1--pyhdfd78af_0'
: 'biocontainers/ensembl-genomio:1.6.1--pyhdfd78af_0'}"
Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

update container to version 2

Copy link
Copy Markdown
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

As above

Comment thread modules/ensembl/fasta/recombine/main.nf Outdated

output:
tuple val(meta), path("${meta.id}.fa"), emit: recombined_fasta
tuple val("${task.process}"), val('fasta_recombine'), eval('echo 1.6.1'), emit: versions_fasta_recombine, topic: versions
Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

maybe eval('echo 1.6.1') shouldn't be hard coded here, we can capture genomio version or the module version.

Copy link
Copy Markdown
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Updated to use genomio version

Comment thread modules/ensembl/fasta/split/main.nf Outdated
output:
tuple val(meta), path("splits/**/*.fa"), emit: fastas
tuple val(meta), path("splits/*.agp"), emit: agp, optional: true
tuple val("${task.process}"), val('fasta_split'), eval('echo 1.6.1'), emit: versions_fasta_split, topic: versions
Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

same as before

Copy link
Copy Markdown
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Updated to use genomio version

Comment thread modules/ensembl/fasta/split/main.nf Outdated
tuple val("${task.process}"),
val('fasta_split'),
eval(params.ensembl_genomio_version_cmd),
eval("python -c 'from importlib.metadata import distributions; print(next((dist.version for dist in distributions() if dist.metadata[\"Name\"].lower().replace(\"_\", \"-\") == \"ensembl-genomio\"), \"unknown\"))'"),
Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Will it be possible to use fasta_split --version directly since I guess container should pick it up

Copy link
Copy Markdown
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Good point. I hadn't implemented a version parameter on fasta_split or the other python packaged. I have now added it to the current ensembl-genomio PR, so will update this to use that once that PR has been approved/merged.

Copy link
Copy Markdown
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

In fact, updated now since genomio-repeatmasker is using PR branch of ensembl-genomio for CICD and this branch can't be merged until containers updated. Same change made for features_combine_json.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants