gesel
Validating Gesel database files
Validate Gesel database and gene files.
Functions
| void | validate_database (const std::string &prefix, uint64_t num_genes) |
| uint64_t | validate_genes (const std::string &prefix, const std::vector< std::string > &types) |
| uint64_t | validate_genes (const std::string &prefix) |
| void | validate_database (const std::string &prefix, uint64_t num_genes) | inline |
Validate Gesel database files for a particular species. This checks all files for validity and consistency except for the gene mapping files (which are validated by validate_genes()). Any invalid formatting or inconsistency between files will result in an error.
| prefix | Prefix for the Gesel database files. This should be of the form <DIRECTORY>/<SPECIES>_, where <SPECIES> is an NCBI taxonomy ID. |
| num_genes | Total number of genes for this species. |
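The prefix format above can be illustrated with a minimal sketch. The helper `make_gesel_prefix` is hypothetical and not part of gesel; it only shows how a directory and an NCBI taxonomy ID might be joined into a string of the form <DIRECTORY>/<SPECIES>_.

```cpp
#include <cassert>
#include <filesystem>
#include <string>

// Hypothetical helper (an assumption, not part of gesel): joins a directory
// and an NCBI taxonomy ID into a prefix of the form <DIRECTORY>/<SPECIES>_.
inline std::string make_gesel_prefix(const std::filesystem::path& directory,
                                     const std::string& species) {
    assert(!species.empty()); // a taxonomy ID such as "9606" is expected
    // generic_string() always uses '/' as the directory separator.
    return (directory / (species + "_")).generic_string();
}
```

For example, `make_gesel_prefix("db", "9606")` yields `"db/9606_"`, which is the kind of string expected for the prefix argument.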
| uint64_t | validate_genes (const std::string &prefix) | inline |
Overload of validate_genes() that does not take an explicit list of gene name types. Instead, it scans the directory for all files whose names start with prefix and end with ".tsv.gz".
| prefix | Prefix for the Gesel gene files. This should be of the form <DIRECTORY>/<SPECIES>_, where <SPECIES> is an NCBI taxonomy ID. |
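The directory scan described above could look roughly like the following sketch. It is not gesel's actual implementation; `find_gene_types` is a hypothetical name, and the sketch assumes that the part of each file name between the prefix and ".tsv.gz" is the gene name type. Matching is by name only, so every entry in the directory that fits the pattern is reported.

```cpp
#include <algorithm>
#include <cassert>
#include <filesystem>
#include <string>
#include <vector>

// Hypothetical helper (an assumption, not part of gesel): returns the <TYPE>
// part of every directory entry named <prefix><TYPE>.tsv.gz, sorted by name.
inline std::vector<std::string> find_gene_types(const std::string& prefix) {
    assert(!prefix.empty());
    namespace fs = std::filesystem;

    const fs::path full(prefix);
    fs::path dir = full.parent_path();
    if (dir.empty()) {
        dir = "."; // prefix without a directory part: scan the working directory
    }
    const std::string stem = full.filename().string(); // e.g. "9606_"
    const std::string suffix = ".tsv.gz";

    std::vector<std::string> types;
    for (const auto& entry : fs::directory_iterator(dir)) {
        const std::string name = entry.path().filename().string();
        // Skip names too short to hold a non-empty type between stem and suffix.
        if (name.size() <= stem.size() + suffix.size()) continue;
        if (name.compare(0, stem.size(), stem) != 0) continue;
        if (name.compare(name.size() - suffix.size(), suffix.size(), suffix) != 0) continue;
        types.push_back(name.substr(stem.size(), name.size() - stem.size() - suffix.size()));
    }
    std::sort(types.begin(), types.end()); // directory iteration order is unspecified
    return types;
}
```

Sorting the result makes the output deterministic, since the iteration order of `std::filesystem::directory_iterator` is unspecified.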
| uint64_t | validate_genes (const std::string &prefix, const std::vector< std::string > &types) | inline |
Validate Gesel gene mapping files for a particular species. Any invalid formatting or inconsistency between files will result in an error.
| prefix | Prefix for the Gesel gene mapping files. This should be of the form <DIRECTORY>/<SPECIES>_, where <SPECIES> is an NCBI taxonomy ID. |
| types | Vector of gene name types, e.g., "ensembl", "symbol". This should contain at least one value. |