CCP-NC database - Frequently Asked Questions

Are there standard parameters I should use for my data to be acceptable? No. Depositors should not worry about whether their calculations are “good enough” to be included in the database. There is sufficient information contained in the magres file for users of the database to judge the “quality” of the results.

Can I change the metadata associated with a file? Yes. This is encouraged, for example, to add a publication DOI. Note that any change will create a new version. Changing the metadata of multiple files in block is still not supported, but is planned for a future version.

How do I contact the database administrators? For requests that cannot be met through the user interface, or for any problems that are not addressed in the FAQs, please use the webform on the CCP-NC website.

Can I deposit a set of results together rather than individually? Yes. There is a template CSV spreadsheet that should be completed with the name of each file (first column) and file-specific metadata in the remaining columns; default values for metadata are taken from the submission form. The CSV file must be compressed with the magres files in either a zip or tarball file, and the archive is then submitted via the submission form. Note that the normal limit for bulk uploads is 999 files (see below).

I'm trying to upload a full archive, how do the extref_* entries in the CSV file work? These columns are meant to store external reference codes with which the same structures may appear in different databases. For example, if your structure originates from the Cambridge Structural Database, here is where you would put a reference to it. When uploading a single file this control provides a dropdown to pick the database, but in the CSV this requires three different values:
- extref_type contains the conventional name of the database. The currently accepted codes are csd (Cambridge Structural Database), icsd (Inorganic Crystal Structure Database), COD (Crystallography Open Database) and other for anything else. This field is case-insensitive;
- extref_other contains the name of whatever database you used that doesn't fall within the previous options. It will be ignored if you didn't pick other as your database;
- extref_code contains an actual code (such as a CSD number or refcode) that can be used to identify the structure within the database you're referencing.

Can I deposit unpublished data? Yes. Although most data will be associated with publications (via a DOI), it is also useful to include unpublished data, e.g. associated with a PhD thesis. In this case, leave the DOI field blank, but please include a link to the associated document (e.g. a deposited thesis link) in the free text Notes field.

Can I download a large number of entries? Currently no. The functionality will be added in a future version, with some limitations for files whose license doesn't allow it.

Does it matter if similar calculations have been deposited already? No, indeed, new calculations are encouraged.

How can I refer to a dataset? Each individual magres file is assigned a unique numerical ID on deposit called the MRD number (for MagRes files Database), which can be used to refer to the dataset. The MRD numbers will generally be sequential if depositing a set of magres files. Note that files downloaded from the database will be named, for example, MRD000001v1.magres, where “v1” indicates version 1 of the deposit.

How do I obtain the metadata associated with the dataset? A search for a dataset will produce a landing page containing the metadata (including any version history) and a link to download the original submitted file and/or the metadata as a text file. For bulk downloads, the metadata will be provided as a CSV file in the archive.

I have a very large number of calculation results. Can this be deposited? In principle yes. Normal uploads are limited to 999 records, but users can use the webform to discuss larger deposits.

I have discovered a problem with the data I deposited. Can I change it? Yes. Entries cannot be changed once submitted, but you create a new version of a data set. A search will always return the most recent version of a dataset.

Is the submitted file altered on deposit? Is the submitted file altered on deposit? No. The file contents are fixed, although the name used for the downloaded data will use the record number e.g. MRD0001256.magres.

How long does my login last? Login (using ORCID id) stores a cookie and should be persistent, but may be canceled after extended periods of time. If that happens, simply log out and log sign back in.

My calculations are of hypothetical structures. Can these be included? These are perfectly fine as they may be valuable, e.g., for machine learning applications. See query about bulk uploads.

My calculations involve results from incorrect structures. Can these be included? Yes, any valid calculation is acceptable, but do use the Notes field to highlight the fact that the structure is thought to be incorrect.

What data formats are accepted? Only magres files using the CCP-NC-developed magres format are accepted. These are produced by both the major DFT codes which calculate NMR parameters for periodic solids (CASTEP and Quantum ESPRESSO).

What licence is used for database entries? Users have the option of three licences when submitting database entries:
- PDDL (Public Domain Dedication and License) v 1.0 (default)
- ODC-By (Open Data Commons Attribution License) v 1.0
- CC-By (Creative Commons Attribution License) v 4.0
The default, and strongly preferred, PDDL licence essentially puts the data in the public domain. Users who prefer (or are required) to use a more restrictive licence can use one of the other two licences, which means that the data licensed cannot be used without attribution, e.g. by citing the reference paper and original author. Users requesting an attribution licence are strongly encouraged to provide suitable attribution text in the Notes field. Database entries that have an attribution licence can only be downloaded individually and will not be available through bulk downloads. Note that data that has been bulk downloaded can be used freely, e.g. for machine learning training, without needing attribution. We cannot accept data which requires more restrictive licences than the ones offered.

What metadata needs to be supplied when uploading a file? The following metadata can be supplied for each file:
- Name. This is the primary search text. It is recommended to keep this simple and use the structural keywords for qualifiers, such as polymorph. Hence use “alanine” as the name and “beta” as a structural qualifier for “alanine form beta”. Note that the database will only accept simple alphanumeric characters in text. Users are encouraged to use IUPAC naming for consistency.
- Keywords: An optional list of structural qualifiers.Please use alphanumeric characters rather than symbols wherever possible, e.g. “beta” instead of β. More than one keyword can be supplied where this would help users to find a specific entry.
- Database number: Optional reference code for starting structure, e.g. Cambridge Structural Database, or ICSD reference code.
- DOI: Optional DOI to publication associated with record. This should be an actual DOI (generally text beginning 10.XXX) rather than a link. Use the Notes field instead if you need to refer to a different type of reference, e.g. a link to a PhD thesis.
- Notes: Free text information. This should be used to provide any information that might be helpful to users, e.g. requested attribution text. Note this text is not directly searchable.

When should I deposit data? The database is intended as a convenient means of accessing published magnetic resonance results. One option is to upload the data when a publication DOI is available. You are encouraged, however, to upload data prior to publication, in order to be able to refer to record numbers in the paper, and add the DOI when available (this will create a new version). Note that it is important to add the DOI when available since database users have the option of restricting searches to data associated with a DOI.

  Go Back