CCP-NC database - Frequently Asked Questions


DEPOSITING DATA

When should I deposit data? The database is intended as a convenient means of accessing published magnetic resonance results. One option is to upload the data when a publication DOI is available. You are encouraged, however, to upload data prior to publication, in order to be able to refer to record numbers in the paper, and add the DOI when available (this will create a new version). Note that it is important to add the DOI when available since database users have the option of restricting searches to data associated with a DOI.

Are there standard parameters I should use for my data to be acceptable? No. Depositors should not worry about whether their calculations are “good enough” to be included in the database. There is sufficient information contained in the magres file for users of the database to judge the “quality” of the results.

I have a very large number of calculation results. Can this be deposited? In principle yes. Normal uploads are limited to 999 records, but users can use the webform to discuss larger deposits.

Can I deposit a set of results together rather than individually? Yes. Complete the template CSV spreadsheet with the name of each file in the first column and file-specific metadata in the remaining columns; default values for any metadata left blank are taken from the submission form. Bundle the CSV file with the magres files in a single zip or tarball archive, then upload the archive via the submission form. Note that the normal limit for bulk uploads is 999 files (see the question above on large deposits).
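As an illustration of the bulk-upload steps, here is a minimal Python sketch that writes a CSV with the file name in the first column and zips it together with the magres files. The column header "filename", the CSV name "metadata.csv" and the function name build_bulk_archive are assumptions made for this example, not the database's actual template; copy the real column names from the template CSV before uploading.

```python
import csv
import io
import zipfile
from pathlib import Path

MAX_FILES = 999  # normal bulk-upload limit stated in this FAQ


def build_bulk_archive(magres_paths, metadata_rows, archive_path):
    """Zip the magres files together with a CSV listing them.

    metadata_rows holds one dict per file; its keys are hypothetical
    column names standing in for those of the real template CSV.
    """
    magres_paths = [Path(p) for p in magres_paths]
    if len(magres_paths) > MAX_FILES:
        raise ValueError(f"{len(magres_paths)} files exceeds the normal limit of {MAX_FILES}")
    if len(magres_paths) != len(metadata_rows):
        raise ValueError("need exactly one metadata row per magres file")

    # Collect every metadata key used, so each row gets the same columns.
    extra_columns = sorted({key for row in metadata_rows for key in row})

    buffer = io.StringIO()
    writer = csv.writer(buffer)
    writer.writerow(["filename", *extra_columns])  # file name goes in the first column
    for path, row in zip(magres_paths, metadata_rows):
        # Blank cells are left empty so the submission-form defaults can apply.
        writer.writerow([path.name, *(row.get(col, "") for col in extra_columns)])

    with zipfile.ZipFile(archive_path, "w", zipfile.ZIP_DEFLATED) as archive:
        archive.writestr("metadata.csv", buffer.getvalue())  # hypothetical CSV name
        for path in magres_paths:
            archive.write(path, arcname=path.name)
    return archive_path
```

The size check mirrors the 999-file limit; for larger sets, the FAQ advice is to contact the administrators via the webform rather than to split the deposit.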

I'm trying to upload a full archive, how do the extref_* entries in the CSV file work? These columns are meant to store external reference codes with which the same structures may appear in different databases. For example, if your structure originates from the Cambridge Structural Database, here is where you would put a reference to it. When uploading a single file this control provides a dropdown to pick the database, but in the CSV this requires three different values:

Can I deposit unpublished data? Yes. Although most data will be associated with publications (via a DOI), it is also useful to include unpublished data, e.g. associated with a PhD thesis. In this case, leave the DOI field blank, but please include a link to the associated document (e.g. a deposited thesis link) in the free text Notes field.

Does it matter if similar calculations have been deposited already? No. New calculations are encouraged even where similar ones already exist.


HYPOTHETICAL & INCORRECT STRUCTURES

My calculations are of hypothetical structures. Can these be included? Yes. Hypothetical structures are welcome, as they may be valuable, e.g. for machine learning applications. For large sets, see the question on depositing a set of results together.

My calculations involve results from incorrect structures. Can these be included? Yes, any valid calculation is acceptable, but do use the Notes field to highlight the fact that the structure is thought to be incorrect.


METADATA & FILE FORMATS

What metadata needs to be supplied when uploading a file? The following metadata can be supplied for each file:

What data formats are accepted? Only files in the CCP-NC-developed magres format are accepted. Both of the major DFT codes that calculate NMR parameters for periodic solids (CASTEP and Quantum ESPRESSO) produce magres files.


MANAGING & UPDATING DEPOSITED DATA

Can I change the metadata associated with a file? Yes. This is encouraged, for example, to add a publication DOI. Note that any change will create a new version. Changing the metadata of multiple files in bulk is not yet supported, but is planned for a future version.

I have discovered a problem with the data I deposited. Can I change it? Yes. Entries cannot be changed once submitted, but you can create a new version of a dataset. A search will always return the most recent version of a dataset.

Is the submitted file altered on deposit? No. The file contents are fixed, although the name used for the downloaded data will use the record number e.g. MRD0001256.magres.


How does the Boolean search filter work? The basic search applies a Boolean AND condition by default. This means that when you add multiple search criteria using the button, the results will include only those entries that match all the specified conditions. Additionally, you can use the Boolean filter switch to toggle between an AND condition (default) and a NOT condition, which excludes results matching certain criteria where applicable.

How do I use the Boolean filter switch? Click the slider button next to a search criterion to toggle it between the AND condition (default) and the NOT condition.

Note: The NOT condition is available for only a few search criteria. It can exclude records with a specific associated publication DOI, records within certain ranges of magnetic shielding and electric field gradient values, records for materials with a given external database reference code, and records with a particular data distribution licence.

Can I mix AND and NOT conditions in a single search? Yes. You can use an AND condition to include multiple criteria while simultaneously applying a NOT condition to exclude specific entries. For example, you can search for entries that match a specific Chemical name and Publication doi, while excluding entries with a specific External database reference code.

An example search to try:
Search for entries with Chemical name set to ethanol (AND, the default) and Publication doi set to 10.1038/s41467-018-06972-x (AND), while excluding entries by External database reference (NOT): choose CSD from the Database name menu and enter BAYPAT in the Database code field.

Note: Feel free to change the AND and NOT conditions on the various search criteria to see how the search results change.

What happens if I don't use the Boolean filter switch? If you leave the Boolean filter switch in its default position, the search will combine all criteria using the AND condition. This ensures you only see results that match all of your specified search terms. Use of the Boolean filter switch is optional and does not have to be used if you don't need to exclude any results.
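The AND/NOT logic described above can be sketched in a few lines of Python. This is only a model of the behaviour as described in this FAQ, not the database's implementation; the Criterion class, the search function, the field names and the sample records are all hypothetical.

```python
from dataclasses import dataclass
from typing import Callable

Record = dict  # a record is modelled as a plain dict of metadata fields


@dataclass
class Criterion:
    matches: Callable[[Record], bool]  # test applied to one record
    negate: bool = False               # False = AND (default), True = NOT


def search(records, criteria):
    """Keep records that satisfy every AND criterion and no NOT criterion."""
    return [
        rec for rec in records
        if all(c.matches(rec) != c.negate for c in criteria)
    ]


# Hypothetical records, loosely modelled on the example search above.
records = [
    {"id": 1, "chemical_name": "ethanol", "doi": "10.1038/s41467-018-06972-x", "extref": ("CSD", "BAYPAT")},
    {"id": 2, "chemical_name": "ethanol", "doi": "10.1038/s41467-018-06972-x", "extref": ("CSD", "OTHER1")},
    {"id": 3, "chemical_name": "glycine", "doi": "10.1038/s41467-018-06972-x", "extref": ("CSD", "OTHER2")},
]

criteria = [
    Criterion(lambda r: r["chemical_name"] == "ethanol"),                # AND
    Criterion(lambda r: r["doi"] == "10.1038/s41467-018-06972-x"),       # AND
    Criterion(lambda r: r["extref"] == ("CSD", "BAYPAT"), negate=True),  # NOT
]

hits = search(records, criteria)
```

With no criterion negated, the function reduces to a plain AND over all criteria, which corresponds to leaving the switch in its default position.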


ACCESSING & REFERENCING DATA

Can I download a large number of entries? If a search returns multiple entries, then these can be downloaded as a single zip file. Users can either manually select results that are of interest to them and bulk download the "selection" or simply download the entire search results as a zip archive.
Note: We do not restrict file downloads based on the associated attribution licences. It is therefore the responsibility of the downloader to ensure that datasets with more restrictive licences such as ODC-By and CC-BY (see the section on Licensing & Permissions) are correctly attributed when used in their work.

How can I refer to a dataset? Each individual magres file is assigned a unique numerical ID on deposit called the MRD number (for MagRes files Database), which can be used to refer to the dataset. The MRD numbers will generally be sequential if depositing a set of magres files. Note that files downloaded from the database will be named, for example, MRD000001v1.magres, where “v1” indicates version 1 of the deposit.
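When handling downloaded files in scripts, the record number and version can be recovered from the file name. The sketch below assumes the naming pattern follows the examples quoted in this FAQ ("MRD", zero-padded digits, an optional "v" plus version number, then ".magres"); the pattern and the function name parse_mrd_filename are inferred for illustration, not an official specification.

```python
import re

# Assumed pattern: MRD + digits + optional v<version> + .magres
MRD_PATTERN = re.compile(r"^MRD(?P<number>\d+)(?:v(?P<version>\d+))?\.magres$")


def parse_mrd_filename(name):
    """Return (record_number, version); version is None if the name has none."""
    match = MRD_PATTERN.match(name)
    if match is None:
        raise ValueError(f"not an MRD file name: {name!r}")
    version = match.group("version")
    return int(match.group("number")), (int(version) if version else None)
```

The version part is optional in the pattern because the FAQ shows file names both with and without a "v" suffix.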

How do I obtain the metadata associated with the dataset? A search for a dataset will produce a landing page containing the metadata (including any version history) and a link to download the original submitted file and/or the metadata in a CSV or JSON format. For bulk downloads, the metadata will automatically be included both as a collated CSV file and as a single nested JSON file in the archive.
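For bulk downloads, the collated CSV metadata can be read directly from the zip archive without unpacking it. The following is a minimal sketch: the name and column layout of the CSV file inside the archive are not specified in this FAQ, so the code simply takes the first ".csv" member it finds, and the function name read_collated_metadata is hypothetical.

```python
import csv
import io
import zipfile


def read_collated_metadata(archive):
    """Return the rows of the first CSV file found in a bulk-download zip.

    archive may be a path or a binary file-like object.
    """
    with zipfile.ZipFile(archive) as zf:
        # Assumption: the collated metadata is the only CSV file in the archive.
        csv_names = [n for n in zf.namelist() if n.lower().endswith(".csv")]
        if not csv_names:
            raise FileNotFoundError("no CSV metadata file in archive")
        with zf.open(csv_names[0]) as raw:
            text = io.TextIOWrapper(raw, encoding="utf-8", newline="")
            return list(csv.DictReader(text))  # one dict per record
```

Each returned row is a dict keyed by the CSV header, so the metadata can be filtered or joined against the magres files by record number.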


LICENSING & PERMISSIONS

What licence is used for database entries? Users have the option of three licences when submitting database entries: PDDL, ODC-By and CC-BY.

The default, and strongly preferred, PDDL licence essentially puts the data in the public domain. Users who prefer (or are required) to use a more restrictive licence can choose one of the other two licences, under which the data cannot be used without attribution, e.g. by citing the reference paper and original author. Users requesting an attribution licence are strongly encouraged to provide suitable attribution text in the Notes field and to choose the appropriate licence in the upload form's drop-down menu.

Database entries that have an attribution licence can still be downloaded in bulk. Note that data that has been bulk downloaded can still be used for purposes such as machine learning training, as long as the appropriate attribution is provided to the original creators/owners of the datasets. We cannot accept data which requires more restrictive licences than the ones offered.
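When preparing bulk-downloaded data, e.g. for machine learning training, it can help to list the records that require attribution. The sketch below works on metadata rows represented as dicts; the field name "licence" and the function name records_needing_attribution are assumptions for this example, and the real field name should be checked against the downloaded metadata.

```python
def records_needing_attribution(rows, licence_field="licence"):
    """Return the rows whose licence is anything other than PDDL.

    Rows with a missing or empty licence are included as well, so that
    nothing is silently treated as public domain.
    """
    return [
        row for row in rows
        if row.get(licence_field, "").strip().upper() != "PDDL"
    ]
```

Treating an unknown licence as requiring attribution is a deliberately cautious choice in this sketch.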


TECHNICAL & ADMINISTRATIVE SUPPORT

How do I contact the database administrators? For requests that cannot be met through the user interface, or for any problems that are not addressed in the FAQs, please use the webform on the CCP-NC website.

How long does my login last? Login (using your ORCID iD) stores a cookie and should be persistent, but may be cancelled after an extended period of time. If that happens, simply log out and log back in.

