Database and Data Upload
This section allows users to browse all datasets registered within the interface (i.e., custom database). Additionally, users can upload new datasets and modify information regarding existing datasets.
Available data types
The interface accommodates four distinct types of data and specifies the required formatting for each.
A. Count Data / Expression Matrix Data:
The read count tables derived from RNA sequencing, proteomics, and various other sources, whether normalised or non-normalised, are considered. It is essential to note that while non-normalised data is permissible, the interface does not offer functionality for data normalisation. Ensure that the data adheres to the following criteria:
- The table must be in tab-delimited format (either tsv or .txt file), featuring gene names in the index and sample names in the columns.
- The header name (column name) containing gene names should be designated as "id" .
- The sample names must conclude with _Rep# or _rep#.
Example
B. Comparison Data:
Any dataset containing log fold changes and statistical scores, including differentially expressed gene results from RNA sequencing and outcomes of CRISPR screening, among others, suitable for generating a volcano plot, may be input into OmicsBridge. It is imperative that the data complies with the following criteria:
- The table must be formatted as tab-delimited (either tsv or .txt file) and must include headers, featuring gene names in the index.
- The header name (column name) that encompasses gene names should be designated as "id".
Example
C. scRNAseq data:
Users can browse their single-cell RNAseq data, but it must be properly processed and saved as an RDS file. See the "scRNA" section for more details.
D. Epigenetic data (bam, bed, bigwig file, etc):
Bam, bed, and bigwig files generated from ATACseq, ChIPseq, etc. can be browsed in the "Genome browser" section.
How to upload a new dataset
Users can upload new datasets in the 'Data upload' section by following these steps.
1. Upload a file.
A file can be selected or dragged and dropped into the file upload section. Make sure the file format and data format meet the requirements described above. The maximum data size to upload is 1 GB.
2. Complete the dataset information.
Do not use line breaks in any text boxes, as the database will only keep the first line. Fields marked with an asterisk (*) are required. Also, avoid using special characters (such as /,!,?, etc.).
- Dataset Name*: Denotes the name assigned to the dataset to be uploaded. Duplicate dataset names are prohibited.
- Experiment Name*: Refers to the name of the experiment to which the dataset is associated. This information aids in filtering the dataset for selection in the Database or Data Overview section.
- Data Source*: Indicates the origin or creator of the dataset.
- Data Type*: Represents the category of data, such as “DEG from RNAseq” or “CRISPR screening. “ All datasets categorised under the same Data Type must maintain identical data structures (i.e., same header/column names). This uniformity facilitates comparison across datasets (refer to "Compare across datasets" section).
- Data Class*: Select the appropriate classification for the dataset.
- Cell Line (Optional): Specifies the cell line utilised in the experiment (e.g., MCF7, THP1, Mouse Monocyte Derived Macrophages, etc.).
- Collection Date (Optional): Denotes the time period during which the dataset was collected.
- Description (Optional): Provides additional details regarding the dataset.
3. Click on ‘Add to the dataset’.
If the upload is successful, a message stating “Uploaded!” will appear adjacent to the upload button. Additionally, the newly added dataset will be displayed as the first entry in the table.
How to edit the database
Editing the database
Each cell can be edited by double-clicking. Upon the user making an edit, the change will be manifested below the table. The editing process is deemed successful once the user clicks “Save changes” and subsequently confirms the message “saved!”.
Deleting some data
Each row of the database can be selected by simply clicking on it. It is possible to make multiple selections, and the number of selected rows is displayed at the bottom of the table. By clicking “Delete selected data”, all selected rows will be removed from the database.