GC4S: A bioinformatics-oriented Java software library of reusable graphical user interface components

Hugo López-Fernández; Miguel Reboiro-Jato; Daniel Glez-Peña; Rosalía Laza; Reyes Pavón; Florentino Fdez-Riverola

doi:10.1371/journal.pone.0204474

Abstract

Modern bioinformatics and computational biology are fields of study driven by the availability of effective software required for conducting appropriate research tasks. Apart from providing reliable and fast implementations of different data analysis algorithms, these software applications should also be clear and easy to use through proper user interfaces, providing appropriate data management and visualization capabilities. In this regard, the user experience obtained by interacting with these applications via their Graphical User Interfaces (GUI) is a key factor for their final success and real utility for researchers. Despite the existence of different packages and applications focused on advanced data visualization, there is a lack of specific libraries providing pertinent GUI components able to help scientific bioinformatics software developers. To that end, this paper introduces GC4S, a bioinformatics-oriented collection of high-level, extensible, and reusable Java GUI elements specifically designed to speed up bioinformatics software development. Within GC4S, developers of new applications can focus on the specific GUI requirements of their projects, relying on GC4S for generalities and abstractions. GC4S is free software distributed under the terms of GNU Lesser General Public License and both source code and documentation are publicly available at http://www.sing-group.org/gc4s

Citation: López-Fernández H, Reboiro-Jato M, Glez-Peña D, Laza R, Pavón R, Fdez-Riverola F (2018) GC4S: A bioinformatics-oriented Java software library of reusable graphical user interface components. PLoS ONE 13(9): e0204474. https://doi.org/10.1371/journal.pone.0204474

Editor: Frederique Lisacek, Swiss Institute of Bioinformatics, SWITZERLAND

Received: August 6, 2018; Accepted: September 7, 2018; Published: September 20, 2018

Copyright: © 2018 López-Fernández et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Data Availability: The source code of GC4S is available in the following GitHub repository: https://github.com/sing-group/GC4S. Usage instructions and documentation are both available trough the official GC4S project website: http://www.sing-group.org/gc4s/.

Funding: Hugo López-Fernández is supported by a post-doctoral fellowship from Xunta de Galicia (ED481B 2016/068-0).

Competing interests: The authors have declared that no competing interests exist.

Introduction

The importance of bioinformatics and computational biology in research today is leading to the development of a growing number of complex bioinformatics tools and software packages. In this context, these applications must solve two important needs: the provision of reliable implementations of data analysis algorithms, as well as convenient interfaces enabling their effective use for conducting appropriate research [1]. Indeed, several studies highlighting the importance of applying proper software engineering practices and methodologies to systematically develop high-quality bioinformatics software have been published [2,3]. Similarly, some authors have contributed with valuable and helpful recommendations for the appropriate development of effective scientific software [4,5]. The concept of user experience when developing such bioinformatics applications is becoming increasingly important, and some studies also provide useful guidelines for improving the development of user friendly bioinformatics software [1,6]. All the approaches discussed above highlight how the usability of different types of bioinformatics applications can be improved by adopting well-established practices in usability and software engineering.

Taking the previously discussed facts into account, several frameworks, platforms and libraries for different programming languages (e.g. Java, R, Python, C++, etc.) and complementary bioinformatics areas (e.g. proteomics, genomics, etc.) were recently released with the goal of facilitating the development of successful bioinformatics applications [7–11]. This plethora of alternatives provides a reliable basis for developing novel algorithms and data analysis functionalities, allowing developers to extend them and reuse existing code. Several libraries and applications focused on scientific and biological data visualization [7,12–14] are also available. A notable example is the JSparklines library [15], which provides the capability of visualizing numbers in Java Swing tables, making use of sparklines. These data visualization libraries certainly help developers create new applications and provide excellent features to end-users. However, other aspects ensuring an effective user experience (e.g. the creation of input dialogs, configuration panels and other user interaction steps) are inadequately supported due to a lack of specific resources providing reusable Graphical User Interface (GUI) components able to help developers in the same way as many of the aforementioned frameworks and libraries.

In the same line of previous approaches, we also contributed to this collection with AIBench [16], a Java application framework for scientific software development. Since its first release, it has been successfully used to create different bioinformatics software such as LA-iMageS [17], Mass-Up [18], S2P [19,20], BioAnnote [21], BEW [22], @Note [23], ADOPS [24], DPD [25] or MLibrary [26], among others, covering a wide range of application areas including, but not limited to, proteomics, genomics and text mining. By providing the most common functionalities present in typical scientific applications, such as user parameter collection, logging, multithreaded execution, experiment repeatability, workflow management, and automatic GUI generation, AIBench allows developers to focus on the specific application logic. The wide range of successfully released bioinformatics software based on AIBench demonstrates that our framework has provided an effective alternative to increase productivity while developing high-quality source code [27].

The development of AIBench applications always relies on the straightforward implementation of three different but complementary types of programmable components: operations, data types and views. However, our long-term experience in the field has proved that GUI (a key feature in any deployment) should be improved further. As previously stated, AIBench automatically constructs the user interface by generating (i) application menus based on declared operations and (ii) input dialogs for obtaining required operation parameters. However, the view components, which must be developed in Java Swing, are the responsibility of the programmer. In this regard, we noticed that these kinds of components were being copied and pasted between code bases, including non-AIBench based applications. In doing so, developers were missing an opportunity for reusing GUI elements by sharing them between code bases and saving time in future projects.

Bearing all that in mind, and with the specific goal of overcoming these shortcomings, we initially designed, and later developed and tested, GC4S (GUI Components for Swing), an open source software library that aims to help programmers developing Java Swing based scientific applications. GC4S is a collection of high-level, extensible, bioinformatics-oriented and reusable Java GUI elements created by using Swing and SwingX low-level components. By publishing this library, developers of new applications can exclusively focus on the specific GUI requirements of their projects, relying on GC4S for generalities. This way, they can also benefit from bug fixes, updates, and improvements in GC4S. To the best of our knowledge, there is currently no other open-source, free library offering such functionalities.

The remainder of the paper is structured as follows: while Section 2 introduces our novel GC4S library discussing implementation details and essential concepts, Section 3 illustrates the usefulness of GC4S for scientific software development by presenting three complementary case studies showing how GC4S can be easily integrated with the AIBench framework or independently used for speeding up the development process of coding standalone applications. Section 4 discusses the main contributions of this work as well as the lessons we learned in the last few years. Finally, Section 5 summarizes the main conclusions extracted from this contribution and outlines future research work.

The GC4S library: Implementation and components

Implementation

GC4S 1.2 is implemented in Java 8 as an Apache Maven project to automatically build the library. The source code of the project is freely available at https://github.com/sing-group/GC4S under a GNU LGPL 3.0 License (http://www.gnu.org/copyleft/lgpl.html). This project contains six main Maven modules: (i) gc4s, containing the general-purpose GC4S components; (ii) gc4s-genomebrowser, containing a component to visualize genomes and biological data; (iii) gc4s-heatmap, containing a component to create heat map representations of numerical data; (iv) gc4s-jsparklines-factory, containing classes to facilitate the creation of JSparkLines renderers [14]; (v) gc4s-multiple-sequence-alignment-viewer, containing a component to visualize multiple sequence alignments; and (vi) gc4s-statistics-tests-table, containing a component to display datasets in customizable tables and automatically compute p-values and q-values to compare the conditions in them. Moreover, for each of these six modules, there is a *-demo module containing demos and examples.

At the implementation level, components in the main modules (hereafter, GC4S components) can be grouped into three main categories: (i) extensions, that is, components that enhance existing components in order to add functionalities; (ii) high-level components, that is, components that provide new functionalities by grouping other existing components; and (iii) utilities, components that provide service methods (e.g. builders) or resources (e.g. icons).

Components

From the programmer’s perspective, GC4S components serve three main purposes: (i) retrieve user input; (ii) display data or results; and (iii) provide different programming utilities. According to these functionalities, they are classified as Input components, Output (or Visualization) components or Utility components, respectively. Regarding the structure of the source code of the library, Table 1 shows the top-level GC4S packages, reflecting how components are grouped according to their function. In addition, Supporting Information S1 Table provides a comprehensive list of the entire GC4S library, providing a brief description of each component. Extensive Javadoc documentation is also available at http://www.sing-group.org/gc4s/javadoc. Supporting Information S1 Document provides basic documentation about the usage of the library in Maven-based projects as well as examples of some components of the gc4s module.

Download:

Table 1. GC4S library structure.

Packages are prefixed by org.sing_group.gc4s, which is ommitted to avoid redundancy.

https://doi.org/10.1371/journal.pone.0204474.t001

To demonstrate the utility and purpose of the library under consideration, this section illustrates, by way of example, how different components work and can be easily used. As mentioned above, the main purpose of GC4S is to provide a reliable set of generic GUI components that can be efficiently reused in scientific applications development, allowing programmers to focus on concrete problem details.

Genome browser

The gc4s-genomebrowser module provides a graphical component called GenomeBrowser to visualize genomes and different types of biological data, in a manner similar to the web Human Genome Browser at UCSC [28]. This component relies on the PileLine library [29] for parsing the genomic files.

The GenomeBrowser component in Fig 1 shows the visualization of a reference genome with two tracks. As the code snippet in Fig 2 shows, this component must be instantiated by providing an object of a class implementing the GenomeIndex PileLine interface. In this example, the genome index is created using the GenomeIndexBuilder provided by PileLine and then instantiated by creating a PileLineGenomeIndex object. Alternatively, the genome index may be created using samtools [30] and then instantiated by an object of the FaiGenomeIndex class. The tracks containing biological data are read from genomic position files including standard bam (http://samtools.github.io/hts-specs/SAMv1.pdf), pileup (http://samtools.sourceforge.net/pileup.shtml), BED (http://genome.ucsc.edu/FAQ/FAQformat#format1), GFF (http://genome.ucsc.edu/FAQ/FAQformat#format3) or VCF (http://www.internationalgenome.org/wiki/Analysis/vcf4.0/) formats. In the case of bam files, the corresponding .bai index must exist in the same directory as the provided bam file.

Download:

Fig 1. The GenomeBrowser component showing two tracks.

https://doi.org/10.1371/journal.pone.0204474.g001

Download:

Fig 2. Code snippet showing the instantiation of a GenomeBrowser.

Also, the range and chromosome of interest and two tracks are set programmatically.

https://doi.org/10.1371/journal.pone.0204474.g002

The toolbar in the upper area allows users to interactively browse their data by specifying the genome range and chromosome of interest, zooming in or out, and so on. Additionally, other settings such as colour, background colour, maximum number of depth levels, and other options, can be customized for each track.

Heat map

The gc4s-heatmap module provides a graphical component called JHeatMap to visualize numerical data matrices as heat maps, graphical representations where the individual values are represented as colours.

The JHeatMap component in Fig 3 shows the visualization of a data matrix with three rows and five columns. As the code snippet in Fig 4 shows, this component must be instantiated by providing a data matrix and two arrays specifying rows and columns names. Alternatively, it can be instantiated by providing a JHeatMapModel object. Also, the colours that must be used to create the gradient are set programmatically in this example. This component has the mouse wheel zooming feature enabled by default, but it can be programmatically disabled.

Download:

Fig 3. The JHeatMap component.

https://doi.org/10.1371/journal.pone.0204474.g003

Download:

Fig 4. Code snippet showing the instantiation of a JHeatMap.

Also, the colours that must be used to create the colour gradient are set programmatically.

https://doi.org/10.1371/journal.pone.0204474.g004

Based on this core component, it is provided the JHeatMapPanel component that presents a JHeatMap along with a toolbar for controlling different visualization options. These options are: (i) changing the colours of the colour gradient, (ii) changing the minimum and maximum values that must be used to compute the colour gradient, (iii) changing the font settings, (iv) transforming the data matrix (e.g. apply row centring, log transform data, etc.), (v) establishing the visible rows and columns, and (vi) exporting the heat map as image. This component must be instantiated by providing a previously created JHeatMap component.

Multiple Sequence Alignments Viewer

The gc4s-multiple-sequence-alignment-viewer module provides a graphical component called MultipleSequenceAlignmentViewerPanel to display multiple sequence alignment data.

The MultipleSequenceAlignmentViewerPanel component in Fig 5 shows the visualization of two sample sequences. As the code snippet in Fig 6 shows, this component must be instantiated by providing a list of objects that implement the Sequence interface. Also, a configuration object can be passed to set the initial visualization settings. If this object is not provided, the default settings are used.

Download:

Fig 5. The MultipleSequenceAlignmentViewerPanel component.

https://doi.org/10.1371/journal.pone.0204474.g005

Download:

Fig 6. Code snippet showing the instantiation of a MultipleSequenceAlignmentViewerPanel.

The initial visualization settings are also passed to the constructor.

https://doi.org/10.1371/journal.pone.0204474.g006

When this visualization is used, two features are usually required: (i) the possibility of highlighting specific sequence positions (e.g. to point out a specific amino acid or nucleotide as a result of a biological analysis, etc.), (ii) the possibility of adding additional information in form of tracks that are placed before or after the sequences (e.g. to add some kind of score associated with each sequence position). In GC4S, these can be achieved by the usage of SequenceAlignmentRenderer and MultipleSequenceAlignmentTracksModel objects, respectively. These objects can be added to the viewer constructor so that they can be used to show additional information. Fig 7 shows an example of this component configured with a renderer and a model that adds a track named Scores.

Download:

Fig 7. The MultipleSequenceAlignmentViewerPanel component configured with a renderer and a model that adds a track named Scores.

https://doi.org/10.1371/journal.pone.0204474.g007

Based on this core component, it is provided the MultipleSequenceAlignmentViewerControl component that presents a MultipleSequenceAlignmentViewerPanel along with a toolbar with different options for the end-users of the visualization. These options are: (i) selecting the tracks model and renderer that are being shown, (ii) changing the viewer configuration through a GUI dialog, and (iii) exporting the view as image or HTML document. Additional advanced examples of these two components are provided in the corresponding demo module.

Statistics Tests Table

The gc4s-statistics-tests-table module provides a graphical component called StatisticsTestTable to visualize tabular datasets and automatically compute p-values and q-values for each row in order to compare the conditions in them.

The table is generic, meaning that it can be used to display any type of data. The table retrieves the data from generic datasets implementing the Dataset interface, which also has a default implementation called DefaultDataset. The statistical tests used to compute p-values in each row to compare the conditions in the dataset must be compatible with the type of data under consideration. This means that a table created with a Dataset<Boolean> should receive a Test<Boolean>. The package org.sing_group.org.gc4s.statistics.data.tests of this module provides test implementations for Boolean (Chi-squared test, Chi-squared test with Yates' correction, Randomization test, Fisher’s Exact test), Number (several t-tests and ANOVA), and String (Chi-squared test) data. Developers using the library can develop their own tests by simply implementing the Test interface.

The StatisticsTestTable component in Fig 8 shows the visualization of a tiny dataset of Boolean values with two conditions, ten samples and ten features (for illustrative purposes, the size is small because it is meant to be a minimal example; advanced examples with bigger datasets can be found at the corresponding demo module). As the code snippet in Fig 9 shows, this component must be instantiated by providing a Dataset and a Test objects for the same type. Optionally, an object that implements the Correction interface can be provided in order to have one additional column displaying the corrected p-values obtained from the test, as it is the case of this example. After its instantiation, the table is also configured to: (i) set a cell renderer that displays text “YES” or “NO” instead of true and false; (ii) set a header cell renderer that displays sample names using different colours depending on their associated conditions and draws a left border at the first column of each condition to enhance the visual distinction between conditions; (iii) set a highlighter that uses different colours for the cell values (true in red and false in green), and (iv) set a highlighter that draws a left border at the first sample of each condition to enhance the visual distinction between conditions. These classes are also included in this module, thus they are reusable by the programmers of the library in the same way that we use them in the demos.

Download:

Fig 8. The StatisticsTestTable component.

https://doi.org/10.1371/journal.pone.0204474.g008

Download:

Fig 9. Code snippet showing the instantiation and configuration of a StatisticsTestTable.

https://doi.org/10.1371/journal.pone.0204474.g009

The time needed for the computation of the p-values and q-values can vary depending on the dataset size and the statistical tests used, so a ProgressEventListener can be added to the table in order to receive progress events from the component. Based on this core component, it is provided a StatisticsTestTablePanel component that presents a StatisticsTestTable along with a progress bar to monitor the progress computation. This component is instantiated in the same way than the basic table and the table used internally can be obtained programmatically so that it can be fully configured and controlled.

Case studies

Two AIBench-based applications and a regular standalone application were selected as case studies to illustrate how GC4S helped in the development of actual bioinformatics software.

AIBench-based applications: S2P and DEWE

S2P (http://www.sing-group.org/s2p) and DEWE (http://www.sing-group.org/dewe) were constructed using the AIBench framework following the straightforward architecture depicted in Fig 10. The gui module of both applications was entirely constructed with GC4S, allowing us to focus on the specific needs of those developments relying on the general purpose components offered by GC4S.

Download:

Fig 10. AIBench-based applications architecture (S2P and DEWE).

https://doi.org/10.1371/journal.pone.0204474.g010

S2P is an open source application for fast and accurate processing of 2D-gel and MALDI-based mass spectrometry protein data. Since this application allows users to manage and visualize different types of available data, GC4S was used to provide a rich and effective user experience. For instance, GC4S allowed enhancing tables by adding a popup summary to each column (Fig 11) as well as the creation of interactive and customizable heat map visualizations (Fig 12). Tables are also enhanced by adding spark lines from the JSparklines library that are created using the gc4s-jsparklines-factory module (Fig 13).

Download:

Fig 11. A popup summary added to each column of a table using the ColumnSummaryTableCellRenderer component.

https://doi.org/10.1371/journal.pone.0204474.g011

Download:

Fig 12. Heat map visualization of spots data in S2P using the JHeatMap component.

https://doi.org/10.1371/journal.pone.0204474.g012

Download:

Fig 13. Screenshot of the S2P application showing the table that displays a Mascot identifications report.

This and other tables are enhanced with spark lines from the JSparklines library that are created using the gc4s-jsparklines-factory module.

https://doi.org/10.1371/journal.pone.0204474.g013

Additionally, S2P also makes intensive use of the previously mentioned InputParametersPanel component to retrieve user inputs for different operations. Regarding this functionality, there is a noteworthy case: S2P deals with comma-separated values (CSV) files, and it must retrieve the CSV format configuration from the user. As this format is application-independent, GC4S offers a component called CsvPanel, which allows users to either customize or choose a predefined CSV format (Fig 14).

Download:

Fig 14. The CsvPanel component allows users to configure their own CSV format.

https://doi.org/10.1371/journal.pone.0204474.g014

DEWE is a novel open source application for executing RNA-Seq analysis focused on supporting differential expression experiments. As in the case of S2P, this application also benefits from GC4S components to provide a compelling user experience, using the table enhancements previously discussed together with the InputParametersPanel.

The DEWE project has the specific goal of facilitating the configuration and execution of differential expression analysis workflows. As these workflows require different inputs and configurations, a configuration assistant or wizard seemed a good way of supporting this functionality. Consequently, the Wizard component was included in the GC4S library to facilitate the creation of such configuration assistants. This particular component extends JDialog and accepts a list of WizardStep objects. Fig 15 shows the configuration assistant of one of the built-in workflows included in DEWE. While DEWE only needs to implement the specific WizardStep components needed to configure the workflow (i.e. the configuration steps), the Wizard component implemented in GC4S constructs the assistant dialog and manages the wizard buttons as well as the left sidebar that contains the steps labels.

Download:

Fig 15. Example of use of a Wizard in DEWE to create a workflow configuration assistant.

https://doi.org/10.1371/journal.pone.0204474.g015

Standalone developments

As previously commented, the components of the GC4S library can be used either separately or in conjunction with the AIBench framework to implement the view components. An illustrative example of the former case is the development of SEDA (SEquence DAtaset Builder). SEDA (http://www.sing-group.org/seda) is Java Swing application for efficient and flexible processing of FASTA files, offering different functionalities. Apart from using some of the previously commented components, two other notable examples illustrate the broad scope of our GC4S library. The first example involves the use of some of the icons provided in the Icons class. For instance, Fig 16 shows how a different icon from GC4S is used depending on the validity of the selected output directory, which is done using a JFileChooserPanel component.

Download:

Fig 16. Example of use of different icons provided by the Icons class.

https://doi.org/10.1371/journal.pone.0204474.g016

The second example involves a very common situation in which a different panel needs to be shown to the user depending on a previous selection in a combo box. This can be easily achieved by using a CardLayout, switching the visible panel (or card) when the user selection changes. The CardsPanel component provides this functionality and it is used in SEDA to display different configuration panels to the user, depending on the selection previously made in a combo box (Fig 17).

Download:

Fig 17.

Use of CardsPanel in SEDA to display a different configuration panel (A, B, C or D) to the user depending on the choice made in the Rename type combo box.

https://doi.org/10.1371/journal.pone.0204474.g017

Discussion and lessons learned

Two of the main forms of providing reusable software include libraries and frameworks. While libraries can be seen as a set of reusable data structures and functions, frameworks incorporate the concept of dependency inversion whereby the framework shapes the architecture of the application and the user code is plugged into the framework to build the final application. Libraries and frameworks are two complementary concepts. While frameworks usually incorporate libraries with helper functions, they are different concepts.

Java is a good ecosystem for developing reusable software. Not only because of its object oriented nature, which eases reusability by taking advantage of encapsulation and polymorphism, but also because of (i) Swing, which includes a JComponent base class for every visual component and a delegation event model that allows programmers to create and reuse custom interactive components easily, and (ii) Maven, which speeds-up the incorporation of libraries and frameworks to new projects due to its dependency management system.

The Java ecosystem incorporated in 2008 the JavaFX toolkit for the creation of GUI desktop applications as well as Rich Internet Applications. It was meant to be the successor of Java Swing because of its modern features (such as built-in UI controls, CSS support, the WebView component, or smooth animations, among others), but it seems that it has not met the expectations, as it has not achieved the same adoption and support levels (in terms of community forums and experts, for instance) than Java Swing. Furthermore, the sophisticated GUI features included in JavaFX may not be so essential for the development of scientific applications that are easy to use for end-users without advanced informatics skills, while the relative simplicity of Swing probably makes it more appropriate for academia. Although it is hard to estimate the actual usage of both libraries, some numbers can be obtained in this regard. By July 2018 we have seen that: (i) a Google Scholar query shows 42 400 entries for “Java Swing” and 4 950 for “JavaFX”, (ii) a generic Google search shows about 26 600 000 results for “Java Swing” and about 4 170 000 for “JavaFX”, and (iii) a Scopus query throws only 40 results for “JavaFX” but 258 for “Java Swing”. Based on this numbers and the maturity of the Java Swing technology, for which there are many additional third-party libraries and frameworks, it is still an appropriate choice for the development of scientific and bioinformatics software.

In this paper we have presented GC4S, a library of reusable Java Swing components, which is a very useful complement to our AIBench framework for biomedical desktop applications. While AIBench provides applications with a reusable main architecture and structure, based on the input-process-output (IPO) model, the GC4S library provides a set of components that are incorporated as needed for each specific application.

We provided an overview of the GC4S library and presented three real-world use cases to illustrate its capabilities and usefulness. When developing the GUI of these three software applications, Java Swing, SwingX and GC4S were used. As seen in Fig 18, Java Swing classes are the most referenced by GUI classes in the three projects. Nevertheless, the percentage of references to GC4S is notable, especially in DEWE and SEDA. This demonstrates the impact that the presented library had on the development of these three applications.

Download:

Fig 18. Percentages of GUI classes in S2P, DEWE and SEDA with references to Java Swing, GC4S and SwingX classes.

https://doi.org/10.1371/journal.pone.0204474.g018

Conclusions

GC4S (http://www.sing-group.org/GC4S) is open-source and freely distributed under license LGPLv3, providing a set of reusable graphical user interface components for Java Swing. The real utility of GC4S has been demonstrated by different case studies where its use has led to a more efficient software development process. GC4S is open to further extensions and we will keep improving it with fresh and innovative ideas as existing projects evolve or novel developments reach the marketplace.

Supporting information

S1 Table. Complete list of GC4S components by package and type.

Packages and components are listed alphabetically.

https://doi.org/10.1371/journal.pone.0204474.s001

(DOCX)

S1 Document. Basic examples of GC4S.

https://doi.org/10.1371/journal.pone.0204474.s002

(DOCX)

Acknowledgments

H. López-Fernández is supported by a post-doctoral fellowship from Xunta de Galicia (ED481B 2016/068-0). SING group thanks CITI (Centro de Investigación, Transferencia e Innovación) from University of Vigo for hosting its IT infrastructure.

References

1. Bolchini D, Finkelstein A, Perrone V, Nagl S. Better bioinformatics through usability analysis. Bioinformatics. 2009;25:406–12. pmid:19073592
- View Article
- PubMed/NCBI
- Google Scholar
2. Rother K, Potrzebowski W, Puton T, Rother M, Wywial E, Bujnicki JM. A toolbox for developing bioinformatics software. Brief Bioinform. 2012;13:244–57. pmid:21803787
- View Article
- PubMed/NCBI
- Google Scholar
3. Leprevost F da V, Barbosa VC, Francisco EL, Perez-Riverol Y, Carvalho PC. On best practices in the development of bioinformatics software. Front Genet [Internet]. 2014 [cited 2017 Dec 6];5. Available from: http://journal.frontiersin.org/article/10.3389/fgene.2014.00199/abstract
- View Article
- Google Scholar
4. Prlić A, Procter JB. Ten Simple Rules for the Open Development of Scientific Software. PLoS Comput Biol. 2012;8:e1002802. pmid:23236269
- View Article
- PubMed/NCBI
- Google Scholar
5. Seemann T. Ten recommendations for creating usable bioinformatics command line software. GigaScience [Internet]. 2013 [cited 2017 Dec 6];2. Available from: https://academic.oup.com/gigascience/article-lookup/doi/10.1186/2047-217X-2-15
- View Article
- Google Scholar
6. Pavelin K, Cham JA, de Matos P, Brooksbank C, Cameron G, Steinbeck C. Bioinformatics Meets User-Centred Design: A Perspective. Bourne PE, editor. PLoS Comput Biol. 2012;8:e1002554. pmid:22807660
- View Article
- PubMed/NCBI
- Google Scholar
7. Perez-Riverol Y, Wang R, Hermjakob H, Müller M, Vesada V, Vizcaíno JA. Open source libraries and frameworks for mass spectrometry based proteomics: A developer’s perspective. Biochim Biophys Acta BBA—Proteins Proteomics [Internet]. [cited 2013 Nov 18]; Available from: http://www.sciencedirect.com/science/article/pii/S1570963913001039
- View Article
- Google Scholar
8. Stajich JE. The Bioperl Toolkit: Perl Modules for the Life Sciences. Genome Res. 2002;12:1611–8. pmid:12368254
- View Article
- PubMed/NCBI
- Google Scholar
9. Giardine B. Galaxy: A platform for interactive large-scale genome analysis. Genome Res. 2005;15:1451–5. pmid:16169926
- View Article
- PubMed/NCBI
- Google Scholar
10. Prlic A, Yates A, Bliven SE, Rose PW, Jacobsen J, Troshin PV, et al. BioJava: an open-source framework for bioinformatics in 2012. Bioinformatics. 2012;28:2693–5. pmid:22877863
- View Article
- PubMed/NCBI
- Google Scholar
11. Chang W, Cheng J, Allaire JJ, Xie Y, McPherson J, RStudio, et al. shiny: Web Application Framework for R [Internet]. 2018 [cited 2018 Jul 27]. Available from: https://CRAN.R-project.org/package=shiny
12. Wang R, Perez-Riverol Y, Hermjakob H, Vizcaíno JA. Open source libraries and frameworks for biological data visualisation: A guide for developers. PROTEOMICS. 2015;15:1356–74. pmid:25475079
- View Article
- PubMed/NCBI
- Google Scholar
13. Fang X, Miller JA, Arnold J. J3DV: A Java-based 3D database visualization tool. Softw Pract Exp. 2002;32:443–63.
- View Article
- Google Scholar
14. Gansner ER, North SC. An open graph visualization system and its applications to software engineering. Softw Pract Exp. 2000;30:1203–33.
- View Article
- Google Scholar
15. Barsnes H, Vaudel M, Martens L. JSparklines: Making tabular proteomics data come alive. PROTEOMICS. 2015;15:1428–31. pmid:25422159
- View Article
- PubMed/NCBI
- Google Scholar
16. Fdez-Riverola F, Glez-Peña D, López-Fernández H, Reboiro-Jato M, Méndez JR. A JAVA application framework for scientific software development. Softw—Pract Exp. 2012;42:1015–36.
- View Article
- Google Scholar
17. López-Fernández H, de S. Pessôa G, Arruda MAZ, Capelo-Martínez JL, Fdez-Riverola F, Glez-Peña D, et al. LA-iMageS: a software for elemental distribution bioimaging using LA–ICP–MS data. J Cheminformatics [Internet]. 2016 [cited 2017 Dec 6];8. Available from: http://jcheminf.springeropen.com/articles/10.1186/s13321-016-0178-7
18. López-Fernández H, Santos HM, Capelo JL, Fdez-Riverola F, Glez-Peña D, Reboiro-Jato M. Mass-Up: an all-in-one open software application for MALDI-TOF mass spectrometry knowledge discovery. BMC Bioinformatics [Internet]. 2015 [cited 2015 Dec 15];16. Available from: http://www.biomedcentral.com/1471-2105/16/318
19. López-Fernández H, Araújo JE, Jorge S, Glez-Peña D, Reboiro-Jato M, Santos HM, et al. S2P: A software tool to quickly carry out reproducible biomedical research projects involving 2D-gel and MALDI-TOF MS protein data. Comput Methods Programs Biomed. 2018;155:1–9. pmid:29512488
- View Article
- PubMed/NCBI
- Google Scholar
20. López-Fernández H, Araújo JE, Glez-Peña D, Reboiro-Jato M, Fdez-Riverola F, Capelo-Martínez JL. S2P: A Desktop Application for Fast and Easy Processing of 2D-Gel and MALDI-Based Mass Spectrometry Protein Data. In: Fdez-Riverola F, Mohamad MS, Rocha M, De Paz JF, Pinto T, editors. 11th Int Conf Pract Appl Comput Biol Bioinforma [Internet]. Cham: Springer International Publishing; 2017 [cited 2017 Dec 6]. p. 1–8. Available from: http://link.springer.com/10.1007/978-3-319-60816-7_1
21. López-Fernández H, Reboiro-Jato M, Glez-Peña D, Aparicio F, Gachet D, Buenaga M, et al. BioAnnote: A software platform for annotating biomedical documents with application in medical learning environments. Comput Methods Programs Biomed. 2013;111:139–47. pmid:23562645
- View Article
- PubMed/NCBI
- Google Scholar
22. Pérez-Rodríguez G, Glez-Peña D, Azevedo NF, Pereira MO, Fdez-Riverola F, Lourenço A. Enabling systematic, harmonised and large-scale biofilms data computation: The Biofilms Experiment Workbench. Comput Methods Programs Biomed. 2015;118:309–21. pmid:25600941
- View Article
- PubMed/NCBI
- Google Scholar
23. Lourenço A, Carreira R, Carneiro S, Maia P, Glez-Peña D, Fdez-Riverola F, et al. @Note: a workbench for biomedical text mining. J Biomed Inform. 2009;42:710–20. pmid:19393341
- View Article
- PubMed/NCBI
- Google Scholar
24. Reboiro-Jato D, Reboiro-Jato M, Fdez-Riverola F, Vieira CP, Fonseca NA, Vieira J. ADOPS—Automatic Detection Of Positively Selected Sites. J Integr Bioinforma. 2012;9:200.
- View Article
- Google Scholar
25. Santos HM, Reboiro-Jato M, Glez-Peña D, Nunes-Miranda JD, Fdez-Riverola F, Carvallo R, et al. Decision peptide-driven: a free software tool for accurate protein quantification using gel electrophoresis and matrix assisted laser desorption ionization time of flight mass spectrometry. Talanta. 2010;82:1412–20. pmid:20801349
- View Article
- PubMed/NCBI
- Google Scholar
26. Galesio M, López-Fdez H, Reboiro-Jato M, Gómez-Meire S, Glez-Peña D, Fdez-Riverola F, et al. Speeding up the screening of steroids in urine: Development of a user-friendly library. Steroids. 2013;78:1226–32. pmid:24036418
- View Article
- PubMed/NCBI
- Google Scholar
27. López-Fernández H, Reboiro-Jato M, Pérez Rodríguez JA, Fdez-Riverola F, Glez-Peña D. The Artificial Intelligence Workbench: a retrospective review. ADCAIJ Adv Distrib Comput Artif Intell J. 2016;5:73.
- View Article
- Google Scholar
28. Kent WJ, Sugnet CW, Furey TS, Roskin KM, Pringle TH, Zahler AM, et al. The Human Genome Browser at UCSC. Genome Res. 2002;12:996–1006. pmid:12045153
- View Article
- PubMed/NCBI
- Google Scholar
29. Glez-Peña D, Gómez-López G, Reboiro-Jato M, Fdez-Riverola F, Pisano DG. PileLine: a toolbox to handle genome position information in next-generation sequencing studies. BMC Bioinformatics. 2011;12:31. pmid:21261974
- View Article
- PubMed/NCBI
- Google Scholar
30. Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics. 2009;25:2078–9. pmid:19505943
- View Article
- PubMed/NCBI
- Google Scholar

[ref1] 1. Bolchini D, Finkelstein A, Perrone V, Nagl S. Better bioinformatics through usability analysis. Bioinformatics. 2009;25:406–12. pmid:19073592
View Article
PubMed/NCBI
Google Scholar

[2] View Article

[3] PubMed/NCBI

[4] Google Scholar

[ref2] 2. Rother K, Potrzebowski W, Puton T, Rother M, Wywial E, Bujnicki JM. A toolbox for developing bioinformatics software. Brief Bioinform. 2012;13:244–57. pmid:21803787
View Article
PubMed/NCBI
Google Scholar

[6] View Article

[7] PubMed/NCBI

[8] Google Scholar

[ref3] 3. Leprevost F da V, Barbosa VC, Francisco EL, Perez-Riverol Y, Carvalho PC. On best practices in the development of bioinformatics software. Front Genet [Internet]. 2014 [cited 2017 Dec 6];5. Available from: http://journal.frontiersin.org/article/10.3389/fgene.2014.00199/abstract
View Article
Google Scholar

[10] View Article

[11] Google Scholar

[ref4] 4. Prlić A, Procter JB. Ten Simple Rules for the Open Development of Scientific Software. PLoS Comput Biol. 2012;8:e1002802. pmid:23236269
View Article
PubMed/NCBI
Google Scholar

[13] View Article

[14] PubMed/NCBI

[15] Google Scholar

[ref5] 5. Seemann T. Ten recommendations for creating usable bioinformatics command line software. GigaScience [Internet]. 2013 [cited 2017 Dec 6];2. Available from: https://academic.oup.com/gigascience/article-lookup/doi/10.1186/2047-217X-2-15
View Article
Google Scholar

[17] View Article

[18] Google Scholar

[ref6] 6. Pavelin K, Cham JA, de Matos P, Brooksbank C, Cameron G, Steinbeck C. Bioinformatics Meets User-Centred Design: A Perspective. Bourne PE, editor. PLoS Comput Biol. 2012;8:e1002554. pmid:22807660
View Article
PubMed/NCBI
Google Scholar

[20] View Article

[21] PubMed/NCBI

[22] Google Scholar

[ref7] 7. Perez-Riverol Y, Wang R, Hermjakob H, Müller M, Vesada V, Vizcaíno JA. Open source libraries and frameworks for mass spectrometry based proteomics: A developer’s perspective. Biochim Biophys Acta BBA—Proteins Proteomics [Internet]. [cited 2013 Nov 18]; Available from: http://www.sciencedirect.com/science/article/pii/S1570963913001039
View Article
Google Scholar

[24] View Article

[25] Google Scholar

[ref8] 8. Stajich JE. The Bioperl Toolkit: Perl Modules for the Life Sciences. Genome Res. 2002;12:1611–8. pmid:12368254
View Article
PubMed/NCBI
Google Scholar

[27] View Article

[28] PubMed/NCBI

[29] Google Scholar

[ref9] 9. Giardine B. Galaxy: A platform for interactive large-scale genome analysis. Genome Res. 2005;15:1451–5. pmid:16169926
View Article
PubMed/NCBI
Google Scholar

[31] View Article

[32] PubMed/NCBI

[33] Google Scholar

[ref10] 10. Prlic A, Yates A, Bliven SE, Rose PW, Jacobsen J, Troshin PV, et al. BioJava: an open-source framework for bioinformatics in 2012. Bioinformatics. 2012;28:2693–5. pmid:22877863
View Article
PubMed/NCBI
Google Scholar

[35] View Article

[36] PubMed/NCBI

[37] Google Scholar

[ref11] 11. Chang W, Cheng J, Allaire JJ, Xie Y, McPherson J, RStudio, et al. shiny: Web Application Framework for R [Internet]. 2018 [cited 2018 Jul 27]. Available from: https://CRAN.R-project.org/package=shiny

[ref12] 12. Wang R, Perez-Riverol Y, Hermjakob H, Vizcaíno JA. Open source libraries and frameworks for biological data visualisation: A guide for developers. PROTEOMICS. 2015;15:1356–74. pmid:25475079
View Article
PubMed/NCBI
Google Scholar

[40] View Article

[41] PubMed/NCBI

[42] Google Scholar

[ref13] 13. Fang X, Miller JA, Arnold J. J3DV: A Java-based 3D database visualization tool. Softw Pract Exp. 2002;32:443–63.
View Article
Google Scholar

[44] View Article

[45] Google Scholar

[ref14] 14. Gansner ER, North SC. An open graph visualization system and its applications to software engineering. Softw Pract Exp. 2000;30:1203–33.
View Article
Google Scholar

[47] View Article

[48] Google Scholar

[ref15] 15. Barsnes H, Vaudel M, Martens L. JSparklines: Making tabular proteomics data come alive. PROTEOMICS. 2015;15:1428–31. pmid:25422159
View Article
PubMed/NCBI
Google Scholar

[50] View Article

[51] PubMed/NCBI

[52] Google Scholar

[ref16] 16. Fdez-Riverola F, Glez-Peña D, López-Fernández H, Reboiro-Jato M, Méndez JR. A JAVA application framework for scientific software development. Softw—Pract Exp. 2012;42:1015–36.
View Article
Google Scholar

[54] View Article

[55] Google Scholar

[ref17] 17. López-Fernández H, de S. Pessôa G, Arruda MAZ, Capelo-Martínez JL, Fdez-Riverola F, Glez-Peña D, et al. LA-iMageS: a software for elemental distribution bioimaging using LA–ICP–MS data. J Cheminformatics [Internet]. 2016 [cited 2017 Dec 6];8. Available from: http://jcheminf.springeropen.com/articles/10.1186/s13321-016-0178-7

[ref18] 18. López-Fernández H, Santos HM, Capelo JL, Fdez-Riverola F, Glez-Peña D, Reboiro-Jato M. Mass-Up: an all-in-one open software application for MALDI-TOF mass spectrometry knowledge discovery. BMC Bioinformatics [Internet]. 2015 [cited 2015 Dec 15];16. Available from: http://www.biomedcentral.com/1471-2105/16/318

[ref19] 19. López-Fernández H, Araújo JE, Jorge S, Glez-Peña D, Reboiro-Jato M, Santos HM, et al. S2P: A software tool to quickly carry out reproducible biomedical research projects involving 2D-gel and MALDI-TOF MS protein data. Comput Methods Programs Biomed. 2018;155:1–9. pmid:29512488
View Article
PubMed/NCBI
Google Scholar

[59] View Article

[60] PubMed/NCBI

[61] Google Scholar

[ref20] 20. López-Fernández H, Araújo JE, Glez-Peña D, Reboiro-Jato M, Fdez-Riverola F, Capelo-Martínez JL. S2P: A Desktop Application for Fast and Easy Processing of 2D-Gel and MALDI-Based Mass Spectrometry Protein Data. In: Fdez-Riverola F, Mohamad MS, Rocha M, De Paz JF, Pinto T, editors. 11th Int Conf Pract Appl Comput Biol Bioinforma [Internet]. Cham: Springer International Publishing; 2017 [cited 2017 Dec 6]. p. 1–8. Available from: http://link.springer.com/10.1007/978-3-319-60816-7_1

[ref21] 21. López-Fernández H, Reboiro-Jato M, Glez-Peña D, Aparicio F, Gachet D, Buenaga M, et al. BioAnnote: A software platform for annotating biomedical documents with application in medical learning environments. Comput Methods Programs Biomed. 2013;111:139–47. pmid:23562645
View Article
PubMed/NCBI
Google Scholar

[64] View Article

[65] PubMed/NCBI

[66] Google Scholar

[ref22] 22. Pérez-Rodríguez G, Glez-Peña D, Azevedo NF, Pereira MO, Fdez-Riverola F, Lourenço A. Enabling systematic, harmonised and large-scale biofilms data computation: The Biofilms Experiment Workbench. Comput Methods Programs Biomed. 2015;118:309–21. pmid:25600941
View Article
PubMed/NCBI
Google Scholar

[68] View Article

[69] PubMed/NCBI

[70] Google Scholar

[ref23] 23. Lourenço A, Carreira R, Carneiro S, Maia P, Glez-Peña D, Fdez-Riverola F, et al. @Note: a workbench for biomedical text mining. J Biomed Inform. 2009;42:710–20. pmid:19393341
View Article
PubMed/NCBI
Google Scholar

[72] View Article

[73] PubMed/NCBI

[74] Google Scholar

[ref24] 24. Reboiro-Jato D, Reboiro-Jato M, Fdez-Riverola F, Vieira CP, Fonseca NA, Vieira J. ADOPS—Automatic Detection Of Positively Selected Sites. J Integr Bioinforma. 2012;9:200.
View Article
Google Scholar

[76] View Article

[77] Google Scholar

[ref25] 25. Santos HM, Reboiro-Jato M, Glez-Peña D, Nunes-Miranda JD, Fdez-Riverola F, Carvallo R, et al. Decision peptide-driven: a free software tool for accurate protein quantification using gel electrophoresis and matrix assisted laser desorption ionization time of flight mass spectrometry. Talanta. 2010;82:1412–20. pmid:20801349
View Article
PubMed/NCBI
Google Scholar

[79] View Article

[80] PubMed/NCBI

[81] Google Scholar

[ref26] 26. Galesio M, López-Fdez H, Reboiro-Jato M, Gómez-Meire S, Glez-Peña D, Fdez-Riverola F, et al. Speeding up the screening of steroids in urine: Development of a user-friendly library. Steroids. 2013;78:1226–32. pmid:24036418
View Article
PubMed/NCBI
Google Scholar

[83] View Article

[84] PubMed/NCBI

[85] Google Scholar

[ref27] 27. López-Fernández H, Reboiro-Jato M, Pérez Rodríguez JA, Fdez-Riverola F, Glez-Peña D. The Artificial Intelligence Workbench: a retrospective review. ADCAIJ Adv Distrib Comput Artif Intell J. 2016;5:73.
View Article
Google Scholar

[87] View Article

[88] Google Scholar

[ref28] 28. Kent WJ, Sugnet CW, Furey TS, Roskin KM, Pringle TH, Zahler AM, et al. The Human Genome Browser at UCSC. Genome Res. 2002;12:996–1006. pmid:12045153
View Article
PubMed/NCBI
Google Scholar

[90] View Article

[91] PubMed/NCBI

[92] Google Scholar

[ref29] 29. Glez-Peña D, Gómez-López G, Reboiro-Jato M, Fdez-Riverola F, Pisano DG. PileLine: a toolbox to handle genome position information in next-generation sequencing studies. BMC Bioinformatics. 2011;12:31. pmid:21261974
View Article
PubMed/NCBI
Google Scholar

[94] View Article

[95] PubMed/NCBI

[96] Google Scholar

[ref30] 30. Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics. 2009;25:2078–9. pmid:19505943
View Article
PubMed/NCBI
Google Scholar

[98] View Article

[99] PubMed/NCBI

[100] Google Scholar

Figures

Abstract

Introduction

The GC4S library: Implementation and components

Implementation

Components

Genome browser

Heat map

Multiple Sequence Alignments Viewer

Statistics Tests Table

Case studies

AIBench-based applications: S2P and DEWE

Standalone developments

Discussion and lessons learned

Conclusions

Supporting information

S1 Table. Complete list of GC4S components by package and type.

S1 Document. Basic examples of GC4S.

Acknowledgments

References