Category: Data analysis and Big Data

  • Cronbach’s alpha

    Cronbach’s alpha

    A statistics concept that has been used for a specific purpose. This has been used to interpret how well the scales, surveys (or test items) function for determining a factor under consideration. It is used to determine reliability or the internal consistency of the test item. The surveys are used to measure things that are…

  • Searching for datasets

    Searching for datasets

    For anything Artificial Intelligence or Machine Learning, datasets are important and sometimes to tune the algorithms requires a dataset that is useful and valid.One search tool that many use is called “GOOGLE” but there is a specific link to search for datasets. https://datasetsearch.research.google.com/ Another site talks about the background of google search engine and other…

  • Intraclass coefficient and Clinical Trials

    Intraclass coefficient and Clinical Trials

    One technique that is used to compare two groups in statistics is to measure whether they are correlated or not – for example, if you are trying to correlate whether food consumption and weight gain are related to each other. There are several tests to tell you whether that is true or not. However, it…

  • Data visualization: Rawgraphs

    Data visualization: Rawgraphs

    It is incredible to see the number of resources that are available for visualizing data. For most people it is a spreadsheet and the graphs it provides. Then there are dedicated programs that do all the data plotting in multiple styles for the user. These are complicated and involved programs that take the data from…

  • 3D Anatomy

    3D Anatomy

    There are so many ways to visualize complex data AND understand it at the same time. However, this has been done for many years for complex data using diagrams.A typical place this has done many times has been anatomy. These diagrams present the views from different viewpoints and these then create the understanding of the…

  • Mutanome

    Mutanome

    Proteins mutate. This happens due to various circumstances but the consequences can be significant. One of the major effects is cancer. The mutated proteins build up properties that are oncogenic and can be a problem.This has been investigated through publications wherein a few mutations were highlighted. Currently, many of the proteins have been cataloged and…

  • Integrative genomics viewer

    Integrative genomics viewer

    It is not possible to use simple plotting or charting programs to view the data and draw meaningful conclusions. A dedicated viewer that enables large data sets to be displayed correctly is one of the best options. It is open source with copyrights by Broad and U. CaliforniaIf you are looking at genomics data and…

  • Extensible Open source -omics software

    Extensible Open source -omics software

    Understanding complex data takes effort. This graphic shows co-morbidites in COVID-19 that was accomplished by a piece of software called Cytoscape The number of open source software that is available is a big list. One of them is Cytoscape. It has ability for wonderful integration of network data from various sources that can be analyzed…

  • Virtual Reality in Biology

    Virtual Reality in Biology

    Virtual Reality (VR) is moving forward so rapidly that it appears that we will be using that as a primary means for visualizing almost all parameters. The part that is stunning is some of the code that is being developed to enable VR. This uses the existing technology and now uses it in applications where…

  • Enable drug discovery

    Enable drug discovery

    Drug discovery is hard.Amazing to see the databases that are available for public access that enable drug discovery. Broad institute publishes The Connectivity map (CMAP)which is a database of gene signatures of transcriptional response to perturbation of many cell lines. This is incredible amount of data that is available in the public domain to be…