Complexity and simplicity

Brain achieves a remarkable task to simplify the world and it is important for us to distinguish data from the noise. Take the example of a small piece of forest. This forest has trees, plants and some animals. The trees by themselves have branches, leaves, and other structures that have distinct shapes and sizes. If we had to classify all the information on one tree, then we would have to detail significant amount of information – detail information about shapes of each branch, shapes, color and size of each leaf since neither each branch or leaf is identical to another.

Put it in this way and then combined with information about each tree, there is a tremendous complexity to the forest and the structures that are described in detail.

However, calling a tree – a tree, makes several assumptions about some random nature of the tree and also some expected variation in a leaf. Once you have made that leap of generalization then the forest becomes a collection of trees that has some variation is size, shape, and color but can be broadly identified and grouped into a structure called a tree. However, a diseased tree had slightly different characteristics – for example fewer leaves or anomalies in the branches that are not as branched and appear broken.

When looking at Big Data it is important to reduce the data down to structures, shapes or broad categories that separates the expected noise from the small signal. Then it is much easier to comprehend and understand the structure of the data and find the signal among the noise.

Similar Posts

  • | |

    Hypertension

    There are many therapies for regulating high blood pressure. The one that has received some attention is the natural therapy by consumption of beet in various forms. Juice, raw, tablets, and cooked. The primary benefit appears to be conferred by the presence of nitrates, which enhance the amount of Nitric oxide present in blood and…

  • Open source goodies

    Open source text search engine There are many technologies available that get slightly better packaged and then sold commercially. Often times the technologies are so superb and crowd sourced so well that it is surprising that many people do not consider it as a valid strategy for their laboratories or companies. Take the example of…

  • |

    Adeno associated virus

    Adeno associated virus (AAV) has been one of major interests for gene therapy. AAV was discovered in 1965 as a contaminant of adeno virus and hence the name. It is 22nm and low pathogenicity because no polymerase and cannot replicate till there is co-infector. Genome map is 4.7kb. 145 nucleotides inverted terminal repeats at both…

  • Synthetic biology

    The term is more complex than the function! Synthetic biology implies control of biological circuits through a control mechanism. Wikipedia calls it “redesign of existing biological systems for useful purposes”. This combines biology and engineering and the differentiator between yeast based beer making might be the control systems that are built in the system by…

  • |

    ALS and TDP43

    Almost 90% of ALS is thought to be sporadic. The reasons are not well understood and thought to be oxidative stress, damaged endoplasmic reticulum, mitochondria, cytoskeleton or misregulation of RNA pathways. One protein implicated is TDP43 which sits as a junction of ALS pathogenic path by binding to RNA modulation synthesis, splicing, stability and transport….