Advanced Statistical
Models

ΥΙΤΑΙ focus on innovation through the exploitation of data using advanced analytics and AI.

Clustering of gene/protein/metabolite abundance for the identification of common regulation and presence patterns with a variety of clustering algorithms and provision of help on identifying the best performing one for the client’s data. Use of ΥΙΤΑΙ heuristics algorithms to identify important variables such as the optimal number of clusters for an expression dataset.
Classification modeling, supervised and unsupervised learning for the detection of potential signatures characterizing several biological conditions (e.g., healthy vs diseased tissue).

Efficient use of popular unsupervised (e.g., k-means clustering) and superior supervised machine learning methodologies (e.g., Random Forests and Support Vector Machines) coupled with feature selection based on information content towards the identification of molecular (gene/protein/miRNA/metabolite) joint signatures able to distinguish between healthy and disease status or between pathologies which are hard to distinguish by macroscopic methods. Sensitivity/specificity and classification accuracy reports for the screening of potential drug targets.

Analysis of X-Seq data (other than RNA-, ChIP- and DNA/Exome-, for example FAIRE-Seq) requiring more specialized data handling and statistical modeling.Computational association of putative binding sites derived from ChIP-Seq experiments with gene expression (absolute RNA abundance or deregulated genes). Use of advanced algorithms for the derivation of association scores of TF profiles with gene expression.
Scanning for DNA motifs in promoters of genes belonging to similar expression groups, for the identification of common regulatory elements.
De novo motif discovery in ChIP-Seq data for the motif enrichment in binding sites and the identification of possible co-factors, using a combination of widely verified motif discovery tools. Motif clustering to identify regulatory element consensuses.
Advanced data visualizations and custom analytics upon discussions about the goals of the client.Network visualization of gene and metabolic networks based on public repositories and known protein-protein interactions.

Inference of chemical formula for metabolites and/or small molecules that could not be matched against any known database in metabolomics experiments.
Screening of public databases for gene/protein/miRNA/metabolite disease associations
Custom programming/scripting when existing tools are not sufficient to reach the analysis goals or when the client requires advanced data handling and visualization.

Explore our Artificial Intelligence (AI) Technologies

Life science
&
Bioinformatics services

Custom database
&
Software design

Contact us

Be Data Driven