Finding optimal finite biological sequences over finite alphabets: the OptiFin toolbox

06/25/2017
by   Régis Garnier, et al.
0

In this paper, we present a toolbox for a specific optimization problem that frequently arises in bioinformatics or genomics. In this specific optimisation problem, the state space is a set of words of specified length over a finite alphabet. To each word is associated a score. The overall objective is to find the words which have the lowest possible score. This type of general optimization problem is encountered in e.g 3D conformation optimisation for protein structure prediction, or largest core genes subset discovery based on best supported phylogenetic tree for a set of species. In order to solve this problem, we propose a toolbox that can be easily launched using MPI and embeds 3 well-known metaheuristics. The toolbox is fully parametrized and well documented. It has been specifically designed to be easy modified and possibly improved by the user depending on the application, and does not require to be a computer scientist. We show that the toolbox performs very well on two difficult practical problems.

READ FULL TEXT
research
06/25/2017

Well-supported phylogenies using largest subsets of core-genes by discrete particle swarm optimization

The number of complete chloroplastic genomes increases day after day, ma...
research
10/30/2020

In Searching of Long Skew-symmetric Binary Sequences with High Merit Factors

In this paper we present best-known merit factors of longer binary seque...
research
05/09/2018

Solving Sudoku with Ant Colony Optimisation

In this paper we present a new Ant Colony Optimisation-based algorithm f...
research
09/12/2019

Inverse Graphical Method for Global Optimization and Application to Design Centering Problem

Consider the problem of finding an optimal value of some objective funct...
research
04/09/2015

Extraction of Protein Sequence Motif Information using PSO K-Means

The main objective of the paper is to find the motif information.The fun...
research
01/09/2022

λ-Scaled-Attention: A Novel Fast Attention Mechanism for Efficient Modeling of Protein Sequences

Attention-based deep networks have been successfully applied on textual ...

Please sign up or login with your details

Forgot password? Click here to reset