Welcome to the SUAM Project

SUAM stands for Speeding Up data Analysis for Mexico with artificial intelligence and distributed computing. Is a project based on a previous work done and with results being evaluated by a Scientific Committee.

The SUAM's origin

Concerned about the current situation about the SARS-CoV-2 in Mexico, a group of young, enthusiasts and very capable people decided to start this project.

Inspired on

The main idea, that is to say, bring some technologies in just one, and also the architecture idea, is based upon an article series: https://scholar.google.com/citations?user=wpPYUQUAAAAJ&hl=en

About the Project

The SUAM project combines some of the best software (to our knowledge), tools and related technologies in the next fields:

Bioinformatics.
Machine learning (ML). Focused on classification and clusterization.
Deep learning (DL).
Distributed computing (DC).

Built on top

For bioinformatics:

For ML/DL:

For DC:

Ray

Structure

The project folders are:

bio. Contains the bioinformatics framework for sequences alignments and it would be intended for future use to, among others: molecular mechanics.
cl. Stands for classifiers; related to classification problems in ML.
dl. Framework for DL. Actually supporting: Keras, PyTorch and Scikit-learn.
parsers. Defines the JSONParser class in its __init__ file. This class is responsible to parse the main JSON configuration file where the tools (for bioinformatics, classification and deep learning) can be specified, and also their parameters.
runners and tests. These folders could be deprecated in future versions (note that each folder -i.e. bio, cl and dl folders-, as required, contains its own folders).

In the bio, cl and dl folders you will find, among others, the next two main files:

cfg.json. Contains the configuration for each tool supported in the project.
requirements.prod. The Python required modules (remember: run pip install -r requirements.prod before anything else) for each case (bioinformatics, classifier and deep learning).

A special case: the `bio` folder

The bio folder and its tests is the most advanced in comparison to ML/DL, and this is deliberate, because, in comparison to the latter, their tools (Clustal Omega and MUSCLE) are not totally tighted to Python.

So, in order to reduce the required time and efforts you will find, in the bio folder, the scripts folder, where, among others, you will see the install.sh file. Please run this file to be able to execute Clustal Omega and MUSCLE, which are dependencies for the SUAM's bioinformatics framework.

Finally, as we said, the bio folder contains tests and their results (in its named folders).

What's next?

Nowadays we are working with a new (second) paper with the first results from our experiments and we expect to release the architecture first version in these days.
Build and run the tests sets for ML/DL.
Analyse the results of the step above.
Start a new article.

Colaborators

We're looking for young, enthusiasts and very capable people, software engineers, data scientists, computing specialists, information technologies(IT) specialists, on the levels: student (more than the 50% from curricula approved) and/or engineer.

If you believe that can help on this please write to: [[email protected]](mailto:[email protected]?subject=SUAM Colaborator "[email protected]")

Sponsors / Partially funded by

Not actually looking for money, right now we're looking devices to build a devices cluster for the data processing. Do you have an old machine and you can't sell it? Do you have an old machine and do you want to dispose it? Don't sell it, don't dispose it, donate it for the project.

If you believe that can help on this please write to: [[email protected]](mailto:[email protected]?subject=SUAM Sponsor "[email protected]")

Supported by

Are you a small-medium size organization? Would you like that your company logo appear in the project's site, or its derivated works?

If you believe that can help on this please write to: [[email protected]](mailto:[email protected]?subject=SUAM Supporter "[email protected]")

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
bio		bio
cl		cl
dl		dl
parsers		parsers
runners		runners
tests		tests
LICENSE		LICENSE
README.md		README.md
_config.yml		_config.yml
classes.png		classes.png
classes__.png		classes__.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Welcome to the SUAM Project

The SUAM's origin

Inspired on

About the Project

Built on top

Structure

A special case: the `bio` folder

What's next?

Colaborators

Sponsors / Partially funded by

Supported by

About

Releases

Packages

Languages

License

EDario333/suam

Folders and files

Latest commit

History

Repository files navigation

Welcome to the SUAM Project

The SUAM's origin

Inspired on

About the Project

Built on top

Structure

A special case: the bio folder

What's next?

Colaborators

Sponsors / Partially funded by

Supported by

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

A special case: the `bio` folder

Packages