Sonnhammer Master projects
Potential master thesis (exjobb) projects in the Sonnhammer group:
Projects are offered in the area of protein function prediction and analysis, mainly in three projects:
- Gene network construction and analysis
- Orthology identification and analysis
- Protein domain architecture evolution and analysis
Specific project descriptions are listed here:
- Development of an interactive website to explore protein networks
The FunCoup network database is a vast resource of functional coupling between proteins and genes. The goal of the project is to develop a new network viewer with modern web technologies that is integrated with the FunCoup database and can make full use of its features. The project will mainly involve programming.
- Pathway crosstalk enrichment visualisation
The PathwAX website provides online pathway annotation based on crosstalk derived through FunCoup's genome wide functional association networks. The goal is to develop a network visualisation tool that will show the crosstalk between a given query gene set and a give pathway. The project will mainly involve programming.
- Benchmarking of pathway analysis methods
BinoX is a new tool for measuring crosstalk between gene sets, particularly aimed at pathway annotation. The goal is to benchmark BinoX and other similar tools such as NEAT, NEA+, and LEGO, possibly also in combination with clustering of the gene sets. The project will involve script writing and analysis of results.
- Ultrafast sequence clustering for Pfam-B
Pfam is a comprehensive database of annotated Pfam domains. Between these Pfam-A domains are long stretches of unclassified sequences. In the past, homology-based methods were used to cluster these stretches into Pfam-B domains but with the huge growth of the sequence databases this method is no longer feasible. Instead we want to employ ultrafast alignment-free methods to make an approximate but feasible sequence clustering. The project will involve exploration of alignment-free algorithms and packages, programming, script writing, and analysis of results.
- Domain orthology
Orthology in InParanoid, Hieranoid, and most other ortholog databases is defined on the whole-protein level. The goal is to explore the difference when orthology is defined on the domain level, and to establish rules for when domain orthology can give an advantage. The project will involve script writing and analysis of results.
- Network-based identification of new disease genes.
The goal is to use the FunCoup network to identify genes not previously associated with a disease but with an enrichment of network links to known disease genes. Furthermore, conservation of the relevant subnetwork in model organisms will be analysed in order to identify particularly interesting new candidates. The project will involve script writing and analysis of results.
Previous projects:
- Development of an online Hieranoid database.
- A tool for analysing protein domain architecture evolution.
The goal is to develop a web-based tool that can display evolutionary trees of proteins and their domain architecture. This is very useful for understanding how domains have been shuffled during evolution. The tool should either be written in Java and integrated in PfamAlyzer or be implemented using JavaScript.
- Protein domain architecture evolution
- Benchmarking the next generation of homology inference tools
- Adaptive evolution in birds
- Interactive website to explore protein networks
Sonnhammer home