Kish International Campus

Ph.D. Degree Program in

Bioinformatics

Introduction


Bioinformatics is an interdisciplinary science at the interfaces of the biological, informational and computational sciences, uses computation to better understand biology. Bioinformatics involve the analysis of biological data, particularly DNA, RNA, and protein sequences. The field of bioinformatics experienced explosive growth starting in the mid-1990s, driven largely by the Human Genome Project and by rapid advances in DNA sequencing technology. Recent and novel technologies produce biological data sets of ever-increasing resolution that reveal not only genomic sequences but also RNA and protein abundances, their interactions with one another, their subcellular localization, and the identity and abundance of other biological molecules. This requires the development and application of sophisticated computational methods. Bioinformatics utilizes computational approaches to analyze patterns in biological data and to create complex models of biological activity, including attempts to elucidate the functions of genes and their interactions in genetic pathways. Widespread social benefits are expected from the exploitation of the wealth of new knowledge concerning the genetic mechanisms of life and related processes.
Analyses in bioinformatics predominantly focus on three types of large datasets available in molecular biology: macromolecular structures, genome sequences, and the results of functional genomics experiments (e.g. expression data). Additional information includes the text of scientific papers and "relationship data" from metabolic pathways, taxonomy trees, and protein-protein interaction networks. Bioinformatics employs a wide range of computational techniques including sequence and structural alignment, database design and data mining, macromolecular geometry, phylogenetic tree construction, prediction of protein structure and function, gene finding, and expression data clustering. The emphasis is on approaches integrating a variety of computational methods and heterogeneous data sources.
The main objective of the Ph.D. program in Bioinformatics at Kish international campus is to train the next generation of computational biologists for careers in academia, industry, and government.
 

PhD Curriculum


The PhD of Bioinformatics requires completion of 32 credits, a set of core courses (9 credits), a seminar (1 credit) and 8 credits of elective courses and a PhD thesis (18 credits). The main emphasis of the program is on the successful completion of an original and independent research project written and defended as a dissertation.

Comprehensive Exam


Comprehensive Exam should be taken at most at the end of the 4th semester and is required before a student could defend the Ph.D. proposal. Students will have two chances to pass the Ph.D. Comprehensive Exam. If students receive an evaluation of "unsatisfactory" on their first Comprehensive Exam attempt, the student may retake the qualifier once. A second failure will result in termination from the program. The Comprehensive Exam is designed to ensure that the student starts early in gaining research experience; it also ensures that the student has the potential to conduct doctoral-level research.

Ph.D. PROPOSAL


The Ph.D. proposal must contain Specific Aims, Research Design and Methods, and Proposed Work and Timeline. In addition the proposal must also contain a bibliography and, as attachments, any publications/supplementary materials. The student must defend their thesis proposal to their committee in an oral exam.

THESIS


A student should choose a thesis advisor (and one or two co-advisors if required) within the first year of being in the PhD program, approved by the Faculty committee. In the second year a thesis committee suggested by the advisor alongside by the Ph.D. proposal should be handed over for approval. The thesis committee should consist of a minimum of five faculty members. Two members of thesis committee should be from the other Universities at the associate Professor level. Not later than the end of the 5th semester a student has to present and defend a written PhD proposal.

RESEARCH PROGRESS


A student is expected to meet with his/her thesis committee at least once a year to review the research progress. In the beginning of each university calendar year, each student and the student's advisor are required to submit an evaluation assessment of the student's progress, outlining past year accomplishments and plans for the current year. The thesis committee reviews these summaries and sends the student a letter summarizing their status in the program. Students who are failing to make satisfactory progress are expected to correct any deficiencies and move to the next milestone within one year. Failure to do so will result in dismissal from the program.

PhD DISSERTATION


Within 4 years after entering the PhD program, the student is expected to complete the thesis research; the student must have the results of the research accepted or published in peer reviewed journals. Upon submitting a written thesis and public defense and approval by the committee, the student is awarded the PhD degree. The defence will consist of (1) a presentation of the dissertation by the graduate student, (2) questioning by the general audience, and (3) closed door questioning by the dissertation committee. The student will be informed of the exam result at the completion of all three parts of the dissertation defense. All members of the committee must sign the final report of the doctoral committee and the final version of the dissertation.

A minimum GPA of 16 over 20 must be maintained for graduation.

Leveling Courses (not applicable to degree)


The Ph.D. in Bioinformatics assumes a Master degree in related fields. However students holding any other master degree besides will be required to complete a few of the following leveling courses that are designed to provide a back ground for the Ph.D. courses.  These leveling courses are not counted for graduate credit towards the Ph.D. in Bioinformatics.

Leveling courses: At most 3 courses required; 6 credits

 

Course

Credits

Hours

Biochemistry

2

32

Bioinformatics

2

32

Topics in Molecular Biology

2

32

Molecular Biophysics

2

32

Statistics

2

32

Computer programing

2

32

Algorithm and data structure

2

32

Core courses: 4 courses required; 10 credits

Course

Credits

Hours

Theory

Practice

Total

Theory

Practice

Total

Advanced bioinformatics

2

1

3

32

32

64

Algorithms in bioinformatics

3

0

3

48

0

48

Bioinformatics Database

2

1

3

32

32

64

Seminar

1

0

1

16

0

16

Total

8

2

10

128

64

192

Elective Courses: 4 courses required, 8 credits

Course

Credits

Hours

Structural bioinformatics

2

32

Computational genomics

2

32

Metabolic modeling

2

32

Modeling in system biology

2

32

Advanced data mining

2

32

Machin learning

2

32

Computer aided Drug design

2

32

Total

14

224

 

 

Course Descriptions

 

Advanced bioinformatics

 

Course Content:


Introduction to Bioinformatics, Introduction to Molecular Biology, Biological Databases, Processing Biological Sequences with MATLAB, Sequence Homology, Protein Alignments, Multiple Sequence Alignment, Alignment Tools, Biolinguistic Methods, Sequence Models, Subsequence Pattern Models, Gene Models, Introduction to Phylogenetic Reconstruction, Distance Based Methods, Character Based Methods: Parsimony, Probabilistic Methods: Maximum Likelihood, Microarrays, Matlab

References

 

 

[1] J. Pevsner, Bioinformatics and Functional Genomics, John Wiley & Sons, 2015.
[2] G. B. Singh, Fundamentals of Bioinformatics and Computational Biology: Methods and Exercises in MATLAB, Springer, 2014.

 


Algorithms in bioinformatics

 

Course Content:


Introduction to Molecular Biology, Sequence Similarity, Suffix Tree, Genome Alignment, Database Search, Multiple Sequence Alignment, Phylogeny Reconstruction, Phylogeny Comparison, Genome Rearrangement, Motif Finding, RNA Secondary Structure Prediction, Peptide Sequencing, Population Genetics

 

References

 

 

[1] W.-K. Sung, Algorithms in Bioinformatics: A Practical Introduction, CRC Press, 2009.
[2] K. Erciyes, Distributed and Sequential Algorithms for Bioinformatics, Springer, 2015.

 


Structural bioinformatics

 

Course Content:


Constrain molecular modeling, Defining bioinformatics and structural, Fundamentals of protein structure, Search and sampling in structural ,Search methods ,Data analysis and reduction ,Molecular visualization

 

References

 

 

[1] J. Gu and P. E. Bourne, Structural Bioinformatics, John Wiley & Sons, 2011.
[2] F. J. Burkowski, Structural Bioinformatics: An Algorithmic Approach, CRC Press, 2008.

 


Computational genomics

 

Course Content:


Introduction , Concepts of Genetic Epidemiology, Integration of Linkage Analysis and Next Generation Sequencing Data , QTL Mapping of Molecular Traits for Studies of Human Complex Diseases , Renewed Interest in Haplotype From Genetic Marker to Gene Prediction , Analytical Approaches for Exome Sequence Data , Rare Variants Analysis in Unrelated Individuals , Gene Duplication and Functional Consequences, From GWAS to Next Generation Sequencing on Human Complex Diseases The Implications for Translational Medicine and Therapeutics

 

References

 

 

[1] Y. Y. Shugart, Applied Computational Genomics, Springer, 2012.
[2] N. Saitou, Introduction to Evolutionary Genomics, Springer, 2014.

 


Metabolic modeling

 

Course Content:


Engineering Synthetic Metabolons from Metabolic Modeling to Rational Design of Biosynthetic Devices, Building synthetic sterols computationally unlocking the secrets of evolution? ,Characteristics of Sucrose Transport through the Sucrose Specific Porin ScrY Studied by Molecular Dynamics Simulations ,Fast Solver for Implicit Electrostatics of Biomolecules ,Model Based Design of Biochemical Microreactors ,Underpinning starch biology with in vitro studies on carbohydrate active enzymes and biosynthetic glycomaterials ,Compartmentalization and Transport in Synthetic Vesicles ,Metabolomics standards and metabolic modeling for synthetic biology in plants ,Are Predictions Consistent with Experimental Evidence? ,Optimization of Engineered Production of the Glucoraphanin Precursor Dihomomethionine in Nicotiana benthamiana ,Synthetic Peptides as Protein Mimics ,Synthetic Protein Scaffolds Based on Peptide Motifs and Cognate Adaptor Domains for Improving Metabolic Productivity ,Engineering of metabolic pathways by artificial enzyme channels

 

References

 

 

[1] L. M. Voll and Z. Nikoloski, Engineering Synthetic Metabolons: From Metabolic Modelling to Rational Design of Biosynthetic Devices, Frontiers Media SA, 2016.
[2] C. Smolke, The Metabolic Pathway Engineering Handbook: Fundamentals, CRC Press, 2009.

 


Modeling in system biology

 

Course Content:


Biological Basics, Fundamentals of Mathematical Modeling, Model Calibration and Experimental Design, Modeling of Cellular Processes, Enzymatic Conversion, Polymerization Processes, Signal Transduction and Genetically Regulated Systems, Analysis of Modules and Motifs, General Methods of Model Analysis, Aspects of Control Theory, Motifs in Cellular Networks, Analysis of Cellular Networks, Metabolic Engineering, Topological Characteristics

 

References

 

 

[1] M. i. S. B. T. P. N. Approach, Koch,Ina ; Reisig, Wolfgang ;Schreiber,Falk, Springer, 2010.
[2] B. P. Ingalls, Mathematical Modeling in Systems Biology: An Introduction, MIT Press, 2013.

 


Advanced data mining

 

Course Content:


Introduction to Data Mining in Bioinformatics, Hierarchical Profiling Scoring and Applications in Bioinformatics Methods and Practices of Combining Multiple Scoring Systems, DNA Sequence Visualization, Proteomics with Mass Spectrometry, Efficient and Robust Analysis of Large Phylogenetic Datasets, Algorithmic Aspects of Protein Threading, Pattern Differentiations and Formulations for Heterogeneous Genomic Data, Parameterless Clustering Techniques for Gene Expression Analysis, Joint Discriminatory Gene Selection for Molecular Classification of Cancer, A Haplotype Analysis System for Genes Discovery of Common Diseases, A Bayesian Framework for Improving Clustering Accuracy of Protein Sequences

 

References

 

 

[1] H.-H. Hsu, Advanced Data Mining Technologies in Bioinformatics, Idea Group Inc (IGI), 2006.
[2] S. Dua and P. Chowriappa, Data Mining for Bioinformatics, CRC Press, 2012.

 


Machin learning

 

Course Content:


Why we are interested in machine learning, Machine learning statistics and data analytics, Pattern recognition, Neural networks and deep learning, Learning clusters and recommendations, Learning to take actions, Where do we go from here?

 

References

 

 

[1] E. Alpaydin, Machine Learning: The New AI, MIT Press, 2016.
[2] S. Raschka, Python Machine Learning, Packt Publishing Ltd, 2015.

 


Computer aided Drug design

 

Course Content:


Quantum Mechanical and Molecular Mechanical Approaches, Transition Metal Systems, Modeling Protein Protein Interactions by Rigid body, QM Based Modelling, Current Status and Future

RESEARCH

 

 

[1] L. Banting and T. Clark, Drug Design Strategies: Computational Techniques and Applications, Royal Society of Chemistry, 2012.
[2] N. Brown, In Silico Medicinal Chemistry: Computational Methods to Support Drug Design, Royal Society of Chemistry, 2015.