Show simple item record

dc.contributor.authorKallus, Jonatan
dc.date.accessioned2020-05-07T17:11:32Z
dc.date.available2020-05-07T17:11:32Z
dc.date.issued2020-05-07
dc.identifier.isbn978-91-7833-888-7
dc.identifier.isbn978-91-7833-889-4
dc.identifier.urihttp://hdl.handle.net/2077/63747
dc.description.abstractGenomic data describe biological systems on the molecular level and are, due to the immense diversity of life, high-dimensional. Network modeling and integrative analysis are powerful methods to interpret genomic data. However, network modeling is limited by the requirement to select model complexity and due to a bias towards biologically unrealistic network structures. Furthermore, there is a need to be able to integratively analyze data sets describing a wider range of different biological aspects, studies and groups of subjects. This thesis aims to address these challenges by using resampling to control the false discovery rate (FDR) of edges, by combining resampling-based network modeling with a biologically realistic assumption on the structure and by increasing the richness of data sets that can be accommodated in integrative analysis, while facilitating the interpretation of results. In paper I, a statistical model for the number of times each edge is included in network estimates across resamples is proposed, to allow for estimation of how the FDR is affected by sparsity. Accuracy is improved compared to state-of-the-art methods, and in a network estimated for cancer data all hub genes have documented cancer-related functions. In paper II, a new method for integrative analysis is proposed. The method, based on matrix factorization, introduces a versatile objective function that allows for the study of more complex data sets and easier interpretation of results. The power of the method as an explorative tool is demonstrated on a set of genomic data. In paper III, network estimation across resamples is combined with repeated community detection to compensate for the structural bias inherent in common network estimation methods. For estimation of the regulatory network in human cancer, this compensation leads to an increased overlap with a database of gene interactions. Software implementations of the presented methods have been published. The contributed methods further the understanding that can be gained from high-dimensional genomic data, and may thus help to devise new treatments and diagnostics for cancer and other diseases.sv
dc.language.isoengsv
dc.relation.haspart1. Kallus, J., Sánchez, J., Jauhiainen, A., Nelander, S., Jörnsten, R. (2017). ROPE: high-dimensional network modeling with robust control of edge FDR. Preprint arxiv.org/abs/1702.07685sv
dc.relation.haspart2. Kallus, J., Johansson, P., Nelander, S., Jörnsten, R. (2019). MM-PCA: integrative analysis of multi-group and multi-view data. Preprint arxiv.org/abs/1911.04927sv
dc.relation.haspart3. Kallus, J., Nelander, S., Jörnsten, R. (2020). Large-scale network estimation with structure-adaptive stability selection. Manuscriptsv
dc.subjectMathmatical statisticssv
dc.subjectBiostatisticssv
dc.titleNetwork modeling and integrative analysis of high-dimensional genomic datasv
dc.title.alternativeNätverksmodellering och integrativ analys av högdimensionell genomikdatasv
dc.typeText
dc.type.svepDoctoral thesiseng
dc.gup.mailkallus@chalmers.sesv
dc.type.degreeDoctor of Philosophysv
dc.gup.originGöteborgs universitet. Naturvetenskapliga fakultetensv
dc.gup.departmentDepartment of Mathematical Sciences ; Institutionen för matematiska vetenskapersv
dc.gup.defenceplaceOnsdagen den 10 juni 2020, kl. 13.15, Sal Pascal, Matematiska vetenskaper, Chalmers tvärgata 3sv
dc.gup.defencedate2020-06-10
dc.gup.dissdb-fakultetMNF


Files in this item

Thumbnail
Thumbnail

This item appears in the following Collection(s)

Show simple item record