Recent Posts

Article summary XII

2 minute read

Summary One often overlooked complication with high-throughput studies is batch effects, which occur because measurements are affected by laboratory conditio...

Article summary XI

2 minute read

Summary Gene Set Enrichment Analysis (GSEA), first introduced by Mootha et al., is a method to identify classes of genes or proteins that are over-represente...

Article summary X

2 minute read

Summary Jan Brauner et al. published “Inferring the effectiveness of government interventions against COVID-19” in Science. The authors gathe...

Article summary IX

2 minute read

Summary The paper “Genetic Signatures of Exceptional Longevity in Humans”, published in PLOS ONE in 2012 by lead researchers Paola Sebastiani of Boston Unive...

Article summary VIII

2 minute read

Summary Benjamini et al. suggested a new point of view on the problem of multiple hypothesis testing, which was to control the expected proportion of errors ...

Article summary VII

2 minute read

Summary Benzer (1955) described a functionally related region in the genetic material of a bacteriophage that was finely subdivisible by mutation and by gene...

Article summary VI

2 minute read

Summary Using a variety of techniques, Parvanov et al., Baudat et al., and Myers et al. described the major discovery of PR domain zinc finger protein 9 (PRD...

Article summary V

2 minute read

Summary The bootstrap, introduced by Bradley Efron (1979), is a nonparametric way of estimating standard errors by resampling the data. Freedman and Peters (...

Article summary IV

2 minute read

Summary Eklund et al. used large scale experimental data rather than simulated data to test the validity of statistical methods for functional magnetic reson...

Article summary III

2 minute read

Summary Following the report that detailed numerous problems with the data, Baggerly and Coombes identified the 5 things that should be supplied when article...

Article summary II

2 minute read

Summary Baggerly and Coombes examined several related papers purporting to use microarray-based signatures of drug sensitivity derived from cell lines to pre...

Article summary I

2 minute read

Summary David Donoho’s “50 Years of Data Science” provided very thoughtful retrospectives and perspectives on data science and how it relates to Statistics. ...

Article summary XIV

2 minute read

Summary t-distributed Stochastic Neighbor Embedding (usually called t-SNE), proposed by van der Maaten and Hinton in 2008, is an unsupervised, non-linear tec...

Article summary XIII

2 minute read

Summary In the paper: “Statistical Modeling: The Two Cultures”, Leo Breiman, the creator of the random forest algorithm, describes two contrasting approaches...

Article summary XII

2 minute read

Summary Deep learning methods have been applied to biology and medicine for several decades, but recently the field has seen a true explosion of interest bec...

Article summary XI

1 minute read

Summary LeCun et al. (2015) described the process of deep learning, which discovers intricate structure in large data sets by using the backpropagation algori...

Article summary X

2 minute read

Summary Lex et al. (2014) introduced UpSet, a visualization technique that employs a matrix-based layout to show intersections of sets and their sizes. UpSet ...

Article summary IX

2 minute read

Summary Ryan and colleagues performed a medication-wide association study (MWAS) and scanned 88 to 118 drugs in a case-control setting in cohorts derived fro...

Article summary VIII

2 minute read

Summary Pearl introduced empirical researchers to the latest developments in causal inference and emphasized the paradigmatic shift that must be carried out ...

Article summary VII

2 minute read

Summary Rosenbaum and Rubin suggested the use of so-called balancing scores b(X), i.e. functions of the relevant observed covariates X such that the conditio...

Article summary VI

2 minute read

Summary This Berry commentary provided new analyses and evaluations to address some important issues in the question of whether or not to recommend regular ma...

Article summary V follow up

2 minute read

Summary The Stampfer commentary pointed out some of the weaknesses inherent in the application of the “intention-to-treat” principle to analyze the observati...

homework_1 (Turning Table into Graph)

3 minute read

The table showed age-adjusted annual incidence and mortality rates (per 100 000 person-years), disability-adjusted life-years (DALYs) lost, prevalence (per 1...

Article summary V

2 minute read

Summary In this issue, Hernán et al. reanalyzed Nurses’ Health Study (NHS) data on hormone therapy and heart disease using an observational analog of intentio...

Article summary IV

1 minute read

Summary Machine learning (ML) has been employed to develop cancer risk models for nonmelanoma skin cancer (NMSC). This study is a good example. In this study...

Article summary III

2 minute read

Summary The Lipid Research Clinics Coronary Primary Prevention Trial (LRC-CPPT) tested the efficacy of lowering cholesterol levels in reducing the risk of co...

Article summary II

1 minute read

Summary In the mid-1800s, London physician John Snow made a startling observation that would change the way that we view diseases and how they propagate. In ...

Article summary I

1 minute read

Summary In this paper, Gelman et al. went through seven interesting and representative examples of tables from the March 2000 issue of the Journal of the Ame...