First and last name of the author. Title of the thesis BACHELOR THESIS. Charles University in Prague Faculty of Mathematics and Physics

Similar documents
Title of the Book. A. U. Thor XYZ University

LAB 5 INSTRUCTIONS LINEAR REGRESSION AND CORRELATION

Geotechnical Engineering Laboratory (CE 4348) Laboratory Report #1. Soil Classification. Group #1. Group member #1 Group member #2 Group member #3

A (Mostly) Correctly Formatted Sample Lab Report. Brett A. McGuire Lab Partner: Microsoft Windows Section AB2

DIMACS Book Series Sample

982T1S2 Assignment Topic: Trigonometric Functions Due Date: 28 th August, 2015

Richter Scale and Logarithms

ADVICE ON MATHEMATICAL WRITING

Warner, R. M. (2008). Applied Statistics: From bivariate through multivariate techniques. Thousand Oaks: Sage.

Regressions of Olympic Proportions

Introduction. How to use this book. Linear algebra. Mathematica. Mathematica cells

2 One-dimensional motion with constant acceleration

This is logically equivalent to the conjunction of the positive assertion Minimal Arithmetic and Representability

1 Introduction to Minitab

Properties of Continuous Probability Distributions The graph of a continuous probability distribution is a curve. Probability is represented by area

UWO CS2214 Feb. 4, Assignment #1 Due: Feb. 12, 2017, by 23:55 Submission: on the OWL web site of the course

Just Enough Likelihood

Commentary on candidate 9 evidence (Refractive Index)

Lab 1 Uniform Motion - Graphing and Analyzing Motion

PHYS 228 Template Example

Introduction to Uncertainty and Treatment of Data

1 A Review of Correlation and Regression

Writing MSc projects. General advice Discussion of some short pieces of mathematical writing Typesetting Questions?

How to Write a Laboratory Report

Biostatistics Student Seminar An Introduction To L A T E X

COMP 409: Logic Homework 5

How many states. Record high temperature

Paper Reference. Paper Reference(s) 6665/01 Edexcel GCE Core Mathematics C3 Advanced Level. Monday 12 June 2006 Afternoon Time: 1 hour 30 minutes

New Mexico Tech Hyd 510

Lab 6A: Application of Coulomb s Law, With Vernier Video Simulation

Chapter 2. Review of Mathematics. 2.1 Exponents

Collecting and Reporting Data

Describing the Relationship between Two Variables

Experiment 0 ~ Introduction to Statistics and Excel Tutorial. Introduction to Statistics, Error and Measurement

Statistics 5100 Spring 2018 Exam 1

BASIC TECHNOLOGY Pre K starts and shuts down computer, monitor, and printer E E D D P P P P P P P P P P

DISCRETE RANDOM VARIABLES EXCEL LAB #3

Exponential Functions Dr. Laura J. Pyzdrowski

GUIDED NOTES 4.1 LINEAR FUNCTIONS

Errata as of September 4, 2003 Biostatistical Methods: The Assessment of Relative Risks John M. Lachin

MATH 1150 Chapter 2 Notation and Terminology

Course Number 432/433 Title Algebra II (A & B) H Grade # of Days 120

College Algebra Through Problem Solving (2018 Edition)

Midterm. You may use a calculator, but not any device that can access the Internet or store large amounts of data.

ISO INTERNATIONAL STANDARD. Geographic information Spatial referencing by coordinates

Parameter Estimation and Hypothesis Testing in Linear Models

Accessible Topic - Topics accessible to visually impaired students using a screen reader.

ASSIGNMENT 3 SIMPLE LINEAR REGRESSION. Old Faithful

Using Microsoft Excel

INTERNATIONAL STANDARD

Prerequisite: STATS 7 or STATS 8 or AP90 or (STATS 120A and STATS 120B and STATS 120C). AP90 with a minimum score of 3

Chemical Engineering Department University of Florida UNIT OPERATIONS LABORATORY. ECH 4224 L and ECH 4404 L GUIDELINES FOR WRITING LABORATORY REPORTS

Logarithmic Functions

A Note on Bayesian Inference After Multiple Imputation

The Model Building Process Part I: Checking Model Assumptions Best Practice

ArcGIS Pro: Essential Workflows STUDENT EDITION

physicsandmathstutor.com Paper Reference Core Mathematics C3 Advanced Level Monday 23 January 2006 Afternoon Time: 1 hour 30 minutes

Bishop Kelley High School Summer Math Program Course: Algebra 1 Part 2 Fall 2013

Passing-Bablok Regression for Method Comparison

System of Equations Review

Lab 6: Linear Algebra

Introduction to Computer Tools and Uncertainties

) The cumulative probability distribution shows the probability

STAT 350 Final (new Material) Review Problems Key Spring 2016

Edexcel GCE Core Mathematics C2 Advanced Subsidiary

Midterm Review Honors ICM Name: Per: Remember to show work to receive credit! Circle your answers! Sets and Probability

Student Technology Standards Scope and Sequence

Bootstrap metody II Kernelové Odhady Hustot

Joy Allen. March 12, 2015

4.1 Introduction. 4.2 The Scatter Diagram. Chapter 4 Linear Correlation and Regression Analysis

Math 70 and 95 Master Syllabus Beginning and Intermediate Algebra, 5th ed., Miller, O Neill, Hyde with ALEKS.

STAT FINAL EXAM

Learning Packet. Lesson 5b Solving Quadratic Equations THIS BOX FOR INSTRUCTOR GRADING USE ONLY

Edexcel GCE Core Mathematics C2 Advanced Subsidiary

Algorithms Exam TIN093 /DIT602

Box-Cox Transformations

Newton s Cooling Model in Matlab and the Cooling Project!

Introduction Snippets Conclusion References. Beamer snippets. October 21, Decock Beamer snippets

THE CRYSTAL BALL SCATTER CHART

Guidelines for the preparation of reports for the short experiments and the technological work

Statistical Practice

PHYA2. General Certificate of Education Advanced Subsidiary Examination January Mechanics, Materials and Waves

8.EE The Intersection of Two

MAS223 Statistical Inference and Modelling Exercises

Cambridge International AS and A Level Physics. Paper 5 Planning, Analysis and Evaluation

Core Mathematics C4 Advanced Level

1. Definition of a Polynomial

CLUe Training An Introduction to Machine Learning in R with an example from handwritten digit recognition

Quantification of JEOL XPS Spectra from SpecSurf

Do Now 18 Balance Point. Directions: Use the data table to answer the questions. 2. Explain whether it is reasonable to fit a line to the data.

Paper Reference. Paper Reference(s) 6665/01 Edexcel GCE Core Mathematics C3 Advanced. Friday 6 June 2008 Afternoon Time: 1 hour 30 minutes

Lab #10 Atomic Radius Rubric o Missing 1 out of 4 o Missing 2 out of 4 o Missing 3 out of 4

The Monte Carlo Method

STUDY GUIDE Math 20. To accompany Intermediate Algebra for College Students By Robert Blitzer, Third Edition

Time: 1 hour 30 minutes

REALIZING TOURNAMENTS AS MODELS FOR K-MAJORITY VOTING

Nonlinear Regression. Summary. Sample StatFolio: nonlinear reg.sgp

GUIDED NOTES 5.6 RATIONAL FUNCTIONS

A Simple Introduction to EES Version (Handout version 5.1) Copyright C. S. Tritt, Ph.D. September 20, 2005

Transcription:

Charles University in Prague Faculty of Mathematics and Physics BACHELOR THESIS First and last name of the author Title of the thesis Name of the department or institute Supervisor of the bachelor thesis: Study programme: Specialization: RNDr. Name Surname, Ph.D. Mathematics Specialization Prague 2013

I thank... for.... (Not required.)

I declare that I carried out this bachelor thesis independently, and only with the cited sources, literature and other professional sources. I understand that my work relates to the rights and obligations under the Act No. 121/2000 Coll., the Copyright Act, as amended, in particular the fact that the Charles University in Prague has the right to conclude a license agreement on the use of this work as a school work pursuant to Section 60 paragraph 1 of the Copyright Act. In...................... on............. Author signature

Název práce: Czech title according to SIS Autor: First and last name of the author Katedra: Czech name of the department or institute Vedoucí bakalářské práce: RNDr. Name Surname, Ph.D., department Abstrakt: The abstract in Czech, 80 200 words long, not a copy of the assignment. Klíčová slova: 3 to 5 Czech keywords Title: English title according to the thesis assignment Author: First and last name of the author Department: English name of the department or institute Supervisor of the bachelor thesis: RNDr. Name Surname, Ph.D., department Abstract: The abstract in English, 80 200 words long, not a copy of the assignment. Keywords: 3 to 5 English keywords

Contents 1 Typesetting of mathematical text 2 1.1 Basic examples............................ 2 1.2 Mathematical expressions...................... 2 1.3 Definitions, theorems, proofs,..................... 4 2 Citations 5 2.1 Some examples............................ 5 3 Tables, figures, software code 6 3.1 Tables................................. 6 3.2 Figures................................. 7 3.3 Software code............................. 7 Bibliography 12 List of Figures 13 List of Tables 14 1

Chapter 1 Typesetting of mathematical text This chapter demonstrates a few examples of mathematical text typesetting. 1.1 Basic examples A number in the mathematical mode with decimal point: π. = 3.141 592 653 589. Test on a 5% level, 95% confidence interval. We have var(x) = E X 2 ( E X ) 2. 1.2 Mathematical expressions Let x 1 X =.. Note the period after the matrix. The mathematical text is a part of the sentence grammar and requires standard punctuation. Expressions that will be referenced should be numbered: x n x 1 X =. x n. (1.1) Expression (1.1) defines matrix X. To achieve better readability and organization, it is advised to number only those expressions that are referenced. Do not number all displayed expressions automatically. Aligning of expressions into several columns: S(t) = P(T > t), t > 0 (right-continuous), F (t) = P(T t), t > 0 (right-continuous). Two binded equations: } S(t) = P(T > t) F (t) = P(T t) t > 0 (right-continuous). (1.2) 2

Two centered unnumbered equations: Two centered numbered equations: Y = Xβ + ε, 1 x 1 X =... 1 x n Y = Xβ + ε, (1.3) 1 x 1 X =... (1.4) 1 x n Definition organized by cases: { 0 if r j is odd, P r j = r! ( 1) (r j)/2 if r j is even. Note the use of punctuation in the equation. Commas and periods are placed according to the standard English language rules. x = y 1 y 2 + y 3 y 5 + y 8... by (1.3) = y y by (1.4) = y(0)y by Axiom 1. (1.5) Two unnumbered aligned equations: n L(θ) = f i (y i ; θ), i=1 l(θ) = log { L(θ) } = n log { f i (y i ; θ) }. Two aligned equations, the first numbered: n L(θ) = f i (y i ; θ), (1.6) i=1 l(θ) = log { L(θ) } = i=1 n log { f i (y i ; θ) }. Two-line equation, the first line aligned left, the second line aligned right, unnumbered: l(µ, σ 2 ) = log { L(µ, σ 2 ) } n = log { f i (y i ; µ, σ 2 ) } i=1 3 i=1 = n 2 log(2πσ2 ) 1 2σ 2 n (y i µ) 2. i=1

Two-line equation, aligned to =, numbered in the middle: l(µ, σ 2 ) = log { L(µ, σ 2 ) } = n log { f(y i ; µ, σ 2 ) } i=1 = n 2 log(2πσ2 ) 1 2σ 2 n (y i µ) 2. i=1 (1.7) 1.3 Definitions, theorems, proofs,... Constructions like definitions, theorems, examples,..., should be separated from the surrounding text and (usually) numbered, with the use of cross-references. For each of these constructions, the main source file (BcPrace.tex) should define an environment providing its visual separation from the surrounding text and numbering/cross-referencing features. Definition 1. Let random variables X 1,..., X n be defined on the same probability space (Ω, A, P). Then the vector X = (X 1,..., X n ) is called a random vector. Definition 2 (random vector). Let random variables X 1,..., X n be defined on the same probability space (Ω, A, P). Then the vector X = (X 1,..., X n ) is called a random vector. Definition 1 demonstrates an environment without a subtitle, definition 2 includes a subtitle. Theorem 1. The random vector X is a measurable mapping of the space (Ω, A, P) to (R n, B n ). Lemma 2 (Anděl, 2007, str. 29). The random vector X is a measurable mapping of the space (Ω, A, P) to (R n, B n ). Proof. The detailed steps of the proof are described in the book Anděl (2007, p. 29). k Theorem 1 demonstrates an environment without a subtitle, lemma 2 includes a reference in the subtitle. Lemmas were defined in the main file so that they share the same numbering with theorems. 4

Chapter 2 Citations Citations are created by commands \citet, \citep etc. (see L A TEX package natbib) and processed by running BibTEX. In mathematical publications, citations are usually formatted as Author name (year of publication) or Author name [number of the reference]). Citations by author name are easier to handle in English than in Czech, because of its simpler grammar rules. 2.1 Some examples Kaplan and Meier (1958); Cox (1972) belong among the most highly cited statistical papers. Student (1908) wrote a paper on the t-test. Prof. Anděl authored a famous textbook on mathematical statistics (see Anděl, 1998). The theory of estimation is the topic of Lehmann and Casella (1998). When a reference to a specific result (definition, theorem, proof,... ) is made it is useful to include the number of the chapter, page, or theorem in the citation, e.g., Anděl (2007, Theorem 4.22) or (see Anděl, 2007, Chapter 4). Some papers have many coauthors. When citing a paper with three coauthors we usually list all of them at the first ocassion: Dempster, Laird, and Rubin (1977) introduced the concept of the EM algorithm. Alternatively: The concept of the EM algorithm was introduced in the late seventies (Dempster, Laird, and Rubin, 1977). When that same paper is cited again, we use a short citation: Dempster et al. (1977) offer several examples of the use of the EM algorithm. Papers with more than three coauthors are always cited in the short form: The first results from Project ACCEPT were published by Genberg et al. (2008). We never cite this paper as Genberg, Kulich, Kawichai, Modiba, Chingono, Kilonzo, Richter, Pettifor, Sweat, and Celentano (2008). 5

Chapter 3 Tables, figures, software code The inclusion of tables and figures in a scientific publication follows certain common and certain specific rules. Tables and figures are not included inside the text but placed either on dedicated pages or floated at the top or the bottom of a text page. L A TEX handles floating figures and tables automatically. Every table and figure must be numbered and accompanied with a legend. The legend should describe the contents of the table of figure with enough detail so that the reader can understand them without studying the text of the publication. Each table and figure should be refrenced by its number in the text. The text should summarize the most important conclusions that can be drawn from the table of figure. The text should be easy to follow and understand even without seeingf the figures and tables (and on the contrary, the figures and tables should be easy to understand even without reading the text). Figures and tables should be referenced indirectly in the sentences: instead of Table 3.1 shows that men are on average 9.9 kg heavier than women we write Men are on average 9, 9 kg heavier than women (see Table 3.1). 3.1 Tables Tables should be formatted according to the following rules: Avoid vertical lines. Separate the table from the surrounding text (even the legend) by stronger horizontal lines. Separate the header from the table and different parts of the table from each other by thinner horizontal lines. This table format can be obtained in L A TEX by loading the package booktabs. If a stronger separation of the columns is desired it can be achieved by including an additional vertical space. Std. Effect Estimate error a P-value Intercept 10.01 1.01 Gender (male) 9.89 5.98 0.098 Height (cm) 0.78 0.12 < 0.001 Note: a Standard error of the estimator by the Monte Carlo method. Table 3.1: Maximum likelihood estimates from model M. 6

Do not change the type, format and meaning of the cells in a single column (never include means in some cells and percentages in other cells of the same column). Do not repeat the same cell contents many times. If the table includes the column Variance, which contains the value 0.5 in the first ten cells and 1.5 in the next ten cells, delete this column and find another way to communicate the value of the variance. For example, the table can be divided into two separate tables and the variance given in the legend. Or additional rows can be included between the parts of the table informing what was the variance in the subsequent rows. Numeric columns in the table should be aligned on the decimal point. Tables sometimes include abbreviations that are not used elsewhere in the text. These abbreviations should be explained in the legend or in notes under the table. These notes can be also used to provide more detail on the meaning of selected columns or cells. 3.2 Figures Some advice on figures and diagrams: Create the figure in the same size that will be used in the thesis. Excessive magnification or reduction of figures causes poor readability. The axes of a graph must be carefully annotated in the same language the thesis is written in. Units of measurement (kg, minutes,... ) should be provided when applicable. When the graph plots a function h(x) the axes should be annotated by x a h(x). Each axis must have a clearly defined scale (tickmarks, labels). If a two-dimensional scatterplot includes a large number of points make sure that the points do not turn into black cloud. If the number of points is too large reduce the size of the plotting symbol or select a subset of the points. Plots that include thousands of points make problems in electronic documents, they increase the size too much. If the thesis is to be printed on a black-and-white printer, do not use colors. Lines can be distinguished by the type (solid, dotted,... ), areas can be filled by various shades of grey or The meaning of the line types and area shading should be explained in the legend or directly in the plot. The command \psfrag can replace parts of ps/eps files (usually annotations in the graphs) by an arbitrary sequence of L A TEX commands, as the following examples illustrate. 3.3 Software code Software code or computer output (if needed in the thesis) should be formatted differently from the other text. One option is to use L A TEX package fancyvrb (fancy verbatim), which is used to define the environment PCinout in the master file BcPrace.tex. With this environment, we can generate the following examples: 7

> mean(x) [1] 158.90 > objekt$mean [1] 158.90 Smaller font: > mean(x) [1] 158.90 > objekt$mean [1] 158.90 No frame: > mean(x) [1] 158.90 > objekt$mean [1] 158.90 Narrow frame: > mean(x) [1] 158.90 > objekt$mean [1] 158.90 8

Plot y 2 1 0 1 2 3 2 1 0 1 2 3 x Figure 3.1: Random sample from distribution N 2 (0, I). 9

f(x) 0.00 0.02 0.04 0.06 0.08 m = 100, s = 15 m = 110, s = 10 m = 120, s = 5 60 80 100 120 140 x Figure 3.2: Densities of several normal distributions. 10

m = 100, s = 15 f(x) 0.00 0.02 0.04 0.06 0.08 60 80 100 120 140 x m = 110, s = 10 m = 120, s = 5 f(x) 0.00 0.02 0.04 0.06 0.08 f(x) 0.00 0.02 0.04 0.06 0.08 60 80 100 120 140 x 60 80 100 120 140 x Figure 3.3: Densities of several normal distributions. 11

Bibliography J. Anděl. Statistické metody. Druhé přepracované vydání. Matfyzpress, Praha, 1998. ISBN 80-85863-27-8. J. Anděl. Základy matematické statistiky. Druhé opravené vydání. Matfyzpress, Praha, 2007. ISBN 80-7378-001-1. D. R. Cox. Regression models and life-tables (with Discussion). Journal of the Royal Statistical Society, Series B, 34(2):187 220, 1972. A. P. Dempster, N. M. Laird, and D. B. Rubin. Maximum likelihood from incomplete data via the EM algorithm. Journal of the Royal Statistical Society, Series B, 39(1):1 38, 1977. B. L. Genberg, M. Kulich, S. Kawichai, P. Modiba, A. Chingono, G. P. Kilonzo, L. Richter, A. Pettifor, M. Sweat, and D. D. Celentano. HIV risk behaviors in sub-saharan Africa and Northern Thailand: Baseline behavioral data from project Accept. Journal of Acquired Immune Deficiency Syndrome, 49:309 319, 2008. E. L. Kaplan and P. Meier. Nonparametric estimation from incomplete observations. Journal of the American Statistical Association, 53(282):457 481, 1958. E. L. Lehmann and G. Casella. Theory of Point Estimation. Second Edition. Springer-Verlag, New York, 1998. ISBN 0-387-98502-6. Student. On the probable error of the mean. Biometrika, 6:1 25, 1908. 12

List of Figures 3.1 Random sample from distribution N 2 (0, I)............. 9 3.2 Densities of several normal distributions............... 10 3.3 Densities of several normal distributions............... 11 13

List of Tables 3.1 Maximum likelihood estimates from model M............ 6 14