Title: Survival Function Analysis of Planet Orbit Semi-major Axis Distribution and Occurrence Rate Estimate

Similar documents
Title: Survival Function Analysis of Planet Size Distribution

The California-Kepler Survey. III. A Gap in the Radius Distribution of Small Planets

Discovering Exoplanets Transiting Bright and Unusual Stars with K2

Occurrence of 1-4 REarth Planets Orbiting Sun-Like Stars

Precise Stellar Parameters are Crucial for Constraining Transiting Exoplanet Interiors

MINIMUM MASS EXTRASOLAR NEBULA DERIVED FROM THE MASS OF EXTRASOLAR PLANETARY SYSTEMS

Towards Better Planet Occurrence Rates from Kepler and K2 Andrew Vanderburg

Planet Occurrence Within 0.25 AU

Hunting Habitable Shadows. Elizabeth Tasker

Imprints of Formation on Exoplanets

Revealing the Fundamental Properties of Exoplanets and their Host Stars with Alopeke

Importance of the study of extrasolar planets. Exoplanets Introduction. Importance of the study of extrasolar planets

Calculating the Occurrence Rate of Earth-Like Planets from the NASA Kepler Mission

The Physics of Exoplanets

Kepler s Multiple Planet Systems

Internal structure and atmospheres of planets

Dynamical Stability of Terrestrial and Giant Planets in the HD Planetary System

Multiplanet Systems as Laboratories for Planet Formation

Super-Earths as Failed Cores in Orbital Migration Traps

II Planet Finding.

Search for & Characterizing Small Planets with NASA s Kepler Mission

Why Search for Extrasolar Planets?

The exoplanet eccentricity distribution from Kepler planet candidates

Science Olympiad Astronomy C Division Event National Exam

Astronomy 1140 Quiz 4 Review

Hierarchical Bayesian Modeling of Planet Populations. Angie Wolfgang NSF Postdoctoral Fellow, Penn State

Origins of Gas Giant Planets

Small Planets, Small Stars:

REDUCED LIGHT CURVES FROM CAMPAIGN 1 OF THE K2 MISSION

Young Solar-like Systems

SIMULTANEOUS FORMATION OF GIANT PLANETS

4. Direct imaging of extrasolar planets. 4.1 Expected properties of extrasolar planets. Sizes of gas giants, brown dwarfs & low-mass stars

arxiv:astro-ph/ v1 2 Oct 2002

The formation & evolution of solar systems

Extrasolar planets and their hosts: Why exoplanet science needs X-ray observations

Data from: The Extrasolar Planet Encyclopaedia.

Characterizing Small Planets via Spectroscopy of their Host Stars Johanna Teske in collaboration with

Lecture 20: Planet formation II. Clues from Exoplanets

Kepler Planets back to the origin

[25] Exoplanet Characterization (11/30/17)

Exoplanet Mass, Radius, and the Search for Habitable Worlds

Radiation from planets

8. Solar System Origins

II. Results from Transiting Planets. 1. Global Properties 2. The Rossiter-McClaughlin Effect

Outline. RV Planet Searches Improving Doppler Precision Population Synthesis Planet Formation Models Eta-Earth Survey Future Directions

Architecture and demographics of planetary systems

Planet formation in protoplanetary disks. Dmitry Semenov Max Planck Institute for Astronomy Heidelberg, Germany

Origin of the Solar System

WFIRST Preparatory Science (WPS) Project: The Circumstellar Environments of Exoplanet Host Stars (NNH14ZDA001N-WPS; PI: Christine Chen)

Stars from Kepler. Courtney Dressing

Life in the Universe (1)

Helmut Lammer Austrian Academy of Sciences, Space Research Institute Schmiedlstr. 6, A-8042 Graz, Austria (

Extrasolar Planets. Properties Pearson Education Inc., publishing as Pearson Addison-Wesley

How Shadowing and Illumination in Disks Affect Planet Formation

Why Should We Expect to Find Other Planets? Planetary system formation is a natural by-product of star formation

On the relation between stars and their planets

What makes a planet habitable?

Extrasolar Planets. Materials Light source to mimic star Ball to mimic planet Light meter Interface

Direct imaging of extra-solar planets

Who was here? How can you tell? This is called indirect evidence!

EXOPLANET LECTURE PLANET FORMATION. Dr. Judit Szulagyi - ETH Fellow

ASTR 200 : Lecture 6 Introduction to the Solar System Pearson Education Inc., publishing as Addison-Wesley

Lecture 12: Extrasolar planets. Astronomy 111 Monday October 9, 2017

NASA's Kepler telescope uncovers a treasure trove of planets

A New Empirical Constraint on the Prevalence of Technological Species in the Universe. - A. Frank 1 & W.T. Sullivan, III 2

What makes a planet habitable?

III The properties of extrasolar planets

COMPOSITIONS OF HOT SUPER-EARTH ATMOSPHERES: EXPLORING KEPLER CANDIDATES

10/16/ Detecting Planets Around Other Stars. Chapter 10: Other Planetary Systems The New Science of Distant Worlds

Which of the following correctly describes the meaning of albedo?

Revisiting the Radius Gap with Gaia DR2

Searching For Planets Like Earth around stars like the Sun

How inner planetary systems relate to inner and outer debris belts. Mark Wyatt Institute of Astronomy, University of Cambridge

Introduction The Role of Astronomy p. 3 Astronomical Objects of Research p. 4 The Scale of the Universe p. 7 Spherical Astronomy Spherical

2010 Pearson Education, Inc.

Dynamical water delivery: how Earth and rocky exoplanets get wet

EART164: PLANETARY ATMOSPHERES

Observational constraints from the Solar System and from Extrasolar Planets

Minimum Radii of Super-Earths: Constraints from Giant Impacts

Finding Extra-Solar Earths with Kepler. William Cochran McDonald Observatory

Exoplanet Host Stars

HD Transits HST/STIS First Transiting Exo-Planet. Exoplanet Discovery Methods. Paper Due Tue, Feb 23. (4) Transits. Transits.

Heavy meteal rules. Vardan Adibekyan Institute of Astrophysics and Space Sciences. The star-planet connection. 1 June 2015 NAOJ, Tokyo

Planets & Life. Planets & Life PHYS 214. Please start all class related s with 214: 214: Dept of Physics (308A)

Origin of the Solar System

Exoplanet Pursuit. PLANet B. Kalpaxis Georgios Vasilainas Athanasios Vatistas Andreas

Characterization of the exoplanet host stars. Exoplanets Properties of the host stars. Characterization of the exoplanet host stars

The Transit Method: Results from the Ground

Nature and Origin of Planetary Systems f p "

Three regimes of extrasolar planets inferred from host star metallicities

STELLAR MULTIPLICITY AND ITS ROLE IN EXOPLANETARY SYSTEMS

TrES Exoplanets and False Positives: Finding the Needle in the Haystack

The Solar System LEARNING TARGETS. Scientific Language. Name Test Date Hour

The stability of planets in the Alpha Centauri system

Astronomy 330 HW 2. Outline. Presentations. ! Alex Bara

HARPS-N Contributions to the Mass-Radius

arxiv: v1 [astro-ph.sr] 28 Sep 2018

Habitability Outside the Solar System. A discussion of Bennett & Shostak Chapter 11 HNRS 228 Dr. H. Geller

Chapter 13 Lecture. The Cosmic Perspective. Seventh Edition. Other Planetary Systems: The New Science of Distant Worlds Pearson Education, Inc.

Rocky Worlds Limited to 1.8 Earth Radii by Atmospheric Escape During a Star s Extreme UV Saturation

Transcription:

Title: Survival Function Analysis of Planet Orbit Semi-major Axis Distribution and Occurrence Rate Estimate Authors: Li Zeng* 1, Stein B. Jacobsen 1, Dimitar D. Sasselov 2, Andrew Vanderburg 3 Affiliations: 1 Department of Earth & Planetary Sciences, Harvard University, 20 Oxford Street, Cambridge, MA 02138. 2 Harvard-Smithsonian Center for Astrophysics, 60 Garden Street, Cambridge, MA 02138, 3 Sagan Fellow, University of Texas at Austin Correspondence to: astrozeng@gmail.com Abstract: Applying survival function analysis to planet orbital period and semi-major axis distribution from the Kepler sample, we found that all exoplanets are uniformly distributed in (Log a) or (Log P), where a is the orbit semi-major axis and P is orbital period, with an inner cut-off of closest distance to the host star. This inner cut-off is 0.04 AU for rocky worlds (1-2 RÅ), 0.08 AU for water worlds (2-4 RÅ), and 0.4 AU for gas-rich planets (>4 RÅ). Moreover, the transitional planets (4-10 RÅ) and gas giants (>10 RÅ) have a different statistical distribution inside 0.4 AU which is uniform in a. This difference in distribution is likely caused by the difference in the efficiency of planet migration in the inner disk, and the susceptibility to host stellar irradiation, for gas-poor (<4 RÅ) versus gas-rich (>4 RÅ) planets. Armed with this knowledge, and combined with the survival function analysis of planet size distribution, we can make precise estimates of planet occurrence rate and predict the TESS mission yield. Introduction: The survival function (Clauset et al., 2009; Virkar and Clauset, 2014), also known as the complimentary cumulative distribution function (ccdf), is defined as the number of planets above a given parameter (here Period P or semi-major axis a) versus that parameter, in a log-log plot, of the Kepler planet candidates: 4433 from Q1-Q17 NASA Exoplanet Archive (Akeson et al., 2013), with false positives excluded already. SF (Survival Function) = 1- CDF (Cumulative Distribution Function) = 1 - Integral of PDF (Probability Density Function) Differentiate this SF, one gets the PDF (Probability Density Function). The survival function (SF) can tell apart different distributions. Comparing to the probability density function (PDF), it has the advantage of overcoming the large fluctuations that occur in the tail of a distribution due to finite sample sizes (Clauset et al., 2009). For example, on a loglog plot of survival function, power-law distribution appears as a straight line, while normal, log- 1

normal, or exponential distributions all have a sharp upper end cut-off in x-axis. This plot is also known as the rank-frequency plot (Newman, 2005). The comparison of the SF of the data with the straight-line SF of reference power-law distributions in Figure 1-3 are the essence of the Kolmogorov-Smirnov (K-S) test. K-S statistic simply evaluates the maximum distance between the CDFs of the data and the fitted model, and can also test the goodness-of-fit. This approach identifies the boundaries separating different regimes of probabilistic distributions in the data. Discussion: Part I. Survival function analysis of the overall planet population Figure 1 shows the SF of the orbital period (P) of all 4433 Kepler planet candidates. The bulk of it is best fit to a power-law, with power index equal to -2/3. That is, SF P^(-2/3). Differentiating this SF gives the PDF, that is, PDF P^(-5/3). Thus, the likelihood dn of finding a planet around one host star in the interval of (P~P+dP) is dn P^(-5/3) dp. However, this PDF does not represent the real distribution of planets, since we have not taken into account the geometric correction and the instrumental sensitivity correction. The geometric correction has to do with that the more distant planets have less chance of transiting the planets. Assuming random orientation of orbital planes in space, this probability of transit = $ %&'()$ *+',-&. For the planets we are considering, R./0 234567 R 974:, and R 974: R 9<5, since most host stars in the Kepler catalog are solar-type FGK main-sequence stars. Therefore, the probability of transit $ %>, a^(-1) P^(-2/3). Thus, to compensate for this./0 transit probability to back out the real probability, the PDF needs to be multiplied by a factor of./0 a P^(2/3). We name the probability distribution with geometric-correction applied as $ %>, dncorrected. Therefore, dncorrected P^(-1) dp d Log P. The instrument sensitivity causes incomplete detection of planets of small radii less than a certain radius threshold. This threshold vary from 1~1.5 RÅ depending on the orbital period range considered (Fulton et al., 2017). We will discuss it in more detail later on. Figure 2 shows the SF of orbital semi-major axis (a) for all 4433 Kepler planet candidates. Applying the same analysis, SF a^(-1), PDF a^(-2), dn a^(-2) da, and dncorrected a^(-1) da d Log a. This result is in line with the earlier result, since the Kepler s third law is a^3 P^2. Therefore, we can re-write dncorrected as dncorrected d Log a d Log P. We can identify the inner cut-off of the power-law relation in both Figure 1 and Figure 2 by pinpointing the intersection of the fitted power-law line with that of the horizontal saturation line in the rank-frequency plot. This inner cut-off is P=4-day orbital period for all Kepler planet candidates average, as shown in Figure 1. Applying the Kepler s third law for 1 solar mass hoststar, the corresponding semi-major axis is (4/365.25)^(2/3) = 0.05 AU. This value is exactly the inner cut-off identified in Figure 2. This agreement acts as a check for the validity of this method. This inner cut-off of 0.05 AU suggests that on average planets cannot get much closer than this distance to the host star due to physical limitations. Considering that 1 AU = 215 solar 2

radii, then 0.05 AU 10 solar radii. Planets inside 10 solar radii or 10 stellar radii must be highly irradiated, and experience strong tidal interactions with their host stars. All these can lead to short lifetimes for these planets if they get too close. There also exists an outer cut-off of the power-law extrapolation, partly due to the limited observational duration of the Kepler mission. The Kepler telescope was operational since December 2009 until the failure of its second reaction wheel in May 2013. The entire duration of its normal operation is 3.5 year for the data concerned in this paper, not including K2. Since it generally requires three transits to confirm one planet candidate. Thus, beyond ~3.5/3.5=1 year or ~1 AU orbit, Kepler s detection becomes very incomplete, as most detections have less than three repeats and cannot be listed as candidates. This can be seen as the SF quickly falls below the power-law extrapolation beyond this outer cut-off. More conservatively and safely, this outer cut-off can be placed at ~0.7 AU or ~200-day orbital period. If this outer cut-off is corrected, then planets are expected to be uniformly-distributed in Log a or Log P all the way up to much larger orbital distances. This expectation is in accordance with the Titius-Bode Law, which is a modified power-law giving rise to uniform distribution in Log a at large a. As in our own solar system, planets are approximately uniformly-distributed in each logarithmic-bin of semi-major axis a or orbital period P beyond the orbit of planet Mercury. Although typical Kepler planet systems are much more compact, about one-order-of-magnitude than our own solar system, the relative spacings of planets are similar. It can be explained by similar initial separations of planet forming zones in the nebula but different degrees of subsequent migrations. Furthermore, if the compactness of Kepler systems can be explained by planet migration, then this migration is likely self-similar, meaning that it preserves the relative spacings of the planet system to a large extent. There were 145,000 main-sequence stars, mostly sun-like FGK stars, being continuously monitored by the Kepler mission over its 3.5-year duration of normal operation. Out of them about 4000 planet candidates with radii larger than 1 Earth radius (>1 RÅ) were identified. With this information, we can back out the absolute occurrence rate of planets (>1 RÅ) around solartype stars. Previously, we derive: dncorrected = A d Log a, where dncorrected represents the average number of planets per host star within a given interval of semi-major axis (a~a+da), with the geometric factor of transit corrected. Here A is introduced as a dimensionless normalization factor. We can then write the average observed number of planets per host star within a given interval of semi-major axis (a~a+da) as: dnobs A $ %>, d Log a= A $ %>, B4../4./4 4 Now, total number of planets (Nobs) observed per host star is an integral of dnobs from the inner cut-off (ain 0.05 AU) to the outer cut-off (aout 1 AU): 4 H>& dn FG9 4 H>& JKLM Nobs = A $ %>, 4 I, 4 I, JN.NPLM B4./4 4 = A $ %>,./ Q K 4 I, K 4 H>& S A $ %>,./ K 4 I, Equating this to the actual number of planets observed, we can find A as: 3

T,NNN KTP,NNN = Nobs= L $ %>,./4 I, = A K./ K.KP N.NP A (YKZ ) =./.KP (K/N.NP) T,NNN KTP,NNN 1.86 Remember that this is the normalization factor A for planets (>1 RÅ), without the instrument sensitivity correction. If corrected for the pipeline incompleteness of smallest planets due to instrument sensitivity, A( 1 RÅ) 2-2.5. This result suggests that, beyond the inner cut-off of 0.05 AU, one expects to find on average about 2 planets with radii >1 RÅ, per unit natural-logarithmic bin of a, per star, all the way up to much larger orbital distances. This overall population of planets is dominated by small planets in between 1 RÅ and 4 RÅ. Part II. Survival function analysis of separate planet groups This analysis can be applied to planets divided into four radius bins (1-2 RÅ, 2-4 RÅ, 4-10 RÅ, and >10 RÅ), where the bins are identified through a model-independent survival function analysis of planet radii in a separate paper (Zeng et al., 2017). The results are shown in Figure 3. The most prominent feature is that small (gas-poor) exoplanets (<4 RÅ) have a different powerlaw dependence on orbital semi-major axis (a) than the large (gas-rich) exoplanets (>4 RÅ) within 0.4 AU: For small exoplanets (1-4 RÅ), dncorrected a ak.. da, where the power-index can be approximated as -1 for the simplicity of calculation, and considering the correction for the instrument sensitivity where the pipeline incompleteness contours go as ~a N.c (Fulton et al., 2017). For large exoplanets (>4 RÅ), dncorrected a an.tp da, where the power-index can be approximated as -1/2 for the simplicity of calculation. This difference in power-law dependence within 0.4 AU can be a result of different planetmigration mechanism and/or susceptibility to host stars influence, for small gas-poor (<4 RÅ) planets and large gas-rich (>4 RÅ) planets inside 0.4 AU. For gas-rich planets, 0.4 AU is their first cut-off, which causes the slope to change from -1 to -1/2. This first cut-off, unlike that of small planets, is incomplete, so that a fraction of them can still approach their host stars to extreme proximity (0.02 AU or about 4 solar or stellar radii from the center of the star). The transitional planets (4-10 RÅ) exhibits a transitional piece-wise power-law behavior: Inside 0.4 AU, they approach the power-law dependence with that of the gas-giants (>10 RÅ). Outside 0.4 AU, they approach the power-law dependence with that of small exoplanets (<4 RÅ). This sudden change of slope is clearly seen at 0.4 AU for gas-giants (>10 RÅ). Although, the counts of planets beyond 0.4 AU are relatively small and we need more planets from the upcoming TESS mission to confirm this behavior. Planets in different radius bins have different inner cut-offs. The inner cut-off of planets of (2-4 RÅ) is 0.086 AU, which is twice that of the inner cut-off of planets of (1-2 RÅ): 0.043 AU. Meanwhile, they share the same power-law dependence beyond their inner cut-offs. Correcting for the geometric probability of transit, even though there appears 4

to be about equal number of planet candidates observed for each of the two bins, the number of planets of (2-4 RÅ) is about twice the number of planets of (1-2 RÅ). Then correcting for pipeline incompleteness of planets between 1-1.5 RÅ (Fulton et al., 2017), these two types: (1-2 RÅ) versus (2-4 RÅ) can be approximately equal in number in the end. This supports our interpretation of planets of (1-2 RÅ) as rocky worlds, and planets of (2-4 RÅ) as water worlds that are similar in their rocky component to the rocky worlds, but possess an outer envelope of cosmic ice component dominated by H2O-ice which is about the same mass as the rocky component according to the cosmic elemental abundance (L. Zeng et al., 2018). Now, we can apply the same analysis in Part I to back out the absolute occurrence rate of different planet types around solar-type FGK main-sequence stars: (1-2 RÅ) rocky worlds: inner cut-off ain 0.04 AU, dncorrected A (Ka. Z ) a ak da, integer power-index -1 is used here for convenience of calculation. Following the same line of argument in Part I, we have: 2π 215 1,728 A (Ka. Z ) Q 1 0.04 S 145,000 0.6 When instrument sensitivity correction is made, A (Ka. Z ) canu be brought to a value close to 1. That is, on average, we expect to find about 1 planet of (1-2 RÅ) per unit natural-logarithmic interval of semi-major axis (a) outside the inner cut-off of 0.04 AU. (2-4 RÅ) water worlds: inner cut-off ain 0.08 AU, dncorrected A (.at Z ) a ak da, integer power-index -1 is used here for convenience of calculation. Following the same line of argument in Part I, we have: A (.at Z ) 2π 215 1,559 Q 1 0.08 S 145,000 1.1 Thus, similarly, we expect to find about 1 planet of (2-4 RÅ) per unit natural-logarithmic interval of semi-major axis (a) outside the inner cut-off of 0.08 AU. There is no detection incompleteness due to instrument sensitivity for planets with radii > 2 RÅ. The planets of (2-4 RÅ) are on average twice the distances from the host stars than the planets of (1-2 RÅ). This can be indicative of their initial formation distances that planets of (2-4 RÅ) are formed beyond the snowline and planets of (1-2 RÅ) are formed inside snowline. For planets of (4-10 RÅ) and (>10 RÅ), since they have a different power-law dependence on semi-major axis (a) inside 0.4 AU, we need to apply a different normalization. Let the normalization factor be B: (here B has the dimension of 1/ AU) dncorrected B d( a) So the average observed number of planets per host star within a given interval of semi-major 5

axis ain and aout is: 4 H>& 4 H>& $ Nobs = dn 4 FG9 %>, dn $ I, 4./4 mf::6m76b = %>, B d a = B $ %>, n K I, 4 I,./4./ K o4 H>& p (4-10 RÅ) transitional planets: ain 0.05 AU, aout 0.4 AU, 2π 215 358 150 B (TaKN Z ) n 1 0.05 1 0.3 p 145,000 0.7 1 AU (>10 RÅ) gas giants: ain 0.02 AU, aout 0.4 AU, B (YKN Z ) 4 H>& 2π 215 300 77 n 1 0.02 1 0.4 p 145,000 0.4 1 AU So the average number of planets per host star with geometric correction within aout is: o4 I, Thus, Ncorrected (<a F<7 ) B (oa F<7 oa s5 ) N tuvvwtxwy,(takn Z ) (< 0.4AU) B (TaKN Z ) } 0.4AU 0.05AU~ 0.3 N tuvvwtxwy,(ykn Z ) (< 0.4AU) B (YKN Z ) } 0.4AU 0.02AU~ 0.2 In summation, there is about (0.3+0.2)=0.5=1/2 planet of (>4 RÅ) per host star within 0.4 AU. Thus, large planets (>4 RÅ) are indeed much rarer than small planets (<4 RÅ) within 0.4 AU. Beyond 0.4 AU, both planets of (4-10 RÅ) and (>10 RÅ) show similar power-law dependence on semi-major axis (a) as planets of (1-2 RÅ) and (2-4 RÅ). This suggests that dncorrected d Log a is likely the primordial/generic configuration of a planet system from formation, regardless of planet sizes. When planet migration occurs, large planets (>4 RÅ) have difficulty to reach inside 0.4 AU since they are gas-rich. Beyond 0.4 AU, we could derive the normalization factor A for planets of (4-10 RÅ) and (>10 RÅ) where dncorrected A a ak da: 2π 215 A (TaKN Z ) Q 1 0.3 S A (YKN Z ) 150 145,000 0.4 2π 215 77 Q 1 0.4 S 145,000 0.3 In summary, the intrinsic relative abundance ratio of the four planet types is: A (Ka. Z ) : A (.at Z ) : A (TaKN Z ) : A (YKN Z ) = (0.6) 1.1 0.4 0.3 6

Where (0.6) is in parentheses because it is an under-estimate subject to the detection incompleteness of smallest exoplanets. The actual value should be close to 1. Thus, the completeness-corrected abundance ratio should be: A (Ka. Z ) : A (.at Z ) : A (TaKN Z ) : A (YKN Z ) 1 1 1 3 1 4 Conclusion Planets are intrinsically uniformly-distributed in Log a or Log P, as dncorrected A d Log a, with A as a dimensionless normalization constant and an inner cut-off ain. For the overall planet population of (>1 RÅ), A( 1 RÅ) 2-2.5 and ain 0.05 AU. If dividing planets into different regimes according to their radii as (Zeng et al., 2017): All planets (>1 R_earth) Small Planets (Gas-poor) (<4 R_earth) Large Planets (Gas-rich) (>4 R_earth) Rocky Planets (<2 R_earth) Water Worlds (2-4 R_earth) Transitional Planets (4-10 R_earth) Gas Giants (>10 R_earth) Then, (1-2 RÅ) rocky planets: A (Ka. Z ) 0.6 (~1 after completeness correction), ain 0.04 AU (2-4 RÅ) water worlds: A (.at Z ) 1.1, ain 0.08 AU (4-10 RÅ) transitional planets: A (TaKN Z ) 0.4, ain 0.4 AU (>10 RÅ) gas giants: A (YKN Z ) 0.3, ain 0.4 AU inside 0.4 AU, large planets (>4 RÅ) shows a different distribution: dncorrected B d( a) (4-10 RÅ) transitional planets: B (TaKN Z ) 0.7/ AU, ain 0.05 AU (>10 RÅ) gas giants: B (YKN Z ) 0.4/ AU, ain 0.02 AU In general, large planets (>4 RÅ) have difficulty to come closer to their host stars than 0.4 AU. This is manifested by the first cut-off or change of slope in the SF at ~0.4 AU. This supports our identification of small planets (<4 RÅ) as gas-poor and large planets (>4 RÅ) as gas-rich (L. Zeng et al., 2018). This first cut-off is soft or incomplete, so that a fraction of large planets can come very close to their host stars (0.02~0.05 AU) to form warm neptunes and hot jupiters. On average, there is about 1/2 planet of (>4 RÅ) within 0.4 AU around a typical FGK-star. Comparatively, there are 3~4 planets of (<4 RÅ) within 0.4 AU around a typical FGK-star. Small exoplanets (<4 RÅ) have sharp inner cut-offs of closest distances to their host stars. Water 7

worlds (2-4 RÅ) has inner cut-off at ~0.08 AU (~20 solar radii away, or about 1000 K surface equilibrium temperature with Earth-like albedo), which is about twice the distance as the rocky worlds (1-2 RÅ) inner cut-off at ~0.04 AU (~10 solar radii away, or about 1400 K surface equilibrium temperature with Earth-like albedo). The water worlds (2-4 RÅ) and rocky worlds (1-2 RÅ), if corrected for detection incompleteness, likely have comparable occurrence rate, that is, one expects to find one of each, per unit natural-logarithmic interval of semi-major axis (a), per host star. Armed with this knowledge, and combined with the survival function of planet size distribution (Zeng et al., 2017), we can make predictions for the TESS mission yield. References: Akeson, R.L., Chen, X., Ciardi, D., Crane, M., Good, J., Harbut, M., Jackson, E., Kane, S.R., Laity, A.C., Leifer, S., Lynn, M., Mcelroy, D.L., Papin, M., Plavchan, P., Ramírez, S. V, Rey, R., Braun, K. Von, Wittman, M., Abajian, M., Ali, B., Beichman, C., Beekley, A., Berriman, G.B., Berukoff, S., Bryden, G., Chan, B., Groom, S., Lau, C., Payne, A.N., Regelson, M., Saucedo, M., Schmitz, M., Stauffer, J., Wyatt, P., Zhang, A.A., 2013. The NASA Exoplanet Archive: Data and Tools for Exoplanet Research. Publ. Astron. Soc. Pacific 125, 989. Clauset, A., Rohilla Shalizi, C., J Newman, M.E., 2009. Power-Law Distributions in Empirical Data. Soc. Ind. Appl. Math. Rev. 661 703. Fulton, B.J., Petigura, E.A., Howard, A.W., Isaacson, H., Marcy, G.W., Cargile, P.A., Hebb, L., Weiss, L.M., Johnson, J.A., Morton, T.D., Sinukoff, E., Crossfield, I.J.M., Hirsch, L.A., 2017. The California- Kepler Survey. III. A Gap in the Radius Distribution of Small Planets. Astron. J. 154, 109. https://doi.org/10.3847/1538-3881/aa80eb Newman, M.E.J., 2005. Power laws, Pareto distributions and Zipf s law. Contemp. Phys. 46, 323 351. https://doi.org/10.1080/00107510500052444 Virkar, Y., Clauset, A., 2014. Power-law distributions in binned empirical data. Ann. Appl. Stat. 8, 89 119. https://doi.org/10.1214/13-aoas710 Zeng, A.L., Jacobsen, S.B., Sasselov, D.D., Vanderburg, A., Lopez-Morales, M., Perez- Mercader, J., Petaev, M.I., Mattsson, T.R., 2018. Growth Model Interpretation of Planet Size Distribution. Submitt. to Nat. 1 15. Zeng, L., Jacobsen, S.B., Sasselov, D.D., Vanderburg, A., 2017. Survival Function Analysis of Planet Size Distribution. Submitt. to MNRAS. Acknowledgements: This work was partly supported by a grant from the Simons Foundation (SCOL [award #337090] to L.Z.). Part of this research was also conducted under the Sandia Z Fundamental Science Program and supported by the Department of Energy National Nuclear Security Administration under Award Numbers DE-NA0001804 and DE-NA0002937 to S. B. Jacobsen (PI) with Harvard University. This research is the authors views and not those of the DOE. Sandia National Laboratories is a multimission laboratory managed and operated by National Technology and Engineering Solutions of Sandia, LLC., a wholly owned subsidiary of Honeywell International, Inc., for the U.S. Department of Energy s National Nuclear Security Administration under contract DE-NA-0003525. 8

# Kepler planets with Period > P 5000 2000 1000 500 200 4-day 9.3-0.66 x 1 5 10 50 100 Planet Orbital Period P (days) Figure 1. Survival function of planet orbital period P (days) for Kepler planet candidates (4433 from Q1-Q17 NASA Exoplanet Archive (Akeson et al., 2013), false positives excluded already). The best fit to power-law is in natural-logarithm shown as the red line. The slope of this power-law is exactly 2/3 in log-log plot. The inner cut-off is 4-day orbital period, as identified by the intersection of the extrapolation of the power-law with the horizontal line representing the total number of planets in this case. # Kepler planets with Semi-major axis > a 5000 2000 1000 500 200 0.05 AU 5.4 - x 0.01 0.05 0.10 0.50 1 Orbital Separation a (AU) Figure 2. Survival function of planet semi-major axis a (AU) for Kepler planet candidates (4433 from Q1-Q17 NASA Exoplanet Archive (Akeson et al., 2013), false positives excluded already). The best fit to power-law is in natural-logarithm shown as the green line. The inner cut-off is identified by the intersection of the extrapolation of the power-law with the horizontal line representing the total number of planets in this case. It is 0.05 AU, corresponding to exactly 4-day orbital period for a planet around 1 solar mass star, as expected, because most host stars in Kepler sample are solar type FGK stars with masses close to 1 solar mass. 4433 4433 9

1728 1559 1000 Survival Function of Semi-major Axis > a 500 100 50 358 300 200 150 100 77 1-2 R 2-4 R 0.020 AU 4.5-0.45 x 0.043 AU 3.55-1.2 x 0.086 AU 3.7-1.2 x 3.94-0.45 x 0.3 AU 0.4 AU 3.25-1.2 x 4.4-1.2 x 4-10 R >10 R 10 0.01 0.05 0.10 0.50 1 Orbital Separation a (AU) Figure 3. Survival function analysis of Kepler planet candidates (4433 from the Q1-Q17 NASA Exoplanet Archive (Akeson et al., 2013), false positives excluded already), divided in four radius bins: red: 1-2 RÅ, cyan: 2-4 RÅ, green: 4-10 RÅ, and blue: >10 RÅ. This classification scheme based on planet radii is adopted from the survival function analysis of planet radius distribution (Zeng et al., 2017). The straight lines are power-law fits expressed in natural logarithm. 10