Worst-Case Execution Time Analysis. LS 12, TU Dortmund
|
|
- Hubert Dorsey
- 5 years ago
- Views:
Transcription
1 Worst-Case Execution Time Analysis Prof. Dr. Jian-Jia Chen LS 12, TU Dortmund 09/10, Jan., 2018 Prof. Dr. Jian-Jia Chen (LS 12, TU Dortmund) 1 / 43
2 Most Essential Assumptions for Real-Time Systems Upper bound on the execution times: Commonly, called the Worst-Case Execution Time(WCET) Prof. Dr. Jian-Jia Chen (LS 12, TU Dortmund) 2 / 43
3 What does Execution Time Depend on Input parameters Algorithm parameters Problem size etc. Initial states and intermediate states of the system for executing Cache configuration, replacement policies Pipelines Speculations etc. Interferences from the environment Scheduling Interrupts etc. Prof. Dr. Jian-Jia Chen (LS 12, TU Dortmund) 3 / 43
4 How to Derive the Worst-Case Execution Time (WCET) Most of industry s best practice Measure it: determine WCET directly by running or simulating a set of inputs. There is no guarantee to give an upper bound of the WCET. The derived WCET could be too optimistic. Exhaustive execution: by considering the set of all the possible inputs In general, not possible The inputs have to cover all the possible initial states and intermediate states of the system, which is also usually not possible. Compute it In general, not possible neither, as computing (tight) WCET for a program is uncomputable by Turing machines. Based on some structures, it is possible and the derived solution is a safe upper bound of the WCET. Prof. Dr. Jian-Jia Chen (LS 12, TU Dortmund) 4 / 43
5 Why is It Uncomputable? Halting Problem Given the description of a Turing machine m and its input x, the problem is to answer whether the machine halts on x. Theorem Halting Problem is undecidable (uncomputable). In other words, one cannot use an algorithm to decide whether another algorithm m halts on a specific input. WCET is undecidable It is even undecidable if it terminates at all. Deriving the WCET is of course of undecidable. Please refer to the textbook of Computational Complexity by Prof. Papadimitriou. Prof. Dr. Jian-Jia Chen (LS 12, TU Dortmund) 5 / 43
6 Execution Time Distribution Our objectives: Upper bound of execution time as tightly as possible. All control-flow paths, by considering all possible inputs. All paths through the architecture, resulting from the potential initial and assumed intermediate architectural states. Prof. Dr. Jian-Jia Chen (LS 12, TU Dortmund) 6 / 43
7 Timing Analysis By considering systems, in general, with finite architectural configurations, finite input domains, and bounded loops and recursion, WCET is computable. Prof. Dr. Jian-Jia Chen (LS 12, TU Dortmund) 7 / 43
8 Timing Analysis By considering systems, in general, with finite architectural configurations, finite input domains, and bounded loops and recursion, WCET is computable. But..., the search space is too large to explore it exhaustively! Prof. Dr. Jian-Jia Chen (LS 12, TU Dortmund) 7 / 43
9 Why is It Hard for Analyzing WCET? Execution time e(i) of machine instruction i In the good old time: e(i) is a constant c, which could be found in the data sheet Nowadays, especially for high-performance processors: e(i) also depends on the (architectural) execution state s. min{e(i, s) s S} e(i) max{e(i, s) s S}, where S is the set of all states. Using max{e(i, s) s S} is safe for WCET, but might be not tight since some states in S might not be possible reached by some inputs. Execution history, resulting in a smaller set of reachable execution states, has to be enforced to improve the tightness of the analysis. Prof. Dr. Jian-Jia Chen (LS 12, TU Dortmund) 8 / 43
10 Variability of Execution Times Wilhelm et al. Prof. Dr. Jian-Jia Chen (LS 12, TU Dortmund) 9 / 43
11 Timing Accidents and Penalties Timing Accident: cause for an increase of the execution time of an instruction Timing Penalty: the associated increase Types of timing accidents Cache misses Pipeline stalls Branch mispredictions Bus collisions Memory refresh of DRAM TLB miss Prof. Dr. Jian-Jia Chen (LS 12, TU Dortmund) 10 / 43
12 Overall Approach: Modularization Architecture Analysis: Use Abstract Interpretation. Exclude as many Timing Accidents as possible during analysis. Certain timing accidents will never happen, e.g., at a certain program point, instruction fetch will never cause a cache miss. The more accidents excluded, the lower (better) the upper bound. Determine WCET for basic blocks, based on context information. Worst-Case Path Determination: Map control flow graph to an Integer Linear Program (ILP). Determine upper bound and associated path. High-Level Objectives: Upper Bound of WCET It must be safe, i.e., not underestimate. It should be tight, i.e., not far away from real WCET. The analysis effort must be tolerable. Prof. Dr. Jian-Jia Chen (LS 12, TU Dortmund) 11 / 43
13 Overall Structure Executable Binary Program Control Flow Graph (CFG) Reconstruction Loop Analysis and Unfolding Loop Bounds Static Analysis Value Analyzer Micro Architecture Abstraction Path Analysis ILP Generator Cache/Pipeline Analyzer Timing Information ILP Solver Micro architecture Analysis WCET Visualization and Analysis Evaluation Worst Case Path Analysis Prof. Dr. Jian-Jia Chen (LS 12, TU Dortmund) 12 / 43
14 Outline Introduction Program Path Analysis Static Analysis Value Analysis Cache Analysis Prof. Dr. Jian-Jia Chen (LS 12, TU Dortmund) 13 / 43
15 Control Flow Graph (CFG) Prof. Dr. Jian-Jia Chen (LS 12, TU Dortmund) 14 / 43
16 Basic Blocks Definition: A basic block is a sequence of instructions where the control flow enters at the beginning and exits at the end, in which it is highly amenable to analysis. a[0] := b[0] + c[0] a[1] := b[3] + c[3] a[2] := b[6] + c[6] d := a[0] a[1] e := d/a[2] if e < 10 goto L Prof. Dr. Jian-Jia Chen (LS 12, TU Dortmund) 15 / 43
17 Basic Blocks Definition: A basic block is a sequence of instructions where the control flow enters at the beginning and exits at the end, in which it is highly amenable to analysis. Determining the basic blocks Beginning: the first instruction targets of un/conditional jumps instructions that follow un/conditional jumps a[0] := b[0] + c[0] a[1] := b[3] + c[3] a[2] := b[6] + c[6] d := a[0] a[1] e := d/a[2] if e < 10 goto L Ending: the basic block consists of the block beginning and runs until the next block beginning (exclusive) or until the program ends Prof. Dr. Jian-Jia Chen (LS 12, TU Dortmund) 15 / 43
18 Program Path Analysis Problem: Which sequence of instructions is executed in the worst case (i.e., the longest execution time)? Input: Timing information for each basic block, derived from static analysis (value/cache/pipeline analysis) Loop bounds by specification CFG derived from the executable binary program Basic Concept: Transform structure of CFG into a set of (integer) linear equations Solution of the Integer Linear Program (ILP) yields bound on the WCET. Prof. Dr. Jian-Jia Chen (LS 12, TU Dortmund) 16 / 43
19 Program Path Analysis: Formal Definition Input A CFG with N basic blocks, in which each basic block B i has a worst-case execution time c i, given by static analysis. Output Suppose that each block B i is executed exactly x i times. What is the worst-case execution time WCET = N c i x i, i=1 such that the values of x i s satisfy the structural constraints in the CFG? Note that additional constraints provided by the programmer (bounds for loop counters, etc.) can also be included. Prof. Dr. Jian-Jia Chen (LS 12, TU Dortmund) 17 / 43
20 Example for CFG Constraints d9 d1 s=k; d2 while (k < 20) d3 if (ok) d4 d5 j++; j=0; ok=true; d8 Flow equations: (x i is a variable) d 1 = d 2 = x 1 d 3 + d 9 = d 2 + d 8 = x 2 d 4 + d 5 = d 3 = x 3 d 6 + d 7 = d 8 = x 4 d 4 = d 6 = x 5 d 5 = d 7 = x 6 d6 k++; d7 r=j d10 Prof. Dr. Jian-Jia Chen (LS 12, TU Dortmund) 18 / 43
21 Example for Additional Constraints d9 d1 s=k; d2 while (k < 20) d3 if (ok) d4 d5 j++; j=0; ok=true; d6 d7 k++; d8 The loop is executed for at most 20 times when k is initialized with a non-negative number: x 3 20x 1. The basic block for j = 0; ok = true; is executed for at most one time: r=j d10 x 6 x 1. Prof. Dr. Jian-Jia Chen (LS 12, TU Dortmund) 19 / 43
22 WCET: ILP Formulation maximize N c i x i i=1 such that d 1 = 1 j in(b i ) d j = k out(b j ) d k = x i, additional linear constraints i = 1,..., N Prof. Dr. Jian-Jia Chen (LS 12, TU Dortmund) 20 / 43
23 Outline Introduction Program Path Analysis Static Analysis Value Analysis Cache Analysis Prof. Dr. Jian-Jia Chen (LS 12, TU Dortmund) 21 / 43
24 Overall Structure Executable Binary Program Control Flow Graph (CFG) Reconstruction Loop Analysis and Unfolding Loop Bounds Static Analysis Value Analyzer Micro Architecture Abstraction Path Analysis ILP Generator Cache/Pipeline Analyzer Timing Information ILP Solver Micro architecture Analysis WCET Visualization and Analysis Evaluation Worst Case Path Analysis Prof. Dr. Jian-Jia Chen (LS 12, TU Dortmund) 22 / 43
25 Outline Introduction Program Path Analysis Static Analysis Value Analysis Cache Analysis Prof. Dr. Jian-Jia Chen (LS 12, TU Dortmund) 23 / 43
26 Value Analysis: Motivation and Method Motivation Provide access information to data-cache/pipeline analysis Detect infeasible paths Derive loop bounds Method Calculate intervals at all program points By considering addresses, register contents, local and global variables. Abstract Interpretation Perform the program s computation using value descriptions or abstract values in place of the concrete values. Prof. Dr. Jian-Jia Chen (LS 12, TU Dortmund) 24 / 43
27 Outline Introduction Program Path Analysis Static Analysis Value Analysis Cache Analysis Prof. Dr. Jian-Jia Chen (LS 12, TU Dortmund) 25 / 43
28 Overall Structure Executable Binary Program Control Flow Graph (CFG) Reconstruction Loop Analysis and Unfolding Loop Bounds Static Analysis Value Analyzer Micro Architecture Abstraction Path Analysis ILP Generator Cache/Pipeline Analyzer Timing Information ILP Solver Micro architecture Analysis WCET Visualization and Analysis Evaluation Worst Case Path Analysis Prof. Dr. Jian-Jia Chen (LS 12, TU Dortmund) 26 / 43
29 Caches: Fast Memory to Deal with the Memory Wall How they work: dynamically managed by replacement policy hit [ab] CPU Cache Main Memory Capacity: Latency: 32 KB 3 cycles 2 MB 100 cycles Why they work: principle of locality spatial temporal Prof. Dr. Jian-Jia Chen (LS 12, TU Dortmund) 27 / 43
30 Caches: Fast Memory to Deal with the Memory Wall How they work: dynamically managed by replacement policy hit [ab] a 1 CPU Cache Main Memory Capacity: Latency: 32 KB 3 cycles 2 MB 100 cycles Why they work: principle of locality spatial temporal Prof. Dr. Jian-Jia Chen (LS 12, TU Dortmund) 27 / 43
31 Caches: Fast Memory to Deal with the Memory Wall How they work: dynamically managed by replacement policy a 1! hit [ab] CPU Cache Main Memory Capacity: Latency: 32 KB 3 cycles 2 MB 100 cycles Why they work: principle of locality spatial temporal Prof. Dr. Jian-Jia Chen (LS 12, TU Dortmund) 27 / 43
32 Caches: Fast Memory to Deal with the Memory Wall How they work: dynamically managed by replacement policy c 3? miss [ab] CPU Cache Main Memory Capacity: Latency: 32 KB 3 cycles 2 MB 100 cycles Why they work: principle of locality spatial temporal Prof. Dr. Jian-Jia Chen (LS 12, TU Dortmund) 27 / 43
33 Caches: Fast Memory to Deal with the Memory Wall How they work: dynamically managed by replacement policy miss [ab] c 3? c? CPU Cache Main Memory Capacity: Latency: 32 KB 3 cycles 2 MB 100 cycles Why they work: principle of locality spatial temporal Prof. Dr. Jian-Jia Chen (LS 12, TU Dortmund) 27 / 43
34 Caches: Fast Memory to Deal with the Memory Wall How they work: dynamically managed by replacement policy miss [ac] c 3? c = c 1 c 2 c 3 c 4! CPU Cache Main Memory Capacity: Latency: 32 KB 3 cycles 2 MB 100 cycles Why they work: principle of locality spatial temporal Prof. Dr. Jian-Jia Chen (LS 12, TU Dortmund) 27 / 43
35 Caches: Fast Memory to Deal with the Memory Wall How they work: dynamically managed by replacement policy c 3! miss [ac] CPU Cache Main Memory Capacity: Latency: 32 KB 3 cycles 2 MB 100 cycles Why they work: principle of locality spatial temporal Prof. Dr. Jian-Jia Chen (LS 12, TU Dortmund) 27 / 43
36 Caches: Fast Memory to Deal with the Memory Wall How they work: dynamically managed by replacement policy c 4? hit [ac] CPU Cache Main Memory Capacity: Latency: 32 KB 3 cycles 2 MB 100 cycles Why they work: principle of locality spatial temporal Prof. Dr. Jian-Jia Chen (LS 12, TU Dortmund) 27 / 43
37 Caches: Fast Memory to Deal with the Memory Wall How they work: dynamically managed by replacement policy c 4! hit [ac] CPU Cache Main Memory Capacity: Latency: 32 KB 3 cycles 2 MB 100 cycles Why they work: principle of locality spatial temporal Prof. Dr. Jian-Jia Chen (LS 12, TU Dortmund) 27 / 43
38 Fully Associative Caches B log2 (s) Address: b log2 (8 b) Block offset Tag Tag Tag... Data Block Data Block Tag Data Block =? Yes: Hit! k = associativity No: Miss! MUX Data Prof. Dr. Jian-Jia Chen (LS 12, TU Dortmund) 28 / 43
39 k s k s s Set-Associative Caches log2 (s) Address: Tag Index B log log22(s) (8 b)log2 (8 b) b Block offset Cache Set: Tag Tag Tag log2 (s) Special cases:... Data Block Data Block Cache Set:... Data Block log2 (8 b) Tag Tag... Data Block Data Block Tag Data Block =? Yes: Hit! k s MUX No: Miss! Data direct-mapped cache: only one line per cache set fully-associative cache: only one cache set Prof. Dr. Jian-Jia Chen (LS 12, TU Dortmund) 29 / 43
40 Replacement Policies Least-Recently-Used (LRU) used in Intel Pentium I and MIPS 24K/34K First-In First-Out (FIFO or Round-Robin) used in Motorola PowerPC 56x, Intel XScale, ARM9, ARM11 Pseudo-LRU (PLRU) used in Intel Pentium II-IV and PowerPC 75x Most Recently Used (MRU) as described in literature Each cache set is treated independently: Set-associative caches are compositions of fully-associative caches. Prof. Dr. Jian-Jia Chen (LS 12, TU Dortmund) 30 / 43
41 Cache Analysis for LRU Two types of cache analyses: 1 Local guarantees: classification of individual accesses Must-Analysis Underapproximates cache contents May-Analysis Overapproximates cache contents 2 Global guarantees: bounds on cache hits/misses Prof. Dr. Jian-Jia Chen (LS 12, TU Dortmund) 31 / 43
42 Challenges for Cache Analysis read z Always a cache hit/always a miss? read y read x write z Prof. Dr. Jian-Jia Chen (LS 12, TU Dortmund) 32 / 43
43 Challenges for Cache Analysis read z Always a cache hit/always a miss? read y read x 1. Initial cache contents unknown. 2. Different paths lead to these points. write z 3. Cannot resolve address of z. Prof. Dr. Jian-Jia Chen (LS 12, TU Dortmund) 32 / 43
44 Using Abstract Interpretation read z Collecting Semantics = set of states at each program point that any execution may encounter there read y read x Two approximations: Collecting Semantics uncomputable write z Cache Semantics computable γ(abstract Cache Sem.) efficiently computable Prof. Dr. Jian-Jia Chen (LS 12, TU Dortmund) 33 / 43
45 Using Abstract Interpretation read z Collecting Semantics = set of states at each program point that any execution may encounter there read y read x Two approximations: Collecting Semantics uncomputable write z Cache Semantics computable γ(abstract Cache Sem.) efficiently computable Prof. Dr. Jian-Jia Chen (LS 12, TU Dortmund) 33 / 43
46 Using Abstract Interpretation read z Collecting Semantics = set of states at each program point that any execution may encounter there read y read x Two approximations: Collecting Semantics uncomputable write z Cache Semantics computable γ(abstract Cache Sem.) efficiently computable Prof. Dr. Jian-Jia Chen (LS 12, TU Dortmund) 33 / 43
47 Using Abstract Interpretation read z Collecting Semantics = set of states at each program point that any execution may encounter there read y read x Two approximations: Collecting Semantics uncomputable write z Cache Semantics computable γ(abstract Cache Sem.) efficiently computable Prof. Dr. Jian-Jia Chen (LS 12, TU Dortmund) 33 / 43
48 Using Abstract Interpretation read z Collecting Semantics = set of states at each program point that any execution may encounter there read y read x Two approximations: Collecting Semantics uncomputable write z Cache Semantics computable γ(abstract Cache Sem.) efficiently computable Prof. Dr. Jian-Jia Chen (LS 12, TU Dortmund) 33 / 43
49 Least-Recently-Used (LRU): Concrete Behavior s Cache Miss : z y x t s z y x LRU has notion of age s Cache Hit : z y s t s z y t Prof. Dr. Jian-Jia Chen (LS 12, TU Dortmund) 34 / 43
50 LRU: Must-Analysis: Abstract Domain Used to predict cache hits. Maintains upper bounds on ages of memory blocks. Upper bound associativity memory block definitely cached. Example Abstract state: {x} age 0 {s,t} age 3... and its interpretation: Describes the set of all concrete cache states in which x, s, and t occur, x with an age of 0, s and t with an age not older than 2. γ([{x},, {s, t}, ]) = {[x, s, t, a], [x, t, s, a], [x, s, t, b],...} Prof. Dr. Jian-Jia Chen (LS 12, TU Dortmund) 35 / 43
51 Sound Update Local Consistency Abstract Update (must) (must ) γ Lifted Concrete Update γ concrete cache states concrete cache states Prof. Dr. Jian-Jia Chen (LS 12, TU Dortmund) 36 / 43
52 LRU: Must-Analysis: Update Potential Cache Miss : Definite Cache Hit : {x} {s,t} {x} {s,t} z s {z} {x} {s,t} {s} {x} {t} Why does not t age in the second case under Must-Cache analysis? Prof. Dr. Jian-Jia Chen (LS 12, TU Dortmund) 37 / 43
53 LRU: Must-Analysis: Join Need to combine information where control-flow merges. Join should be conservative: γ(a) γ(a B) γ(b) γ(a B) {a} {c} {c,f} {e} {a} {a,c} Intersection + Maximal Age Prof. Dr. Jian-Jia Chen (LS 12, TU Dortmund) 38 / 43
54 LRU: Must-Analysis: Join Need to combine information where control-flow merges. Join should be conservative: γ(a) γ(a B) γ(b) γ(a B) {a} {c} {c,f} {e} {a} {a,c} Intersection + Maximal Age Prof. Dr. Jian-Jia Chen (LS 12, TU Dortmund) 38 / 43
55 LRU: Must-Analysis: Join Need to combine information where control-flow merges. Join should be conservative: γ(a) γ(a B) γ(b) γ(a B) {a} {c} {c,f} {e} {a} {a,c} Intersection + Maximal Age Prof. Dr. Jian-Jia Chen (LS 12, TU Dortmund) 38 / 43
56 LRU: Must-Analysis: Join Need to combine information where control-flow merges. Join should be conservative: γ(a) γ(a B) γ(b) γ(a B) {a} {c} {c,f} {e} {a} {a,c} Intersection + Maximal Age Prof. Dr. Jian-Jia Chen (LS 12, TU Dortmund) 38 / 43
57 LRU: Must-Analysis: Join Need to combine information where control-flow merges. Join should be conservative: γ(a) γ(a B) γ(b) γ(a B) {a} {c} {c,f} {e} {a} {a,c} Intersection + Maximal Age Prof. Dr. Jian-Jia Chen (LS 12, TU Dortmund) 38 / 43
58 LRU: Must-Analysis: Join Need to combine information where control-flow merges. Join should be conservative: γ(a) γ(a B) γ(b) γ(a B) {a} {c} {c,f} {e} {a} {a,c} Intersection + Maximal Age Prof. Dr. Jian-Jia Chen (LS 12, TU Dortmund) 38 / 43
59 LRU: Must-Analysis: Join Need to combine information where control-flow merges. Join should be conservative: γ(a) γ(a B) γ(b) γ(a B) {a} {c} {c,f} {e} {a} {a,c} Intersection + Maximal Age How many memory blocks can be in the must-cache? Prof. Dr. Jian-Jia Chen (LS 12, TU Dortmund) 38 / 43
60 LRU: May-Analysis: Abstract Domain Used to predict cache misses. Maintains lower bounds on ages of memory blocks. Lower bound associativity memory block definitely not cached. Abstract state: {x,y} age 0 {s,t} {u} age 3 and its interpretation: Describes a set of all concrete cache states, where no memory blocks except x, y, s, t, and u occur, x and y with an age of at least 0, s and t with an age of at least 2, u with an age of at least 3. γ([{x, y},, {s, t}, {u}]) = {[x, y, s, t], [y, x, s, t], [x, y, s, u],...} Prof. Dr. Jian-Jia Chen (LS 12, TU Dortmund) 39 / 43
61 LRU: May-Analysis: Update Definite Cache Miss : Potential Cache Hit : {x} {s,t} {y} {x} {s,t} {y} z s {z} {x} {s,t} {s} {x} {y,t} Why does t age in the second case? Prof. Dr. Jian-Jia Chen (LS 12, TU Dortmund) 40 / 43
62 LRU: May-Analysis: Join Need to combine information where control-flow merges. Join should be conservative: γ(a) γ(a B) γ(b) γ(a B) {a} {c} {a,c} {c,f} {e} {a} = {e} {f} Union + Minimal Age Prof. Dr. Jian-Jia Chen (LS 12, TU Dortmund) 41 / 43
63 LRU: May-Analysis: Join Need to combine information where control-flow merges. Join should be conservative: γ(a) γ(a B) γ(b) γ(a B) {a} {c} {a,c} {c,f} {e} {a} = {e} {f} Union + Minimal Age Prof. Dr. Jian-Jia Chen (LS 12, TU Dortmund) 41 / 43
64 LRU: May-Analysis: Join Need to combine information where control-flow merges. Join should be conservative: γ(a) γ(a B) γ(b) γ(a B) {a} {c} {a,c} {c,f} {e} {a} = {e} {f} Union + Minimal Age Prof. Dr. Jian-Jia Chen (LS 12, TU Dortmund) 41 / 43
65 LRU: May-Analysis: Join Need to combine information where control-flow merges. Join should be conservative: γ(a) γ(a B) γ(b) γ(a B) {a} {c} {a,c} {c,f} {e} {a} = {e} {f} Union + Minimal Age Prof. Dr. Jian-Jia Chen (LS 12, TU Dortmund) 41 / 43
66 LRU: May-Analysis: Join Need to combine information where control-flow merges. Join should be conservative: γ(a) γ(a B) γ(b) γ(a B) {a} {c} {a,c} {c,f} {e} {a} = {e} {f} Union + Minimal Age Prof. Dr. Jian-Jia Chen (LS 12, TU Dortmund) 41 / 43
67 LRU: May-Analysis: Join Need to combine information where control-flow merges. Join should be conservative: γ(a) γ(a B) γ(b) γ(a B) {a} {c} {a,c} {c,f} {e} {a} = {e} {f} Union + Minimal Age Prof. Dr. Jian-Jia Chen (LS 12, TU Dortmund) 41 / 43
68 Summary of WCET Analysis Value analysis Cache analysis using statically computed effective addresses and loop bounds Pipeline analysis Model CPU as a big state machine. There have been some new results published in 2015 from Prof. Jan Reineke s group from Uni des Saarlandes. Prof. Dr. Jian-Jia Chen (LS 12, TU Dortmund) 42 / 43
69 ait-tool Prof. Dr. Jian-Jia Chen (LS 12, TU Dortmund) 43 / 43
Worst-Case Execution Time Analysis. LS 12, TU Dortmund
Worst-Case Execution Time Analysis Prof. Dr. Jian-Jia Chen LS 12, TU Dortmund 02, 03 May 2016 Prof. Dr. Jian-Jia Chen (LS 12, TU Dortmund) 1 / 53 Most Essential Assumptions for Real-Time Systems Upper
More informationCaches in WCET Analysis
Caches in WCET Analysis Jan Reineke Department of Computer Science Saarland University Saarbrücken, Germany ARTIST Summer School in Europe 2009 Autrans, France September 7-11, 2009 Jan Reineke Caches in
More informationTiming analysis and timing predictability
Timing analysis and timing predictability Caches in WCET Analysis Reinhard Wilhelm 1 Jan Reineke 2 1 Saarland University, Saarbrücken, Germany 2 University of California, Berkeley, USA ArtistDesign Summer
More informationDesign and Analysis of Time-Critical Systems Cache Analysis and Predictability
esign and nalysis of Time-ritical Systems ache nalysis and Predictability Jan Reineke @ saarland university ES Summer School 2017 Fiuggi, Italy computer science n nalogy alvin and Hobbes by Bill Watterson
More informationToward Precise PLRU Cache Analysis
Toward Precise PLRU Cache Analysis Daniel Grund Jan Reineke 2 Saarland University, Saarbrücken, Germany 2 University of California, Berkeley, USA Workshop on Worst-Case Execution-Time Analysis 2 Outline
More informationEmbedded Systems 23 BF - ES
Embedded Systems 23-1 - Measurement vs. Analysis REVIEW Probability Best Case Execution Time Unsafe: Execution Time Measurement Worst Case Execution Time Upper bound Execution Time typically huge variations
More informationDesign and Analysis of Time-Critical Systems Response-time Analysis with a Focus on Shared Resources
Design and Analysis of Time-Critical Systems Response-time Analysis with a Focus on Shared Resources Jan Reineke @ saarland university ACACES Summer School 2017 Fiuggi, Italy computer science Fixed-Priority
More informationICS 233 Computer Architecture & Assembly Language
ICS 233 Computer Architecture & Assembly Language Assignment 6 Solution 1. Identify all of the RAW data dependencies in the following code. Which dependencies are data hazards that will be resolved by
More informationA Definition and Classification of Timing Anomalies
Definition and Classification of Timing nomalies Jan Reineke 1, Björn Wachter 1, Stephan Thesing 1, Reinhard Wilhelm 1, Ilia Polian 2, Jochen Eisinger 2, and Bernd Becker 2 1 Saarland University Im Stadtwald
More informationTiming analysis and predictability of architectures
Timing analysis and predictability of architectures Cache analysis Claire Maiza Verimag/INP 01/12/2010 Claire Maiza Synchron 2010 01/12/2010 1 / 18 Timing Analysis Frequency Analysis-guaranteed timing
More informationTask Models and Scheduling
Task Models and Scheduling Jan Reineke Saarland University June 27 th, 2013 With thanks to Jian-Jia Chen at KIT! Jan Reineke Task Models and Scheduling June 27 th, 2013 1 / 36 Task Models and Scheduling
More informationEvaluation and Validation
Evaluation and Validation Jian-Jia Chen (Slides are based on Peter Marwedel) TU Dortmund, Informatik 12 Germany Springer, 2010 2016 年 01 月 05 日 These slides use Microsoft clip arts. Microsoft copyright
More informationScheduling Periodic Real-Time Tasks on Uniprocessor Systems. LS 12, TU Dortmund
Scheduling Periodic Real-Time Tasks on Uniprocessor Systems Prof. Dr. Jian-Jia Chen LS 12, TU Dortmund 08, Dec., 2015 Prof. Dr. Jian-Jia Chen (LS 12, TU Dortmund) 1 / 38 Periodic Control System Pseudo-code
More informationA framework for the timing analysis of dynamic branch predictors
A framework for the timing analysis of dynamic ranch predictors Claire Maïza INP Grenole, Verimag Grenole, France claire.maiza@imag.fr Christine Rochange IRIT - CNRS Université de Toulouse, France rochange@irit.fr
More informationNon-Preemptive and Limited Preemptive Scheduling. LS 12, TU Dortmund
Non-Preemptive and Limited Preemptive Scheduling LS 12, TU Dortmund 09 May 2017 (LS 12, TU Dortmund) 1 / 31 Outline Non-Preemptive Scheduling A General View Exact Schedulability Test Pessimistic Schedulability
More informationTheory of Computer Science. Theory of Computer Science. D7.1 Introduction. D7.2 Turing Machines as Words. D7.3 Special Halting Problem
Theory of Computer Science May 2, 2018 D7. Halting Problem and Reductions Theory of Computer Science D7. Halting Problem and Reductions Gabriele Röger University of Basel May 2, 2018 D7.1 Introduction
More informationFall 2008 CSE Qualifying Exam. September 13, 2008
Fall 2008 CSE Qualifying Exam September 13, 2008 1 Architecture 1. (Quan, Fall 2008) Your company has just bought a new dual Pentium processor, and you have been tasked with optimizing your software for
More informationECE 571 Advanced Microprocessor-Based Design Lecture 10
ECE 571 Advanced Microprocessor-Based Design Lecture 10 Vince Weaver http://web.eece.maine.edu/~vweaver vincent.weaver@maine.edu 23 February 2017 Announcements HW#5 due HW#6 will be posted 1 Oh No, More
More informationUnit 6: Branch Prediction
CIS 501: Computer Architecture Unit 6: Branch Prediction Slides developed by Joe Devie/, Milo Mar4n & Amir Roth at Upenn with sources that included University of Wisconsin slides by Mark Hill, Guri Sohi,
More informationSpecial Nodes for Interface
fi fi Special Nodes for Interface SW on processors Chip-level HW Board-level HW fi fi C code VHDL VHDL code retargetable compilation high-level synthesis SW costs HW costs partitioning (solve ILP) cluster
More informationPortland State University ECE 587/687. Branch Prediction
Portland State University ECE 587/687 Branch Prediction Copyright by Alaa Alameldeen and Haitham Akkary 2015 Branch Penalty Example: Comparing perfect branch prediction to 90%, 95%, 99% prediction accuracy,
More informationCS20a: Turing Machines (Oct 29, 2002)
CS20a: Turing Machines (Oct 29, 2002) So far: DFA = regular languages PDA = context-free languages Today: Computability 1 Handicapped machines DFA limitations Tape head moves only one direction 2-way DFA
More informationDepartment of Electrical and Computer Engineering University of Wisconsin - Madison. ECE/CS 752 Advanced Computer Architecture I.
Last (family) name: Solution First (given) name: Student I.D. #: Department of Electrical and Computer Engineering University of Wisconsin - Madison ECE/CS 752 Advanced Computer Architecture I Midterm
More informationNon-Work-Conserving Non-Preemptive Scheduling: Motivations, Challenges, and Potential Solutions
Non-Work-Conserving Non-Preemptive Scheduling: Motivations, Challenges, and Potential Solutions Mitra Nasri Chair of Real-time Systems, Technische Universität Kaiserslautern, Germany nasri@eit.uni-kl.de
More informationCOSE312: Compilers. Lecture 17 Intermediate Representation (2)
COSE312: Compilers Lecture 17 Intermediate Representation (2) Hakjoo Oh 2017 Spring Hakjoo Oh COSE312 2017 Spring, Lecture 17 May 31, 2017 1 / 19 Common Intermediate Representations Three-address code
More informationECS 120 Lesson 23 The Class P
ECS 120 Lesson 23 The Class P Oliver Kreylos Wednesday, May 23th, 2001 We saw last time how to analyze the time complexity of Turing Machines, and how to classify languages into complexity classes. We
More informationReal-Time Calculus. LS 12, TU Dortmund
Real-Time Calculus Prof. Dr. Jian-Jia Chen LS 12, TU Dortmund 09, Dec., 2014 Prof. Dr. Jian-Jia Chen (LS 12, TU Dortmund) 1 / 35 Arbitrary Deadlines The worst-case response time of τ i by only considering
More informationCaches in WCET Analysis Predictability, Competitiveness, Sensitivity
Caches in WCET Analysis Predictability, Competitiveness, Sensitivity Dissertation Zur Erlangung des Grades des Doktors der Ingenieurwissenschaften der Naturwissenschaftlich-Technischen Fakultäten der Universität
More informationGeneralizing Data-flow Analysis
Generalizing Data-flow Analysis Announcements PA1 grades have been posted Today Other types of data-flow analysis Reaching definitions, available expressions, reaching constants Abstracting data-flow analysis
More informationV Honors Theory of Computation
V22.0453-001 Honors Theory of Computation Problem Set 3 Solutions Problem 1 Solution: The class of languages recognized by these machines is the exactly the class of regular languages, thus this TM variant
More informationLecture notes on Turing machines
Lecture notes on Turing machines Ivano Ciardelli 1 Introduction Turing machines, introduced by Alan Turing in 1936, are one of the earliest and perhaps the best known model of computation. The importance
More informationChe-Wei Chang Department of Computer Science and Information Engineering, Chang Gung University
Che-Wei Chang chewei@mail.cgu.edu.tw Department of Computer Science and Information Engineering, Chang Gung University } 2017/11/15 Midterm } 2017/11/22 Final Project Announcement 2 1. Introduction 2.
More informationAbstractions and Decision Procedures for Effective Software Model Checking
Abstractions and Decision Procedures for Effective Software Model Checking Prof. Natasha Sharygina The University of Lugano, Carnegie Mellon University Microsoft Summer School, Moscow, July 2011 Lecture
More informationBinary Decision Diagrams and Symbolic Model Checking
Binary Decision Diagrams and Symbolic Model Checking Randy Bryant Ed Clarke Ken McMillan Allen Emerson CMU CMU Cadence U Texas http://www.cs.cmu.edu/~bryant Binary Decision Diagrams Restricted Form of
More informationTHE ZCACHE: DECOUPLING WAYS AND ASSOCIATIVITY. Daniel Sanchez and Christos Kozyrakis Stanford University
THE ZCACHE: DECOUPLING WAYS AND ASSOCIATIVITY Daniel Sanchez and Christos Kozyrakis Stanford University MICRO-43, December 6 th 21 Executive Summary 2 Mitigating the memory wall requires large, highly
More informationDecidability (What, stuff is unsolvable?)
University of Georgia Fall 2014 Outline Decidability Decidable Problems for Regular Languages Decidable Problems for Context Free Languages The Halting Problem Countable and Uncountable Sets Diagonalization
More informationComplexity: Some examples
Algorithms and Architectures III: Distributed Systems H-P Schwefel, Jens M. Pedersen Mm6 Distributed storage and access (jmp) Mm7 Introduction to security aspects (hps) Mm8 Parallel complexity (hps) Mm9
More informationCSCI-564 Advanced Computer Architecture
CSCI-564 Advanced Computer Architecture Lecture 8: Handling Exceptions and Interrupts / Superscalar Bo Wu Colorado School of Mines Branch Delay Slots (expose control hazard to software) Change the ISA
More informationDisconnecting Networks via Node Deletions
1 / 27 Disconnecting Networks via Node Deletions Exact Interdiction Models and Algorithms Siqian Shen 1 J. Cole Smith 2 R. Goli 2 1 IOE, University of Michigan 2 ISE, University of Florida 2012 INFORMS
More informationAlgorithmic verification
Algorithmic verification Ahmed Rezine IDA, Linköpings Universitet Hösttermin 2018 Outline Overview Model checking Symbolic execution Outline Overview Model checking Symbolic execution Program verification
More informationSoftware Verification
Software Verification Grégoire Sutre LaBRI, University of Bordeaux, CNRS, France Summer School on Verification Technology, Systems & Applications September 2008 Grégoire Sutre Software Verification VTSA
More informationReal-Time Scheduling and Resource Management
ARTIST2 Summer School 2008 in Europe Autrans (near Grenoble), France September 8-12, 2008 Real-Time Scheduling and Resource Management Lecturer: Giorgio Buttazzo Full Professor Scuola Superiore Sant Anna
More informationMultiprocessor Scheduling I: Partitioned Scheduling. LS 12, TU Dortmund
Multiprocessor Scheduling I: Partitioned Scheduling Prof. Dr. Jian-Jia Chen LS 12, TU Dortmund 22/23, June, 2015 Prof. Dr. Jian-Jia Chen (LS 12, TU Dortmund) 1 / 47 Outline Introduction to Multiprocessor
More information3/11/18. Final Code Generation and Code Optimization
Final Code Generation and Code Optimization 1 2 3 for ( i=0; i < N; i++) { base = &a[0]; crt = *(base + i); } original code base = &a[0]; for ( i=0; i < N; i++) { crt = *(base + i); } optimized code e1
More informationCMP 338: Third Class
CMP 338: Third Class HW 2 solution Conversion between bases The TINY processor Abstraction and separation of concerns Circuit design big picture Moore s law and chip fabrication cost Performance What does
More information(a) Definition of TMs. First Problem of URMs
Sec. 4: Turing Machines First Problem of URMs (a) Definition of the Turing Machine. (b) URM computable functions are Turing computable. (c) Undecidability of the Turing Halting Problem That incrementing
More informationOn the analysis of random replacement caches using static probabilistic timing methods for multi-path programs
Real-Time Syst (208) 54:307 388 https://doi.org/0.007/s24-07-9295-2 On the analysis of random replacement caches using static probabilistic timing methods for multi-path programs Benjamin Lesage David
More informationCompiling Techniques
Lecture 11: Introduction to 13 November 2015 Table of contents 1 Introduction Overview The Backend The Big Picture 2 Code Shape Overview Introduction Overview The Backend The Big Picture Source code FrontEnd
More informationSchedulability and Optimization Analysis for Non-Preemptive Static Priority Scheduling Based on Task Utilization and Blocking Factors
Schedulability and Optimization Analysis for Non-Preemptive Static Priority Scheduling Based on Task Utilization and Blocking Factors Georg von der Brüggen, Jian-Jia Chen, Wen-Hung Huang Department of
More informationCEC 450 Real-Time Systems
CEC 450 Real-Time Systems Lecture 3 Real-Time Services Part 2 (Rate Monotonic Theory - Policy and Feasibility for RT Services) September 7, 2018 Sam Siewert Quick Review Service Utility RM Policy, Feasibility,
More informationCSE 200 Lecture Notes Turing machine vs. RAM machine vs. circuits
CSE 200 Lecture Notes Turing machine vs. RAM machine vs. circuits Chris Calabro January 13, 2016 1 RAM model There are many possible, roughly equivalent RAM models. Below we will define one in the fashion
More informationTime and Schedulability Analysis of Stateflow Models
Time and Schedulability Analysis of Stateflow Models Marco Di Natale Scuola Superiore S. Anna Haibo Zeng Mc Gill University Outline Context: MBD of Embedded Systems Relationship with PBD An Introduction
More information[2] Predicting the direction of a branch is not enough. What else is necessary?
[2] When we talk about the number of operands in an instruction (a 1-operand or a 2-operand instruction, for example), what do we mean? [2] What are the two main ways to define performance? [2] Predicting
More informationPart I: Definitions and Properties
Turing Machines Part I: Definitions and Properties Finite State Automata Deterministic Automata (DFSA) M = {Q, Σ, δ, q 0, F} -- Σ = Symbols -- Q = States -- q 0 = Initial State -- F = Accepting States
More information19. Extending the Capacity of Formal Engines. Bottlenecks for Fully Automated Formal Verification of SoCs
19. Extending the Capacity of Formal Engines 1 19. Extending the Capacity of Formal Engines Jacob Abraham Department of Electrical and Computer Engineering The University of Texas at Austin Verification
More informationDecision Procedures An Algorithmic Point of View
An Algorithmic Point of View ILP References: Integer Programming / Laurence Wolsey Deciding ILPs with Branch & Bound Intro. To mathematical programming / Hillier, Lieberman Daniel Kroening and Ofer Strichman
More informationAccelerating Decoupled Look-ahead to Exploit Implicit Parallelism
Accelerating Decoupled Look-ahead to Exploit Implicit Parallelism Raj Parihar Advisor: Prof. Michael C. Huang March 22, 2013 Raj Parihar Accelerating Decoupled Look-ahead to Exploit Implicit Parallelism
More informationMixed Criticality in Safety-Critical Systems. LS 12, TU Dortmund
Mixed Criticality in Safety-Critical Systems Prof. Dr. Jian-Jia Chen LS 12, TU Dortmund 18, July, 2016 Prof. Dr. Jian-Jia Chen (LS 12, TU Dortmund) 1 / 25 Motivation today s embedded systems use complex
More informationSchedulability of Periodic and Sporadic Task Sets on Uniprocessor Systems
Schedulability of Periodic and Sporadic Task Sets on Uniprocessor Systems Jan Reineke Saarland University July 4, 2013 With thanks to Jian-Jia Chen! Jan Reineke July 4, 2013 1 / 58 Task Models and Scheduling
More informationMm7 Intro to distributed computing (jmp) Mm8 Backtracking, 2-player games, genetic algorithms (hps) Mm9 Complex Problems in Network Planning (JMP)
Algorithms and Architectures II H-P Schwefel, Jens M. Pedersen Mm6 Advanced Graph Algorithms (hps) Mm7 Intro to distributed computing (jmp) Mm8 Backtracking, 2-player games, genetic algorithms (hps) Mm9
More informationENEE350 Lecture Notes-Weeks 14 and 15
Pipelining & Amdahl s Law ENEE350 Lecture Notes-Weeks 14 and 15 Pipelining is a method of processing in which a problem is divided into a number of sub problems and solved and the solu8ons of the sub problems
More informationExploring the Potential of Instruction-Level Parallelism of Exposed Datapath Architectures with Buffered Processing Units
Exploring the Potential of Instruction-Level Parallelism of Exposed Datapath Architectures with Buffered Processing Units Anoop Bhagyanath and Klaus Schneider Embedded Systems Chair University of Kaiserslautern
More informationCMP N 301 Computer Architecture. Appendix C
CMP N 301 Computer Architecture Appendix C Outline Introduction Pipelining Hazards Pipelining Implementation Exception Handling Advanced Issues (Dynamic Scheduling, Out of order Issue, Superscalar, etc)
More informationCPU SCHEDULING RONG ZHENG
CPU SCHEDULING RONG ZHENG OVERVIEW Why scheduling? Non-preemptive vs Preemptive policies FCFS, SJF, Round robin, multilevel queues with feedback, guaranteed scheduling 2 SHORT-TERM, MID-TERM, LONG- TERM
More informationCache-Oblivious Algorithms
Cache-Oblivious Algorithms 1 Cache-Oblivious Model 2 The Unknown Machine Algorithm C program gcc Object code linux Execution Can be executed on machines with a specific class of CPUs Algorithm Java program
More informationProcessor Design & ALU Design
3/8/2 Processor Design A. Sahu CSE, IIT Guwahati Please be updated with http://jatinga.iitg.ernet.in/~asahu/c22/ Outline Components of CPU Register, Multiplexor, Decoder, / Adder, substractor, Varity of
More informationOutline. policies for the first part. with some potential answers... MCS 260 Lecture 10.0 Introduction to Computer Science Jan Verschelde, 9 July 2014
Outline 1 midterm exam on Friday 11 July 2014 policies for the first part 2 questions with some potential answers... MCS 260 Lecture 10.0 Introduction to Computer Science Jan Verschelde, 9 July 2014 Intro
More informationState Machines. Example FSM: Roboant
page 1 State Machines 1) State Machine Design 2) How can we improve on FSMs? 3) Turing Machines 4) Computability Oh genie, will you now tell me what it means to compute? Doctor, I think you ve built a
More informationCache-Oblivious Algorithms
Cache-Oblivious Algorithms 1 Cache-Oblivious Model 2 The Unknown Machine Algorithm C program gcc Object code linux Execution Can be executed on machines with a specific class of CPUs Algorithm Java program
More informationThere are two main techniques for showing that problems are undecidable: diagonalization and reduction
Reducibility 1 There are two main techniques for showing that problems are undecidable: diagonalization and reduction 2 We say that a problem A is reduced to a problem B if the decidability of A follows
More informationFall 2011 Prof. Hyesoon Kim
Fall 2011 Prof. Hyesoon Kim Add: 2 cycles FE_stage add r1, r2, r3 FE L ID L EX L MEM L WB L add add sub r4, r1, r3 sub sub add add mul r5, r2, r3 mul sub sub add add mul sub sub add add mul sub sub add
More informationProportional Share Resource Allocation Outline. Proportional Share Resource Allocation Concept
Proportional Share Resource Allocation Outline Fluid-flow resource allocation models» Packet scheduling in a network Proportional share resource allocation models» CPU scheduling in an operating system
More informationPerformance, Power & Energy. ELEC8106/ELEC6102 Spring 2010 Hayden Kwok-Hay So
Performance, Power & Energy ELEC8106/ELEC6102 Spring 2010 Hayden Kwok-Hay So Recall: Goal of this class Performance Reconfiguration Power/ Energy H. So, Sp10 Lecture 3 - ELEC8106/6102 2 PERFORMANCE EVALUATION
More informationTuring Machines, diagonalization, the halting problem, reducibility
Notes on Computer Theory Last updated: September, 015 Turing Machines, diagonalization, the halting problem, reducibility 1 Turing Machines A Turing machine is a state machine, similar to the ones we have
More informationBranch Prediction using Advanced Neural Methods
Branch Prediction using Advanced Neural Methods Sunghoon Kim Department of Mechanical Engineering University of California, Berkeley shkim@newton.berkeley.edu Abstract Among the hardware techniques, two-level
More informationIntelligent Agents. Formal Characteristics of Planning. Ute Schmid. Cognitive Systems, Applied Computer Science, Bamberg University
Intelligent Agents Formal Characteristics of Planning Ute Schmid Cognitive Systems, Applied Computer Science, Bamberg University Extensions to the slides for chapter 3 of Dana Nau with contributions by
More informationData Structures in Java
Data Structures in Java Lecture 21: Introduction to NP-Completeness 12/9/2015 Daniel Bauer Algorithms and Problem Solving Purpose of algorithms: find solutions to problems. Data Structures provide ways
More information1 Reals are Uncountable
CS 30: Discrete Math in CS (Winter 2019): Lecture 6 Date: 11th January, 2019 (Friday) Topic: Uncountability and Undecidability Disclaimer: These notes have not gone through scrutiny and in all probability
More informationIntroduction to Computer Science and Programming for Astronomers
Introduction to Computer Science and Programming for Astronomers Lecture 8. István Szapudi Institute for Astronomy University of Hawaii March 7, 2018 Outline Reminder 1 Reminder 2 3 4 Reminder We have
More informationCSE. 1. In following code. addi. r1, skip1 xor in r2. r3, skip2. counter r4, top. taken): PC1: PC2: PC3: TTTTTT TTTTTT
CSE 560 Practice Problem Set 4 Solution 1. In this question, you will examine several different schemes for branch prediction, using the following code sequence for a simple load store ISA with no branch
More informationAnalysis of Algorithms [Reading: CLRS 2.2, 3] Laura Toma, csci2200, Bowdoin College
Analysis of Algorithms [Reading: CLRS 2.2, 3] Laura Toma, csci2200, Bowdoin College Why analysis? We want to predict how the algorithm will behave (e.g. running time) on arbitrary inputs, and how it will
More informationCPSC 421: Tutorial #1
CPSC 421: Tutorial #1 October 14, 2016 Set Theory. 1. Let A be an arbitrary set, and let B = {x A : x / x}. That is, B contains all sets in A that do not contain themselves: For all y, ( ) y B if and only
More informationEfficient TDM-based Arbitration for Mixed-Criticality Systems on Multi-Cores
Efficient TDM-based Arbitration for Mixed-Criticality Systems on Multi-Cores Florian Brandner with Farouk Hebbache, 2 Mathieu Jan, 2 Laurent Pautet LTCI, Télécom ParisTech, Université Paris-Saclay 2 CEA
More informationComputational Models Lecture 8 1
Computational Models Lecture 8 1 Handout Mode Nachum Dershowitz & Yishay Mansour. Tel Aviv University. May 17 22, 2017 1 Based on frames by Benny Chor, Tel Aviv University, modifying frames by Maurice
More informationCS154, Lecture 10: Rice s Theorem, Oracle Machines
CS154, Lecture 10: Rice s Theorem, Oracle Machines Moral: Analyzing Programs is Really, Really Hard But can we more easily tell when some program analysis problem is undecidable? Problem 1 Undecidable
More information/ : Computer Architecture and Design
16.482 / 16.561: Computer Architecture and Design Summer 2015 Homework #5 Solution 1. Dynamic scheduling (30 points) Given the loop below: DADDI R3, R0, #4 outer: DADDI R2, R1, #32 inner: L.D F0, 0(R1)
More informationMicroprocessor Power Analysis by Labeled Simulation
Microprocessor Power Analysis by Labeled Simulation Cheng-Ta Hsieh, Kevin Chen and Massoud Pedram University of Southern California Dept. of EE-Systems Los Angeles CA 989 Outline! Introduction! Problem
More informationDigital Integrated Circuits A Design Perspective. Arithmetic Circuits. Jan M. Rabaey Anantha Chandrakasan Borivoje Nikolic.
Digital Integrated Circuits A Design Perspective Jan M. Rabaey Anantha Chandrakasan Borivoje Nikolic Arithmetic Circuits January, 2003 1 A Generic Digital Processor MEMORY INPUT-OUTPUT CONTROL DATAPATH
More informationStatic Program Analysis using Abstract Interpretation
Static Program Analysis using Abstract Interpretation Introduction Static Program Analysis Static program analysis consists of automatically discovering properties of a program that hold for all possible
More information[2] Predicting the direction of a branch is not enough. What else is necessary?
[2] What are the two main ways to define performance? [2] Predicting the direction of a branch is not enough. What else is necessary? [2] The power consumed by a chip has increased over time, but the clock
More informationAutomatic Resource Bound Analysis and Linear Optimization. Jan Hoffmann
Automatic Resource Bound Analysis and Linear Optimization Jan Hoffmann Beyond worst-case analysis Beyond worst-case analysis of algorithms Beyond worst-case analysis of algorithms Resource bound analysis
More informationPerformance Prediction Models
Performance Prediction Models Shruti Padmanabha, Andrew Lukefahr, Reetuparna Das, Scott Mahlke {shrupad,lukefahr,reetudas,mahlke}@umich.edu Gem5 workshop Micro 2012 December 2, 2012 Composite Cores Pushing
More informationCS20a: Turing Machines (Oct 29, 2002)
CS20a: Turing Machines (Oct 29, 2002) So far: DFA = regular languages PDA = context-free languages Today: Computability 1 Church s thesis The computable functions are the same as the partial recursive
More informationCOVER SHEET: Problem#: Points
EEL 4712 Midterm 3 Spring 2017 VERSION 1 Name: UFID: Sign here to give permission for your test to be returned in class, where others might see your score: IMPORTANT: Please be neat and write (or draw)
More informationUnit 1A: Computational Complexity
Unit 1A: Computational Complexity Course contents: Computational complexity NP-completeness Algorithmic Paradigms Readings Chapters 3, 4, and 5 Unit 1A 1 O: Upper Bounding Function Def: f(n)= O(g(n)) if
More informationCSE 105 THEORY OF COMPUTATION. Spring 2018 review class
CSE 105 THEORY OF COMPUTATION Spring 2018 review class Today's learning goals Summarize key concepts, ideas, themes from CSE 105. Approach your final exam studying with confidence. Identify areas to focus
More informationUndecidable Problems. Z. Sawa (TU Ostrava) Introd. to Theoretical Computer Science May 12, / 65
Undecidable Problems Z. Sawa (TU Ostrava) Introd. to Theoretical Computer Science May 12, 2018 1/ 65 Algorithmically Solvable Problems Let us assume we have a problem P. If there is an algorithm solving
More informationInput Decidable Language -- Program Halts on all Input Encoding of Input -- Natural Numbers Encoded in Binary or Decimal, Not Unary
Complexity Analysis Complexity Theory Input Decidable Language -- Program Halts on all Input Encoding of Input -- Natural Numbers Encoded in Binary or Decimal, Not Unary Output TRUE or FALSE Time and Space
More informationFurther discussion of Turing machines
Further discussion of Turing machines In this lecture we will discuss various aspects of decidable and Turing-recognizable languages that were not mentioned in previous lectures. In particular, we will
More informationCSE 105 THEORY OF COMPUTATION
CSE 105 THEORY OF COMPUTATION Spring 2017 http://cseweb.ucsd.edu/classes/sp17/cse105-ab/ Today's learning goals Summarize key concepts, ideas, themes from CSE 105. Approach your final exam studying with
More information