Reducing NVM Writes with Optimized Shadow Paging

Size: px

Start display at page:

Download "Reducing NVM Writes with Optimized Shadow Paging"

Allison Davidson
5 years ago
Views:

1 Reducing NVM Writes with Optimized Shadow Paging Yuanjiang Ni, Jishen Zhao, Daniel Bittman, Ethan L. Miller Center for Research in Storage Systems University of California, Santa Cruz

2 Emerging Technology Memory Storage Byte-Addressable High speed Volatile Small capacity BNVM Block-Addressable Slow Durable Large Capacity 2

3 New Storage Architecture read()/write() cache-line load/store cache-line flush DRAM BNVM fsync(), etc page HDD/SSD 3

4 Crash Consistency A 1,000,000 XBEGIN B 1,000,000 A.account -= 500,000 B.account += 500,000 XEND A 500,000 B 1,000,000 Crash-consistency is a must! A, B lost money! 4

5 Opportunities Leverage byte-addressability e.g Fine-grained logging. 5

6 Opportunities Leverage byte-addressability e.g Fine-grained logging. Leverage virtual memory Indirection is necessary for many techniques Can we directly leverage virtual memory indirection? 6

7 Opportunities Leverage byte-addressability e.g Fine-grained logging. Leverage virtual memory Indirection is necessary for many techniques Can we directly leverage virtual memory indirection? Explore Hardware Support Intel proposes instructions such as clwb for especially persistent memory. Other HW supports? 7

8 Inefficiencies of Existing Approaches Extra writes to NVM are bad. Performance Endurance 8

9 Inefficiencies of Existing Approaches X Extra Writes Y Newly Written Z Data from Last Commit Extra writes to NVM are bad. Data Log A B A B Write twice Performance Logging C D Endurance Logging Write the actual data twice 9

10 Inefficiencies of Existing Approaches X Extra Writes Y Newly Written Z Data from Last Commit Extra writes to NVM are bad. Data Log A B A B Write twice Performance Logging C D Endurance Logging P0 P1 Write the actual data twice Shadow A B A B Paging shadow paging C D C D Copy unmodified Copy unmodified data 10

11 Inefficiencies of Existing Approaches X Extra Writes Y Newly Written Z Data from Last Commit Extra writes to NVM are bad. Data Log A B A B Write twice Performance Logging C D Endurance Logging P0 P1 Write the actual data twice Shadow A B A B Paging shadow paging C D C D Copy unmodified Copy unmodified data P0 P1 Our approach - OSP A B A B OSP C D 11

12 Cache-line Level Mapping Track modifications at cache line level? Can t simply reduce page size! 12

13 Cache-line Level Mapping P0 P1 Two bits per cache line Committed Bit - Where is the old state? Updated Bit - has this cache line been updated? Only required when pages are being actively updated! 13

14 TLB Extension Wider TLB entry Committed bitmap Updated bitmap Additional PPN 14

15 TLB Extension Wider TLB entry Committed bitmap Updated bitmap Additional PPN Minimal impact on run-time performance. Require only few gate delays Done in parallel with cache access (e.g. VIPT caches) 15

16 TLB Extension Wider TLB entry Committed bitmap Updated bitmap Additional PPN Minimal impact on run-time performance. Require only few gate delays Done in parallel with cache access (e.g. VIPT caches) Need not change the PTE Additional information required only when pages are actively being updated. 16

17 Example P0 P1 VPN P0 P1 Committed Updated Wider TLB entry V P0 P

18 Read the cache line 0 Read from P ( Committed_bit XOR updated_bit ) P0 P1 VPN P0 P1 Committed Updated Wider TLB entry V P0 P

19 Update the cache line 0 Writes go to P ( Committed_bit XOR 1 ) And, set the updated_bit P0 P1 VPN P0 P1 Committed Updated Wider TLB entry V P0 P

20 Update the cache line 1 Writes go to P ( Committed_bit XOR 1 ) And, set the updated_bit P0 P1 VPN P0 P1 Committed Updated Wider TLB entry V P0 P

21 Commit committed bitmap = (committed bitmap XOR updated bitmap) And, clear the updated bitmap P0 Before After P0 P1 P1 VPN P0 P1 Committed Updated VPN P0 P1 Committed Updated V P0 P V P0 P

22 Abort Clear the updated bitmap P0 Before After P0 P1 P1 VPN P0 P1 Committed Updated VPN P0 P1 Committed Updated V P0 P V P0 P

23 Page Consolidation Double physical pages can waste memory space Reduce storage cost Consolidating virtual pages that are not being actively updated. Copy valid data into one page and free the other one. TLB eviction identifies inactive virtual pages. Page Consolidation is not a per-transaction overhead. 23

24 Multi-page Atomicity Consistent State Table VPN Committed V1 V2 Can't atomically update separate locations in-place 24

25 Lightweight Journaling Consistent State Table VPN Committed Journaling Completed V1 V2 V1 Bitmap1 V2 Bitmap2 TX-END V2 Bitmap2 uncompleted Lightweight and not a per-update overhead! 25

26 Experiment Setup Based on McSimA+ 64-entry L1 DTLB Transactional workloads: array swap (SPS), hashtable (HT), RBtree (RBT), B-tree (BT) *-uni : inserts/deletes in a uniformly random fashion *-zipf : inserts/deletes following Zipf distribution 1G ~ 4G footprint Metric: CPU flush 26

27 CPU Flushes CPU flushes (normalized) SPS baseline (undo-log) HT-uni HT-zipf RBT-uni RBT-zipf BT-uni BT-zipf Reduces the number of CPU flushes by 1.6x on average OSP 27

28 Breakdown CPU flushes (normalized) in-place journaling consolidation SPS HT-uni HT-zipf RBT-uni RBT-zipf BT-uni BT-zipf Nearly eliminate all of the consistency cost for workloads with locality 28

29 Discussion Limitations. Size of a transaction is limited by the TLB capacity Fallback path. TLB coherence for multi-threaded processes Overhead, correctness Work with virtual cache 29

30 Conclusion Use virtual memory system to implement efficient, transactional update avoid extra copies required by logging Keep two copies of each page being modified Track modifications at the cache line level Avoid the inefficiencies of traditional shadow paging Small changes to hardware: TLB extension Preliminary simulation shows great promise 30

31 Questions Collaborators: Yuanjiang Ni Jishen Zhao ) Daniel Bittman ) Ethan Miller 31

ECE 571 Advanced Microprocessor-Based Design Lecture 10

ECE 571 Advanced Microprocessor-Based Design Lecture 10 Vince Weaver http://web.eece.maine.edu/~vweaver vincent.weaver@maine.edu 23 February 2017 Announcements HW#5 due HW#6 will be posted 1 Oh No, More