Greeting, FYI, we noticed a 8.9% improvement of vm-scalability.median due to commit: commit: db001b7115ac61820c5bc44994537bef3853bac6 ("[PATCHv2 7/8] thp: Change CoW semantics for anon-THP") url: https://github.com/0day-ci/linux/commits/Kirill-A-Shutemov/thp-khugepaged-improvements-and-CoW-semantics/20200405-000201 base: https://git.kernel.org/cgit/linux/kernel/git/shuah/linux-kselftest.git next in testcase: vm-scalability on test machine: 16 threads Intel(R) Xeon(R) E-2278G CPU @ 3.40GHz with 32G memory with following parameters: runtime: 300s size: 8T test: anon-cow-seq cpufreq_governor: performance ucode: 0xca test-description: The motivation behind this suite is to exercise functions and regions of the mm/ of the Linux kernel which are of interest to us. test-url: https://git.kernel.org/cgit/linux/kernel/git/wfg/vm-scalability.git/ Details are as below: --------------------------------------------------------------------------------------------------> To reproduce: git clone https://github.com/intel/lkp-tests.git cd lkp-tests bin/lkp install job.yaml # job file is attached in this email bin/lkp run job.yaml ========================================================================================= compiler/cpufreq_governor/kconfig/rootfs/runtime/size/tbox_group/test/testcase/ucode: gcc-7/performance/x86_64-rhel-7.6/debian-x86_64-20191114.cgz/300s/8T/lkp-cfl-e1/anon-cow-seq/vm-scalability/0xca commit: 385691e5d6 ("khugepaged: Allow to collapse PTE-mapped compound pages") db001b7115 ("thp: Change CoW semantics for anon-THP") 385691e5d6e281b7 db001b7115ac61820c5bc449945 ---------------- --------------------------- fail:runs %reproduction fail:runs | | | 1:4 -25% :4 dmesg.WARNING:at#for_ip_interrupt_entry/0x :4 116% 4:4 perf-profile.calltrace.cycles-pp.sync_regs.error_entry.do_access :4 128% 5:4 perf-profile.calltrace.cycles-pp.error_entry.do_access :4 5% 0:4 perf-profile.children.cycles-pp.error_exit :4 155% 6:4 perf-profile.children.cycles-pp.error_entry :4 16% 0:4 perf-profile.self.cycles-pp.error_entry %stddev %change %stddev \ | \ 661885 +8.9% 720550 vm-scalability.median 2.45 ± 4% +0.6 3.05 ± 3% vm-scalability.median_stddev% 10535433 +9.5% 11539514 vm-scalability.throughput 3174084 +20717.0% 6.607e+08 vm-scalability.time.minor_page_faults 1460 -9.7% 1318 vm-scalability.time.percent_of_cpu_this_job_got 1905 +16.7% 2223 vm-scalability.time.system_time 2499 -29.9% 1752 vm-scalability.time.user_time 145571 +37412.8% 54607973 vm-scalability.time.voluntary_context_switches 3.015e+09 -1.4% 2.971e+09 vm-scalability.workload 11.55 +2.7% 11.86 ± 3% boot-time.dhcp 7.25 ± 3% +10.1 17.32 mpstat.cpu.all.idle% 0.00 ± 50% +0.1 0.05 ± 3% mpstat.cpu.all.iowait% 40.47 +5.9 46.36 mpstat.cpu.all.sys% 52.26 -16.0 36.24 mpstat.cpu.all.usr% 8262018 ± 2% -8.9% 7529530 meminfo.Active 8259752 ± 2% -8.9% 7527266 meminfo.Active(anon) 7490638 ± 2% -85.2% 1105680 meminfo.AnonHugePages 8171722 ± 2% -8.2% 7501571 meminfo.AnonPages 60919 -11.5% 53896 ± 2% meminfo.max_used_kB 7.25 ± 5% +137.9% 17.25 ± 2% vmstat.cpu.id 51.25 -30.2% 35.75 vmstat.cpu.us 14.75 ± 2% -11.9% 13.00 vmstat.procs.r 2255 +15841.3% 359555 vmstat.system.cs 32997 +7.6% 35493 vmstat.system.in 2956469 ± 4% +924.0% 30273837 cpuidle.C1.time 92358 +3638.2% 3452555 cpuidle.C1.usage 6455483 ±134% +519.3% 39981838 cpuidle.C1E.time 101469 ±119% +2511.3% 2649633 cpuidle.C1E.usage 3.337e+08 ± 8% +78.1% 5.945e+08 ± 2% cpuidle.C6.time 635855 ± 26% +94.1% 1234310 ± 2% cpuidle.C6.usage 410302 ± 2% +30791.0% 1.267e+08 cpuidle.POLL.time 61252 ± 5% +82036.6% 50310539 cpuidle.POLL.usage 7651 ± 2% +24.8% 9545 ± 2% slabinfo.anon_vma.active_objs 7674 ± 2% +24.9% 9585 ± 2% slabinfo.anon_vma.num_objs 12710 ± 3% +13.6% 14435 slabinfo.anon_vma_chain.active_objs 12827 ± 3% +13.5% 14561 slabinfo.anon_vma_chain.num_objs 1136 ± 2% -9.5% 1028 ± 2% slabinfo.pid.active_objs 1136 ± 2% -9.5% 1028 ± 2% slabinfo.pid.num_objs 967.00 ± 4% -8.2% 888.00 ± 6% slabinfo.signal_cache.active_objs 967.00 ± 4% -8.2% 888.00 ± 6% slabinfo.signal_cache.num_objs 5282 +11.8% 5906 ± 2% slabinfo.vmap_area.active_objs 5282 +11.8% 5906 ± 2% slabinfo.vmap_area.num_objs 2029580 -7.0% 1887278 proc-vmstat.nr_active_anon 2014867 -6.7% 1880696 proc-vmstat.nr_anon_pages 3609 -85.1% 538.75 proc-vmstat.nr_anon_transparent_hugepages 577826 +2.5% 592543 proc-vmstat.nr_dirty_background_threshold 1157066 +2.5% 1186536 proc-vmstat.nr_dirty_threshold 5854882 +2.5% 5999717 proc-vmstat.nr_free_pages 8505 -2.6% 8280 proc-vmstat.nr_page_table_pages 2029579 -7.0% 1887278 proc-vmstat.nr_zone_active_anon 4868872 +13497.1% 6.62e+08 proc-vmstat.numa_hit 4868872 +13497.1% 6.62e+08 proc-vmstat.numa_local 7.138e+08 -1.4% 7.035e+08 proc-vmstat.pgalloc_normal 3569603 +18417.3% 6.61e+08 proc-vmstat.pgfault 7.126e+08 -1.5% 7.021e+08 proc-vmstat.pgfree 1387254 -94.2% 80432 proc-vmstat.thp_fault_alloc 68172 ± 14% +40.7% 95884 ± 6% softirqs.CPU0.RCU 10934 ± 2% +494.1% 64962 ± 3% softirqs.CPU0.SCHED 68793 ± 14% +41.7% 97477 ± 4% softirqs.CPU1.RCU 9262 ± 3% +547.1% 59937 softirqs.CPU1.SCHED 70384 ± 13% +33.5% 93989 ± 5% softirqs.CPU10.RCU 9791 ± 5% +553.8% 64014 ± 2% softirqs.CPU10.SCHED 69130 ± 13% +36.2% 94161 ± 4% softirqs.CPU11.RCU 9075 ± 3% +563.6% 60227 ± 3% softirqs.CPU11.SCHED 69369 ± 13% +35.1% 93751 ± 4% softirqs.CPU12.RCU 9408 ± 4% +525.0% 58796 ± 2% softirqs.CPU12.SCHED 68466 ± 13% +36.9% 93733 ± 3% softirqs.CPU13.RCU 8734 ± 4% +614.3% 62391 ± 6% softirqs.CPU13.SCHED 69162 ± 13% +34.8% 93216 ± 4% softirqs.CPU14.RCU 8821 ± 5% +633.3% 64686 ± 2% softirqs.CPU14.SCHED 68991 ± 13% +35.7% 93629 ± 3% softirqs.CPU15.RCU 8440 ± 2% +664.5% 64527 softirqs.CPU15.SCHED 70296 ± 13% +35.7% 95389 ± 5% softirqs.CPU2.RCU 9776 ± 3% +512.6% 59886 ± 3% softirqs.CPU2.SCHED 68812 ± 11% +38.0% 94967 ± 4% softirqs.CPU3.RCU 8824 ± 4% +574.8% 59548 softirqs.CPU3.SCHED 69540 ± 13% +34.5% 93520 ± 4% softirqs.CPU4.RCU 8784 +574.7% 59267 softirqs.CPU4.SCHED 68316 ± 13% +37.6% 93983 ± 3% softirqs.CPU5.RCU 8757 ± 4% +587.3% 60193 ± 4% softirqs.CPU5.SCHED 68163 ± 13% +38.3% 94246 ± 4% softirqs.CPU6.RCU 8527 ± 3% +597.3% 59464 ± 3% softirqs.CPU6.SCHED 69843 ± 15% +35.1% 94368 ± 4% softirqs.CPU7.RCU 8807 ± 6% +598.5% 61519 ± 2% softirqs.CPU7.SCHED 70328 ± 11% +35.8% 95484 ± 3% softirqs.CPU8.RCU 8965 ± 4% +592.6% 62087 ± 2% softirqs.CPU8.SCHED 69489 ± 13% +37.5% 95516 ± 4% softirqs.CPU9.RCU 9276 +599.8% 64911 softirqs.CPU9.SCHED 1107264 ± 13% +36.7% 1513321 ± 4% softirqs.RCU 146191 +574.8% 986424 softirqs.SCHED 7383 -4.8% 7031 ± 5% interrupts.CAL:Function_call_interrupts 3255 ± 32% +1281.7% 44981 ± 13% interrupts.CPU0.RES:Rescheduling_interrupts 102.50 ±161% +2203.4% 2361 ± 48% interrupts.CPU0.TLB:TLB_shootdowns 3781 ± 10% +1161.7% 47714 ± 9% interrupts.CPU1.RES:Rescheduling_interrupts 49.50 ± 89% +6321.7% 3178 ± 59% interrupts.CPU1.TLB:TLB_shootdowns 3111 ± 50% +1826.1% 59929 ± 16% interrupts.CPU10.RES:Rescheduling_interrupts 1.75 ± 84% +2.8e+05% 4904 ±149% interrupts.CPU10.TLB:TLB_shootdowns 2849 ± 27% +1799.8% 54135 ± 11% interrupts.CPU11.RES:Rescheduling_interrupts 1.50 ±100% +1.3e+05% 1993 ± 74% interrupts.CPU11.TLB:TLB_shootdowns 3598 ± 39% +1264.6% 49097 ± 15% interrupts.CPU12.RES:Rescheduling_interrupts 8.25 ±112% +49384.8% 4082 ± 95% interrupts.CPU12.TLB:TLB_shootdowns 2585 ± 29% +1976.1% 53682 ± 21% interrupts.CPU13.RES:Rescheduling_interrupts 1.50 ± 74% +1.3e+05% 2023 ±107% interrupts.CPU13.TLB:TLB_shootdowns 2428 ± 33% +2382.2% 60274 ± 6% interrupts.CPU14.RES:Rescheduling_interrupts 7.50 ±135% +70603.3% 5302 ± 54% interrupts.CPU14.TLB:TLB_shootdowns 1560 ± 12% +3040.3% 48996 ± 21% interrupts.CPU15.RES:Rescheduling_interrupts 3.25 ± 59% +40446.2% 1317 ± 42% interrupts.CPU15.TLB:TLB_shootdowns 3550 ± 36% +1684.6% 63359 ± 10% interrupts.CPU2.RES:Rescheduling_interrupts 74.75 ± 66% +2869.6% 2219 ± 35% interrupts.CPU2.TLB:TLB_shootdowns 3251 ± 31% +1643.6% 56690 ± 17% interrupts.CPU3.RES:Rescheduling_interrupts 59.75 ±150% +3890.0% 2384 ± 54% interrupts.CPU3.TLB:TLB_shootdowns 2593 ± 18% +1716.1% 47092 ± 9% interrupts.CPU4.RES:Rescheduling_interrupts 159.75 ±111% +2730.8% 4522 ± 92% interrupts.CPU4.TLB:TLB_shootdowns 2748 ± 27% +1951.5% 56390 ± 22% interrupts.CPU5.RES:Rescheduling_interrupts 45.25 ± 82% +33129.3% 15036 ± 71% interrupts.CPU5.TLB:TLB_shootdowns 3137 ± 33% +1598.2% 53286 ± 5% interrupts.CPU6.RES:Rescheduling_interrupts 68.75 ±136% +6444.4% 4499 ± 44% interrupts.CPU6.TLB:TLB_shootdowns 3117 ± 43% +1544.0% 51245 ± 16% interrupts.CPU7.RES:Rescheduling_interrupts 62.25 ± 99% +19128.5% 11969 ±147% interrupts.CPU7.TLB:TLB_shootdowns 1895 ± 17% +2283.5% 45185 ± 5% interrupts.CPU8.RES:Rescheduling_interrupts 16.00 ± 89% +29306.2% 4705 ±139% interrupts.CPU8.TLB:TLB_shootdowns 2559 ± 19% +1809.0% 48861 ± 9% interrupts.CPU9.RES:Rescheduling_interrupts 1.25 ± 87% +2.7e+05% 3377 ±136% interrupts.CPU9.TLB:TLB_shootdowns 161373 -11.1% 143490 ± 3% interrupts.NMI:Non-maskable_interrupts 161373 -11.1% 143490 ± 3% interrupts.PMI:Performance_monitoring_interrupts 46023 ± 4% +1727.1% 840921 ± 6% interrupts.RES:Rescheduling_interrupts 663.50 ± 43% +11034.5% 73877 ± 32% interrupts.TLB:TLB_shootdowns 0.00 +2.5e+12% 25295 ±138% sched_debug.cfs_rq:/.MIN_vruntime.avg 0.00 +4e+13% 404725 ±138% sched_debug.cfs_rq:/.MIN_vruntime.max 138869 -9.9% 125160 sched_debug.cfs_rq:/.exec_clock.avg 139800 -9.4% 126706 sched_debug.cfs_rq:/.exec_clock.max 137693 -10.1% 123808 sched_debug.cfs_rq:/.exec_clock.min 562.56 ± 9% +43.3% 806.29 ± 6% sched_debug.cfs_rq:/.exec_clock.stddev 50346 -44.0% 28199 ± 39% sched_debug.cfs_rq:/.load.min 19925 ± 6% +211.2% 62008 ± 80% sched_debug.cfs_rq:/.load.stddev 317.08 ± 5% +31.6% 417.12 ± 19% sched_debug.cfs_rq:/.load_avg.max 80.67 ± 5% +29.7% 104.64 ± 22% sched_debug.cfs_rq:/.load_avg.stddev 0.00 +2.5e+12% 25295 ±138% sched_debug.cfs_rq:/.max_vruntime.avg 0.00 +4e+13% 404725 ±138% sched_debug.cfs_rq:/.max_vruntime.max 2218856 -16.1% 1862626 sched_debug.cfs_rq:/.min_vruntime.avg 2236445 -16.0% 1879701 sched_debug.cfs_rq:/.min_vruntime.max 2200953 -16.3% 1841893 sched_debug.cfs_rq:/.min_vruntime.min 0.83 -45.0% 0.46 ± 39% sched_debug.cfs_rq:/.nr_running.min 14.06 ± 8% -77.7% 3.13 ± 15% sched_debug.cfs_rq:/.nr_spread_over.avg 32.62 ± 14% -72.0% 9.12 ± 16% sched_debug.cfs_rq:/.nr_spread_over.max 4.54 ± 3% -97.2% 0.12 ±173% sched_debug.cfs_rq:/.nr_spread_over.min 7.95 ± 21% -68.2% 2.53 ± 9% sched_debug.cfs_rq:/.nr_spread_over.stddev 116.67 ± 13% +60.2% 186.88 ± 14% sched_debug.cfs_rq:/.runnable_load_avg.max 47.83 -57.1% 20.50 ± 40% sched_debug.cfs_rq:/.runnable_load_avg.min 17.63 ± 21% +131.3% 40.77 ± 32% sched_debug.cfs_rq:/.runnable_load_avg.stddev 107811 ± 3% +151.9% 271564 ± 71% sched_debug.cfs_rq:/.runnable_weight.max 50216 -43.8% 28199 ± 39% sched_debug.cfs_rq:/.runnable_weight.min 14957 ± 10% +288.2% 58065 ± 86% sched_debug.cfs_rq:/.runnable_weight.stddev 796.67 ± 14% -51.9% 383.00 ± 44% sched_debug.cfs_rq:/.util_avg.min 111.62 ± 39% +165.0% 295.83 ± 12% sched_debug.cfs_rq:/.util_avg.stddev 368.12 ± 30% -90.1% 36.58 ±170% sched_debug.cfs_rq:/.util_est_enqueued.min 189.12 ± 26% +75.3% 331.59 ± 8% sched_debug.cfs_rq:/.util_est_enqueued.stddev 5035 ± 2% -54.4% 2295 ± 70% sched_debug.cpu.curr->pid.min 130.97 ± 27% +618.1% 940.48 ± 64% sched_debug.cpu.curr->pid.stddev 0.00 ± 15% +60.3% 0.00 ± 9% sched_debug.cpu.next_balance.stddev 23904 +14336.6% 3451016 sched_debug.cpu.nr_switches.avg 36949 ± 5% +10772.3% 4017255 ± 3% sched_debug.cpu.nr_switches.max 17386 +16525.2% 2890597 ± 6% sched_debug.cpu.nr_switches.min 5179 ± 9% +6075.3% 319873 ± 3% sched_debug.cpu.nr_switches.stddev -14.50 +38.5% -20.08 sched_debug.cpu.nr_uninterruptible.min 9.08 ± 11% +25.2% 11.37 ± 9% sched_debug.cpu.nr_uninterruptible.stddev 20257 +16918.2% 3447384 sched_debug.cpu.sched_count.avg 31557 ± 5% +12615.0% 4012487 ± 3% sched_debug.cpu.sched_count.max 15362 +18695.2% 2887383 ± 6% sched_debug.cpu.sched_count.min 4345 ± 7% +7257.6% 319703 ± 3% sched_debug.cpu.sched_count.stddev 5702 +30037.0% 1718417 sched_debug.cpu.sched_goidle.avg 7501 ± 2% +26584.4% 2001731 ± 3% sched_debug.cpu.sched_goidle.max 4759 ± 2% +30133.3% 1438852 ± 6% sched_debug.cpu.sched_goidle.min 747.96 ± 9% +21257.1% 159741 ± 3% sched_debug.cpu.sched_goidle.stddev 9879 +17357.3% 1724754 sched_debug.cpu.ttwu_count.avg 17142 ± 9% +11596.4% 2005096 ± 2% sched_debug.cpu.ttwu_count.max 7267 ± 3% +19694.4% 1438640 ± 5% sched_debug.cpu.ttwu_count.min 2538 ± 15% +6258.3% 161393 ± 5% sched_debug.cpu.ttwu_count.stddev 9480 ± 6% -24.2% 7190 ± 10% sched_debug.cpu.ttwu_local.max 2078 ± 9% +35.5% 2815 ± 3% sched_debug.cpu.ttwu_local.min 1950 ± 3% -36.2% 1244 ± 16% sched_debug.cpu.ttwu_local.stddev 21.83 +6.0% 23.15 perf-stat.i.MPKI 1.037e+10 +42.0% 1.473e+10 perf-stat.i.branch-instructions 0.05 ± 3% +0.1 0.14 perf-stat.i.branch-miss-rate% 3369202 ± 4% +410.0% 17184006 perf-stat.i.branch-misses 60.86 -33.4 27.50 perf-stat.i.cache-miss-rate% 4.331e+08 -17.1% 3.589e+08 perf-stat.i.cache-misses 7.118e+08 +86.6% 1.328e+09 perf-stat.i.cache-references 2251 +15995.8% 362417 perf-stat.i.context-switches 1.89 -50.0% 0.94 perf-stat.i.cpi 6.148e+10 -12.4% 5.387e+10 perf-stat.i.cpu-cycles 73.37 ± 6% +225.2% 238.64 ± 5% perf-stat.i.cpu-migrations 142.91 +5.8% 151.21 perf-stat.i.cycles-between-cache-misses 0.00 ± 6% +0.0 0.00 ± 2% perf-stat.i.dTLB-load-miss-rate% 102744 ± 4% +339.3% 451367 ± 2% perf-stat.i.dTLB-load-misses 8.149e+09 +50.8% 1.229e+10 perf-stat.i.dTLB-loads 0.00 +0.2 0.16 perf-stat.i.dTLB-store-miss-rate% 44331 +15095.5% 6736444 perf-stat.i.dTLB-store-misses 2.384e+09 +73.7% 4.143e+09 perf-stat.i.dTLB-stores 72.35 -51.8 20.54 perf-stat.i.iTLB-load-miss-rate% 181195 +1626.2% 3127850 perf-stat.i.iTLB-load-misses 74440 +16174.6% 12114813 perf-stat.i.iTLB-loads 3.261e+10 +75.7% 5.73e+10 perf-stat.i.instructions 189288 -89.2% 20469 perf-stat.i.instructions-per-iTLB-miss 0.53 +99.6% 1.06 perf-stat.i.ipc 11179 +19457.4% 2186390 perf-stat.i.minor-faults 29989451 ± 2% +77.0% 53068965 perf-stat.i.node-loads 2.605e+08 -41.8% 1.516e+08 perf-stat.i.node-stores 11179 +19457.4% 2186390 perf-stat.i.page-faults 21.83 +6.2% 23.18 perf-stat.overall.MPKI 0.03 ± 4% +0.1 0.12 perf-stat.overall.branch-miss-rate% 60.85 -33.8 27.02 perf-stat.overall.cache-miss-rate% 1.89 -50.1% 0.94 perf-stat.overall.cpi 141.95 +5.7% 150.08 perf-stat.overall.cycles-between-cache-misses 0.00 ± 4% +0.0 0.00 ± 2% perf-stat.overall.dTLB-load-miss-rate% 0.00 +0.2 0.16 perf-stat.overall.dTLB-store-miss-rate% 70.89 -50.4 20.53 perf-stat.overall.iTLB-load-miss-rate% 179885 -89.8% 18310 perf-stat.overall.instructions-per-iTLB-miss 0.53 +100.5% 1.06 perf-stat.overall.ipc 0.00 ± 52% -0.0 0.00 ± 83% perf-stat.overall.node-load-miss-rate% 3262 +78.3% 5816 perf-stat.overall.path-length 1.034e+10 +42.0% 1.468e+10 perf-stat.ps.branch-instructions 3360586 ± 4% +409.9% 17134421 perf-stat.ps.branch-misses 4.317e+08 -17.1% 3.577e+08 perf-stat.ps.cache-misses 7.094e+08 +86.6% 1.324e+09 perf-stat.ps.cache-references 2244 +16002.1% 361379 perf-stat.ps.context-switches 6.128e+10 -12.4% 5.368e+10 perf-stat.ps.cpu-cycles 73.14 ± 6% +225.4% 238.03 ± 5% perf-stat.ps.cpu-migrations 102439 ± 4% +339.3% 449993 ± 2% perf-stat.ps.dTLB-load-misses 8.121e+09 +50.8% 1.225e+10 perf-stat.ps.dTLB-loads 44192 +15090.9% 6713262 perf-stat.ps.dTLB-store-misses 2.376e+09 +73.7% 4.129e+09 perf-stat.ps.dTLB-stores 180678 +1626.1% 3118697 perf-stat.ps.iTLB-load-misses 74196 +16172.4% 12073619 perf-stat.ps.iTLB-loads 3.25e+10 +75.7% 5.71e+10 perf-stat.ps.instructions 11142 +19454.2% 2178861 perf-stat.ps.minor-faults 29890459 ± 2% +76.9% 52880337 perf-stat.ps.node-loads 2.596e+08 -41.8% 1.511e+08 perf-stat.ps.node-stores 11142 +19454.2% 2178861 perf-stat.ps.page-faults 9.835e+12 +75.8% 1.728e+13 perf-stat.total.instructions 42.83 ± 2% -42.8 0.00 perf-profile.calltrace.cycles-pp.do_huge_pmd_wp_page.__handle_mm_fault.handle_mm_fault.do_page_fault.page_fault 41.07 ± 2% -41.1 0.00 perf-profile.calltrace.cycles-pp.copy_user_huge_page.do_huge_pmd_wp_page.__handle_mm_fault.handle_mm_fault.do_page_fault 40.86 ± 2% -40.9 0.00 perf-profile.calltrace.cycles-pp.copy_subpage.copy_user_huge_page.do_huge_pmd_wp_page.__handle_mm_fault.handle_mm_fault 40.43 ± 2% -40.4 0.00 perf-profile.calltrace.cycles-pp.copy_page.copy_subpage.copy_user_huge_page.do_huge_pmd_wp_page.__handle_mm_fault 94.37 -33.5 60.90 perf-profile.calltrace.cycles-pp.do_access 37.81 -2.0 35.81 perf-profile.calltrace.cycles-pp.__handle_mm_fault.handle_mm_fault.do_page_fault.page_fault.do_access 37.85 -1.4 36.49 perf-profile.calltrace.cycles-pp.handle_mm_fault.do_page_fault.page_fault.do_access 0.78 ± 5% -0.1 0.69 ± 7% perf-profile.calltrace.cycles-pp.ret_from_fork 0.78 ± 5% -0.1 0.69 ± 7% perf-profile.calltrace.cycles-pp.kthread.ret_from_fork 0.77 ± 5% -0.1 0.69 ± 7% perf-profile.calltrace.cycles-pp.worker_thread.kthread.ret_from_fork 0.77 ± 5% -0.1 0.69 ± 7% perf-profile.calltrace.cycles-pp.process_one_work.worker_thread.kthread.ret_from_fork 0.75 ± 5% -0.1 0.67 ± 7% perf-profile.calltrace.cycles-pp.memcpy_erms.drm_fb_helper_dirty_work.process_one_work.worker_thread.kthread 0.77 ± 5% -0.1 0.69 ± 7% perf-profile.calltrace.cycles-pp.drm_fb_helper_dirty_work.process_one_work.worker_thread.kthread.ret_from_fork 0.00 +0.5 0.53 ± 3% perf-profile.calltrace.cycles-pp.__sched_text_start.schedule_idle.do_idle.cpu_startup_entry.start_secondary 0.00 +0.6 0.55 ± 3% perf-profile.calltrace.cycles-pp.schedule_idle.do_idle.cpu_startup_entry.start_secondary.secondary_startup_64 0.00 +0.7 0.73 ± 6% perf-profile.calltrace.cycles-pp.arch_stack_walk.stack_trace_save_tsk.__account_scheduler_latency.enqueue_entity.enqueue_task_fair 0.00 +0.8 0.78 ± 8% perf-profile.calltrace.cycles-pp.__sched_text_start.schedule.io_schedule.__lock_page.do_wp_page 0.00 +0.8 0.79 ± 8% perf-profile.calltrace.cycles-pp.schedule.io_schedule.__lock_page.do_wp_page.handle_pte_fault 0.00 +0.8 0.81 ± 8% perf-profile.calltrace.cycles-pp.io_schedule.__lock_page.do_wp_page.handle_pte_fault.__handle_mm_fault 0.00 +0.8 0.82 ± 5% perf-profile.calltrace.cycles-pp.stack_trace_save_tsk.__account_scheduler_latency.enqueue_entity.enqueue_task_fair.activate_task 0.00 +0.9 0.90 ± 3% perf-profile.calltrace.cycles-pp.__pagevec_lru_add_fn.pagevec_lru_move_fn.__lru_cache_add.wp_page_copy.do_wp_page 0.00 +1.0 1.00 ± 5% perf-profile.calltrace.cycles-pp.__account_scheduler_latency.enqueue_entity.enqueue_task_fair.activate_task.ttwu_do_activate 0.00 +1.0 1.01 ± 2% perf-profile.calltrace.cycles-pp.native_flush_tlb_one_user.flush_tlb_func_common.flush_tlb_mm_range.ptep_clear_flush.wp_page_copy 0.00 +1.1 1.06 ± 8% perf-profile.calltrace.cycles-pp.free_pcppages_bulk.free_unref_page_list.release_pages.tlb_flush_mmu.unmap_page_range 0.00 +1.1 1.13 ± 2% perf-profile.calltrace.cycles-pp.flush_tlb_func_common.flush_tlb_mm_range.ptep_clear_flush.wp_page_copy.do_wp_page 0.00 +1.1 1.14 ± 8% perf-profile.calltrace.cycles-pp.__lock_page.do_wp_page.handle_pte_fault.__handle_mm_fault.handle_mm_fault 0.00 +1.1 1.14 perf-profile.calltrace.cycles-pp.mem_cgroup_try_charge_delay.wp_page_copy.do_wp_page.handle_pte_fault.__handle_mm_fault 20.47 +1.2 21.64 ± 2% perf-profile.calltrace.cycles-pp.do_rw_once 0.00 +1.3 1.28 ± 16% perf-profile.calltrace.cycles-pp.native_queued_spin_lock_slowpath._raw_spin_lock_irqsave.release_pages.tlb_flush_mmu.unmap_page_range 0.00 +1.3 1.29 ± 6% perf-profile.calltrace.cycles-pp.enqueue_entity.enqueue_task_fair.activate_task.ttwu_do_activate.try_to_wake_up 0.00 +1.3 1.29 ± 15% perf-profile.calltrace.cycles-pp._raw_spin_lock_irqsave.release_pages.tlb_flush_mmu.unmap_page_range.unmap_vmas 0.00 +1.4 1.35 ± 6% perf-profile.calltrace.cycles-pp.free_unref_page_list.release_pages.tlb_flush_mmu.unmap_page_range.unmap_vmas 0.00 +1.4 1.35 ± 6% perf-profile.calltrace.cycles-pp.enqueue_task_fair.activate_task.ttwu_do_activate.try_to_wake_up.autoremove_wake_function 0.00 +1.4 1.36 ± 5% perf-profile.calltrace.cycles-pp.activate_task.ttwu_do_activate.try_to_wake_up.autoremove_wake_function.__wake_up_common 0.00 +1.4 1.36 ± 5% perf-profile.calltrace.cycles-pp.ttwu_do_activate.try_to_wake_up.autoremove_wake_function.__wake_up_common.wake_up_page_bit 0.00 +1.4 1.45 perf-profile.calltrace.cycles-pp.get_page_from_freelist.__alloc_pages_nodemask.alloc_pages_vma.wp_page_copy.do_wp_page 0.00 +1.6 1.59 perf-profile.calltrace.cycles-pp.pagevec_lru_move_fn.__lru_cache_add.wp_page_copy.do_wp_page.handle_pte_fault 0.00 +1.6 1.64 ± 2% perf-profile.calltrace.cycles-pp.flush_tlb_mm_range.ptep_clear_flush.wp_page_copy.do_wp_page.handle_pte_fault 0.00 +1.7 1.75 perf-profile.calltrace.cycles-pp.__lru_cache_add.wp_page_copy.do_wp_page.handle_pte_fault.__handle_mm_fault 0.00 +1.8 1.77 perf-profile.calltrace.cycles-pp.ptep_clear_flush.wp_page_copy.do_wp_page.handle_pte_fault.__handle_mm_fault 0.00 +2.0 1.98 perf-profile.calltrace.cycles-pp.__alloc_pages_nodemask.alloc_pages_vma.wp_page_copy.do_wp_page.handle_pte_fault 0.00 +2.2 2.17 ± 5% perf-profile.calltrace.cycles-pp.try_to_wake_up.autoremove_wake_function.__wake_up_common.wake_up_page_bit.do_wp_page 0.00 +2.2 2.19 ± 4% perf-profile.calltrace.cycles-pp.autoremove_wake_function.__wake_up_common.wake_up_page_bit.do_wp_page.handle_pte_fault 0.00 +2.2 2.23 perf-profile.calltrace.cycles-pp.alloc_pages_vma.wp_page_copy.do_wp_page.handle_pte_fault.__handle_mm_fault 0.00 +2.3 2.26 ± 5% perf-profile.calltrace.cycles-pp.__wake_up_common.wake_up_page_bit.do_wp_page.handle_pte_fault.__handle_mm_fault 0.93 ± 9% +2.5 3.45 ± 10% perf-profile.calltrace.cycles-pp.intel_idle.cpuidle_enter_state.cpuidle_enter.do_idle.cpu_startup_entry 0.00 +2.6 2.65 ± 5% perf-profile.calltrace.cycles-pp.wake_up_page_bit.do_wp_page.handle_pte_fault.__handle_mm_fault.handle_mm_fault 0.00 +2.6 2.65 ± 6% perf-profile.calltrace.cycles-pp.poll_idle.cpuidle_enter_state.cpuidle_enter.do_idle.cpu_startup_entry 0.00 +3.5 3.54 ± 7% perf-profile.calltrace.cycles-pp.release_pages.tlb_flush_mmu.unmap_page_range.unmap_vmas.exit_mmap 0.00 +3.7 3.73 ± 7% perf-profile.calltrace.cycles-pp.tlb_flush_mmu.unmap_page_range.unmap_vmas.exit_mmap.mmput 0.78 ± 5% +3.8 4.58 ± 6% perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe 0.78 ± 5% +3.8 4.58 ± 6% perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe 0.76 ± 5% +3.8 4.57 ± 6% perf-profile.calltrace.cycles-pp.__x64_sys_exit_group.do_syscall_64.entry_SYSCALL_64_after_hwframe 0.76 ± 5% +3.8 4.57 ± 6% perf-profile.calltrace.cycles-pp.do_group_exit.__x64_sys_exit_group.do_syscall_64.entry_SYSCALL_64_after_hwframe 0.76 ± 5% +3.8 4.57 ± 6% perf-profile.calltrace.cycles-pp.do_exit.do_group_exit.__x64_sys_exit_group.do_syscall_64.entry_SYSCALL_64_after_hwframe 0.75 ± 6% +3.8 4.57 ± 6% perf-profile.calltrace.cycles-pp.mmput.do_exit.do_group_exit.__x64_sys_exit_group.do_syscall_64 0.75 ± 6% +3.8 4.57 ± 6% perf-profile.calltrace.cycles-pp.exit_mmap.mmput.do_exit.do_group_exit.__x64_sys_exit_group 0.00 +4.4 4.37 ± 6% perf-profile.calltrace.cycles-pp.unmap_page_range.unmap_vmas.exit_mmap.mmput.do_exit 0.00 +4.4 4.38 ± 6% perf-profile.calltrace.cycles-pp.unmap_vmas.exit_mmap.mmput.do_exit.do_group_exit 1.03 ± 8% +5.2 6.24 ± 7% perf-profile.calltrace.cycles-pp.cpuidle_enter_state.cpuidle_enter.do_idle.cpu_startup_entry.start_secondary 1.04 ± 7% +5.2 6.25 ± 7% perf-profile.calltrace.cycles-pp.cpuidle_enter.do_idle.cpu_startup_entry.start_secondary.secondary_startup_64 37.96 +5.4 43.33 perf-profile.calltrace.cycles-pp.page_fault.do_access 1.10 ± 8% +6.6 7.70 ± 6% perf-profile.calltrace.cycles-pp.do_idle.cpu_startup_entry.start_secondary.secondary_startup_64 1.10 ± 8% +6.6 7.72 ± 6% perf-profile.calltrace.cycles-pp.cpu_startup_entry.start_secondary.secondary_startup_64 1.10 ± 8% +6.6 7.72 ± 6% perf-profile.calltrace.cycles-pp.start_secondary.secondary_startup_64 1.19 ± 9% +7.2 8.35 ± 7% perf-profile.calltrace.cycles-pp.secondary_startup_64 0.00 +10.4 10.42 ± 3% perf-profile.calltrace.cycles-pp.copy_page.copy_user_highpage.wp_page_copy.do_wp_page.handle_pte_fault 0.00 +10.6 10.59 ± 3% perf-profile.calltrace.cycles-pp.copy_user_highpage.wp_page_copy.do_wp_page.handle_pte_fault.__handle_mm_fault 0.00 +13.3 13.26 ± 2% perf-profile.calltrace.cycles-pp.reuse_swap_page.do_wp_page.handle_pte_fault.__handle_mm_fault.handle_mm_fault 0.00 +20.9 20.90 ± 3% perf-profile.calltrace.cycles-pp.wp_page_copy.do_wp_page.handle_pte_fault.__handle_mm_fault.handle_mm_fault 0.00 +39.6 39.59 ± 2% perf-profile.calltrace.cycles-pp.do_wp_page.handle_pte_fault.__handle_mm_fault.handle_mm_fault.do_page_fault 0.00 +40.1 40.11 ± 2% perf-profile.calltrace.cycles-pp.handle_pte_fault.__handle_mm_fault.handle_mm_fault.do_page_fault.page_fault 42.83 ± 2% -42.4 0.39 ± 6% perf-profile.children.cycles-pp.do_huge_pmd_wp_page 41.09 ± 2% -41.1 0.00 perf-profile.children.cycles-pp.copy_user_huge_page 40.87 ± 2% -40.9 0.00 perf-profile.children.cycles-pp.copy_subpage 40.70 ± 2% -30.3 10.44 ± 3% perf-profile.children.cycles-pp.copy_page 42.88 -18.1 24.79 perf-profile.children.cycles-pp.do_rw_once 72.03 -13.4 58.66 perf-profile.children.cycles-pp.do_access 0.71 ± 6% -0.5 0.18 ± 11% perf-profile.children.cycles-pp.tlb_finish_mmu 1.13 ± 8% -0.5 0.62 ± 2% perf-profile.children.cycles-pp.apic_timer_interrupt 0.86 ± 8% -0.3 0.55 ± 3% perf-profile.children.cycles-pp.smp_apic_timer_interrupt 0.69 ± 11% -0.3 0.39 ± 4% perf-profile.children.cycles-pp.hrtimer_interrupt 0.53 ± 10% -0.2 0.29 ± 7% perf-profile.children.cycles-pp.__hrtimer_run_queues 0.42 ± 11% -0.2 0.19 ± 11% perf-profile.children.cycles-pp.tick_sched_timer 0.39 ± 10% -0.2 0.17 ± 12% perf-profile.children.cycles-pp.tick_sched_handle 0.37 ± 10% -0.2 0.17 ± 12% perf-profile.children.cycles-pp.update_process_times 0.24 ± 10% -0.2 0.09 ± 17% perf-profile.children.cycles-pp.scheduler_tick 0.20 ± 7% -0.1 0.06 ± 20% perf-profile.children.cycles-pp.task_tick_fair 0.79 ± 5% -0.1 0.69 ± 7% perf-profile.children.cycles-pp.ret_from_fork 0.78 ± 5% -0.1 0.69 ± 7% perf-profile.children.cycles-pp.kthread 0.77 ± 6% -0.1 0.68 ± 7% perf-profile.children.cycles-pp.memcpy_erms 0.77 ± 5% -0.1 0.69 ± 7% perf-profile.children.cycles-pp.worker_thread 0.77 ± 5% -0.1 0.69 ± 7% perf-profile.children.cycles-pp.process_one_work 0.77 ± 5% -0.1 0.69 ± 7% perf-profile.children.cycles-pp.drm_fb_helper_dirty_work 0.26 ± 7% -0.1 0.18 ± 17% perf-profile.children.cycles-pp.vm_mmap_pgoff 0.25 ± 9% -0.1 0.18 ± 15% perf-profile.children.cycles-pp.__get_user_pages 0.25 ± 8% -0.1 0.18 ± 15% perf-profile.children.cycles-pp.mmap64 0.25 ± 8% -0.1 0.18 ± 15% perf-profile.children.cycles-pp.__mm_populate 0.25 ± 8% -0.1 0.18 ± 15% perf-profile.children.cycles-pp.populate_vma_page_range 0.25 ± 8% -0.1 0.18 ± 18% perf-profile.children.cycles-pp.do_huge_pmd_anonymous_page 0.25 ± 9% -0.1 0.19 ± 17% perf-profile.children.cycles-pp.ksys_mmap_pgoff 0.23 ± 9% -0.1 0.17 ± 16% perf-profile.children.cycles-pp.clear_huge_page 0.11 ± 10% -0.1 0.05 ± 58% perf-profile.children.cycles-pp.io_serial_out 0.22 ± 8% -0.1 0.17 ± 15% perf-profile.children.cycles-pp.clear_page_erms 0.22 ± 9% -0.1 0.16 ± 17% perf-profile.children.cycles-pp.clear_subpage 0.00 +0.1 0.05 ± 8% perf-profile.children.cycles-pp.delay_tsc 0.00 +0.1 0.06 ± 9% perf-profile.children.cycles-pp.reweight_entity 0.00 +0.1 0.06 ± 9% perf-profile.children.cycles-pp.hrtimer_next_event_without 0.00 +0.1 0.06 ± 9% perf-profile.children.cycles-pp.cpuidle_governor_latency_req 0.00 +0.1 0.06 ± 11% perf-profile.children.cycles-pp.nr_iowait_cpu 0.00 +0.1 0.06 ± 11% perf-profile.children.cycles-pp.pmd_pfn 0.00 +0.1 0.06 ± 11% perf-profile.children.cycles-pp.rcu_idle_exit 0.00 +0.1 0.06 ± 13% perf-profile.children.cycles-pp.page_evictable 0.00 +0.1 0.07 ± 6% perf-profile.children.cycles-pp.uncharge_page 0.00 +0.1 0.07 ± 6% perf-profile.children.cycles-pp.pmd_devmap_trans_unstable 0.00 +0.1 0.07 ± 6% perf-profile.children.cycles-pp.delayacct_end 0.00 +0.1 0.07 ± 12% perf-profile.children.cycles-pp.finish_task_switch 0.00 +0.1 0.07 ± 21% perf-profile.children.cycles-pp.mem_cgroup_from_task 0.00 +0.1 0.07 ± 10% perf-profile.children.cycles-pp.free_unref_page_prepare 0.00 +0.1 0.07 ± 14% perf-profile.children.cycles-pp.read_tsc 0.00 +0.1 0.07 ± 10% perf-profile.children.cycles-pp.__update_load_avg_se 0.00 +0.1 0.07 perf-profile.children.cycles-pp.tick_nohz_idle_exit 0.00 +0.1 0.08 ± 6% perf-profile.children.cycles-pp.check_preempt_curr 0.03 ±100% +0.1 0.10 ± 8% perf-profile.children.cycles-pp.native_write_msr 0.00 +0.1 0.08 ± 5% perf-profile.children.cycles-pp.vm_normal_page 0.00 +0.1 0.08 ± 8% perf-profile.children.cycles-pp.ttwu_do_wakeup 0.00 +0.1 0.08 perf-profile.children.cycles-pp.find_next_bit 0.00 +0.1 0.08 ± 5% perf-profile.children.cycles-pp.kernel_text_address 0.00 +0.1 0.08 ± 10% perf-profile.children.cycles-pp.update_cfs_group 0.00 +0.1 0.09 ± 5% perf-profile.children.cycles-pp.fpregs_assert_state_consistent 0.00 +0.1 0.09 ± 9% perf-profile.children.cycles-pp.mem_cgroup_update_lru_size 0.00 +0.1 0.09 ± 14% perf-profile.children.cycles-pp.__update_load_avg_cfs_rq 0.00 +0.1 0.09 ± 13% perf-profile.children.cycles-pp.update_rq_clock 0.00 +0.1 0.10 ± 13% perf-profile.children.cycles-pp.get_vma_policy 0.00 +0.1 0.10 ± 11% perf-profile.children.cycles-pp.sched_clock 0.00 +0.1 0.10 ± 11% perf-profile.children.cycles-pp.native_sched_clock 0.00 +0.1 0.10 ± 4% perf-profile.children.cycles-pp.mem_cgroup_page_lruvec 0.00 +0.1 0.10 ± 8% perf-profile.children.cycles-pp.perf_exclude_event 0.07 ± 6% +0.1 0.17 ± 11% perf-profile.children.cycles-pp.update_curr 0.00 +0.1 0.10 ± 7% perf-profile.children.cycles-pp.__kernel_text_address 0.00 +0.1 0.10 ± 10% perf-profile.children.cycles-pp.mem_cgroup_uncharge_list 0.00 +0.1 0.11 ± 10% perf-profile.children.cycles-pp.orc_find 0.00 +0.1 0.11 ± 4% perf-profile.children.cycles-pp.__switch_to_asm 0.00 +0.1 0.11 ± 10% perf-profile.children.cycles-pp.sched_clock_cpu 0.00 +0.1 0.11 ± 6% perf-profile.children.cycles-pp.ktime_get 0.00 +0.1 0.11 ± 12% perf-profile.children.cycles-pp.stack_trace_consume_entry_nosched 0.00 +0.1 0.12 ± 7% perf-profile.children.cycles-pp.lru_cache_add_active_or_unevictable 0.00 +0.1 0.12 ± 11% perf-profile.children.cycles-pp.__unwind_start 0.00 +0.1 0.12 ± 3% perf-profile.children.cycles-pp.unwind_get_return_address 0.00 +0.1 0.13 ± 8% perf-profile.children.cycles-pp.down_read_trylock 0.12 ± 6% +0.1 0.26 ± 7% perf-profile.children.cycles-pp.___might_sleep 0.00 +0.1 0.13 ± 8% perf-profile.children.cycles-pp.free_unref_page_commit 0.00 +0.1 0.14 ± 8% perf-profile.children.cycles-pp._raw_spin_unlock_irqrestore 0.00 +0.1 0.14 ± 11% perf-profile.children.cycles-pp.__switch_to 0.00 +0.1 0.14 ± 5% perf-profile.children.cycles-pp.prepare_exit_to_usermode 0.00 +0.1 0.15 ± 12% perf-profile.children.cycles-pp.__next_timer_interrupt 0.00 +0.2 0.15 ± 12% perf-profile.children.cycles-pp.select_task_rq_fair 0.00 +0.2 0.15 ± 5% perf-profile.children.cycles-pp.__orc_find 0.00 +0.2 0.17 ± 5% perf-profile.children.cycles-pp._raw_spin_lock_irq 0.00 +0.2 0.17 ± 4% perf-profile.children.cycles-pp.__mod_zone_page_state 0.00 +0.2 0.17 ± 4% perf-profile.children.cycles-pp.up_read 0.08 ± 10% +0.2 0.26 ± 8% perf-profile.children.cycles-pp.update_load_avg 0.00 +0.2 0.19 ± 8% perf-profile.children.cycles-pp.__might_sleep 0.00 +0.2 0.19 ± 4% perf-profile.children.cycles-pp.free_pages_and_swap_cache 0.00 +0.2 0.20 ± 8% perf-profile.children.cycles-pp.set_next_entity 0.00 +0.2 0.21 ± 9% perf-profile.children.cycles-pp.get_next_timer_interrupt 0.00 +0.2 0.21 ± 8% perf-profile.children.cycles-pp.__count_memcg_events 0.00 +0.2 0.22 ± 8% perf-profile.children.cycles-pp.unlock_page 0.00 +0.2 0.22 ± 3% perf-profile.children.cycles-pp.__list_add_valid 0.00 +0.2 0.24 ± 3% perf-profile.children.cycles-pp.vmacache_find 0.00 +0.2 0.24 ± 7% perf-profile.children.cycles-pp.__split_huge_pmd_locked 0.00 +0.2 0.25 ± 9% perf-profile.children.cycles-pp.__split_huge_pmd 0.00 +0.2 0.25 ± 10% perf-profile.children.cycles-pp.tick_nohz_next_event 0.00 +0.3 0.26 ± 3% perf-profile.children.cycles-pp.find_vma 0.00 +0.3 0.27 ± 3% perf-profile.children.cycles-pp.__mod_memcg_state 0.00 +0.3 0.29 ± 2% perf-profile.children.cycles-pp.put_page 0.00 +0.3 0.29 ± 2% perf-profile.children.cycles-pp._find_next_bit 0.00 +0.3 0.30 ± 4% perf-profile.children.cycles-pp.swapgs_restore_regs_and_return_to_usermode 0.00 +0.3 0.30 ± 4% perf-profile.children.cycles-pp.mem_cgroup_throttle_swaprate 0.00 +0.3 0.31 ± 5% perf-profile.children.cycles-pp.pick_next_task_fair 0.00 +0.3 0.32 ± 4% perf-profile.children.cycles-pp.mem_cgroup_charge_statistics 0.00 +0.3 0.33 ± 3% perf-profile.children.cycles-pp.__mod_node_page_state 0.00 +0.3 0.33 ± 9% perf-profile.children.cycles-pp.tick_nohz_get_sleep_length 0.00 +0.3 0.35 ± 3% perf-profile.children.cycles-pp.get_mem_cgroup_from_mm 0.00 +0.3 0.35 ± 3% perf-profile.children.cycles-pp.cpumask_any_but 0.00 +0.4 0.35 ± 4% perf-profile.children.cycles-pp.page_add_new_anon_rmap 0.00 +0.4 0.36 ± 4% perf-profile.children.cycles-pp.__mod_lruvec_state 0.00 +0.4 0.38 ± 11% perf-profile.children.cycles-pp.dequeue_entity 0.00 +0.4 0.44 ± 9% perf-profile.children.cycles-pp.dequeue_task_fair 1.27 ± 12% +0.5 1.73 ± 4% perf-profile.children.cycles-pp.get_page_from_freelist 0.00 +0.5 0.47 ± 2% perf-profile.children.cycles-pp.page_remove_rmap 0.00 +0.5 0.48 ± 4% perf-profile.children.cycles-pp.mem_cgroup_commit_charge 0.00 +0.5 0.49 ± 4% perf-profile.children.cycles-pp.mem_cgroup_try_charge 0.00 +0.5 0.52 ± 5% perf-profile.children.cycles-pp.unwind_next_frame 0.08 ± 74% +0.5 0.63 ± 28% perf-profile.children.cycles-pp.start_kernel 0.00 +0.6 0.56 ± 6% perf-profile.children.cycles-pp.menu_select 0.00 +0.6 0.59 ± 5% perf-profile.children.cycles-pp.schedule_idle 0.06 ± 6% +0.7 0.78 ± 4% perf-profile.children.cycles-pp.___perf_sw_event 0.01 ±173% +0.8 0.80 ± 3% perf-profile.children.cycles-pp.__list_del_entry_valid 0.04 ± 58% +0.8 0.85 ± 8% perf-profile.children.cycles-pp._raw_spin_lock 0.00 +0.8 0.82 ± 6% perf-profile.children.cycles-pp.arch_stack_walk 0.06 ± 6% +0.8 0.91 ± 4% perf-profile.children.cycles-pp.__perf_sw_event 0.00 +0.9 0.89 ± 9% perf-profile.children.cycles-pp.schedule 0.00 +0.9 0.90 ± 9% perf-profile.children.cycles-pp.io_schedule 0.00 +0.9 0.91 ± 5% perf-profile.children.cycles-pp.stack_trace_save_tsk 1.27 ± 12% +1.0 2.31 ± 3% perf-profile.children.cycles-pp.__alloc_pages_nodemask 0.00 +1.1 1.05 ± 2% perf-profile.children.cycles-pp.__pagevec_lru_add_fn 0.00 +1.1 1.10 ± 5% perf-profile.children.cycles-pp.__account_scheduler_latency 0.00 +1.2 1.16 ± 7% perf-profile.children.cycles-pp.free_pcppages_bulk 0.00 +1.2 1.19 ± 4% perf-profile.children.cycles-pp.native_flush_tlb_one_user 0.00 +1.3 1.26 ± 8% perf-profile.children.cycles-pp.__lock_page 0.00 +1.3 1.32 ± 4% perf-profile.children.cycles-pp.flush_tlb_func_common 1.27 ± 12% +1.3 2.60 ± 3% perf-profile.children.cycles-pp.alloc_pages_vma 0.00 +1.4 1.37 ± 4% perf-profile.children.cycles-pp.mem_cgroup_try_charge_delay 0.00 +1.4 1.37 ± 3% perf-profile.children.cycles-pp.sync_regs 0.04 ± 58% +1.4 1.46 ± 7% perf-profile.children.cycles-pp.__sched_text_start 0.00 +1.4 1.45 ± 6% perf-profile.children.cycles-pp.enqueue_entity 0.00 +1.5 1.48 ± 6% perf-profile.children.cycles-pp.free_unref_page_list 0.00 +1.5 1.51 ± 5% perf-profile.children.cycles-pp.enqueue_task_fair 0.00 +1.5 1.52 ± 5% perf-profile.children.cycles-pp.activate_task 0.00 +1.5 1.52 ± 5% perf-profile.children.cycles-pp.ttwu_do_activate 0.64 ± 27% +1.7 2.35 ± 10% perf-profile.children.cycles-pp._raw_spin_lock_irqsave 0.07 ± 10% +1.8 1.84 ± 3% perf-profile.children.cycles-pp.native_irq_return_iret 0.04 ± 58% +1.8 1.86 ± 2% perf-profile.children.cycles-pp.pagevec_lru_move_fn 0.01 ±173% +1.9 1.92 ± 3% perf-profile.children.cycles-pp.flush_tlb_mm_range 0.06 ± 14% +2.0 2.03 ± 3% perf-profile.children.cycles-pp.__lru_cache_add 0.60 ± 29% +2.0 2.58 ± 11% perf-profile.children.cycles-pp.native_queued_spin_lock_slowpath 0.00 +2.1 2.06 ± 3% perf-profile.children.cycles-pp.ptep_clear_flush 0.06 ± 6% +2.4 2.43 ± 5% perf-profile.children.cycles-pp.try_to_wake_up 0.01 ±173% +2.4 2.45 ± 5% perf-profile.children.cycles-pp.autoremove_wake_function 0.03 ±100% +2.5 2.53 ± 5% perf-profile.children.cycles-pp.__wake_up_common 1.01 ± 10% +2.7 3.70 ± 9% perf-profile.children.cycles-pp.intel_idle 0.06 ± 20% +2.9 2.95 ± 5% perf-profile.children.cycles-pp.wake_up_page_bit 0.00 +2.9 2.90 ± 9% perf-profile.children.cycles-pp.poll_idle 0.71 ± 6% +3.2 3.88 ± 7% perf-profile.children.cycles-pp.release_pages 0.71 ± 6% +3.2 3.92 ± 7% perf-profile.children.cycles-pp.tlb_flush_mmu 43.16 ± 2% +3.6 46.72 ± 2% perf-profile.children.cycles-pp.page_fault 1.73 ± 4% +3.6 5.37 ± 6% perf-profile.children.cycles-pp.entry_SYSCALL_64_after_hwframe 1.73 ± 4% +3.6 5.37 ± 6% perf-profile.children.cycles-pp.do_syscall_64 0.76 ± 6% +3.8 4.57 ± 6% perf-profile.children.cycles-pp.__x64_sys_exit_group 0.76 ± 6% +3.8 4.57 ± 6% perf-profile.children.cycles-pp.do_group_exit 0.76 ± 6% +3.8 4.57 ± 6% perf-profile.children.cycles-pp.do_exit 0.76 ± 5% +3.8 4.57 ± 6% perf-profile.children.cycles-pp.mmput 0.76 ± 5% +3.8 4.57 ± 6% perf-profile.children.cycles-pp.exit_mmap 0.03 ±100% +4.4 4.38 ± 6% perf-profile.children.cycles-pp.unmap_vmas 0.01 ±173% +4.4 4.38 ± 6% perf-profile.children.cycles-pp.unmap_page_range 1.12 ± 9% +5.6 6.75 ± 8% perf-profile.children.cycles-pp.cpuidle_enter_state 1.12 ± 9% +5.6 6.75 ± 8% perf-profile.children.cycles-pp.cpuidle_enter 1.10 ± 8% +6.6 7.72 ± 6% perf-profile.children.cycles-pp.start_secondary 1.19 ± 9% +7.2 8.34 ± 7% perf-profile.children.cycles-pp.do_idle 1.19 ± 9% +7.2 8.35 ± 7% perf-profile.children.cycles-pp.secondary_startup_64 1.19 ± 9% +7.2 8.35 ± 7% perf-profile.children.cycles-pp.cpu_startup_entry 0.07 ± 26% +10.5 10.59 ± 3% perf-profile.children.cycles-pp.copy_user_highpage 0.21 ± 5% +13.2 13.39 ± 2% perf-profile.children.cycles-pp.reuse_swap_page 0.11 ± 32% +20.8 20.95 ± 3% perf-profile.children.cycles-pp.wp_page_copy 0.12 ± 29% +39.5 39.62 ± 2% perf-profile.children.cycles-pp.do_wp_page 0.15 ± 22% +40.0 40.14 ± 2% perf-profile.children.cycles-pp.handle_pte_fault 40.29 ± 2% -29.9 10.38 ± 3% perf-profile.self.cycles-pp.copy_page 41.94 -17.9 24.04 perf-profile.self.cycles-pp.do_rw_once 0.11 ± 10% -0.1 0.05 ± 58% perf-profile.self.cycles-pp.io_serial_out 0.22 ± 9% -0.1 0.16 ± 17% perf-profile.self.cycles-pp.clear_page_erms 0.05 ± 9% +0.0 0.09 ± 13% perf-profile.self.cycles-pp.update_curr 0.06 ± 11% +0.1 0.11 ± 7% perf-profile.self.cycles-pp.update_load_avg 0.00 +0.1 0.05 ± 8% perf-profile.self.cycles-pp.reweight_entity 0.00 +0.1 0.05 ± 8% perf-profile.self.cycles-pp.finish_task_switch 0.00 +0.1 0.05 ± 8% perf-profile.self.cycles-pp.delay_tsc 0.00 +0.1 0.06 ± 9% perf-profile.self.cycles-pp.dequeue_task_fair 0.00 +0.1 0.06 ± 15% perf-profile.self.cycles-pp.wake_up_page_bit 0.00 +0.1 0.06 ± 7% perf-profile.self.cycles-pp.pmd_pfn 0.00 +0.1 0.06 ± 11% perf-profile.self.cycles-pp.free_unref_page_prepare 0.00 +0.1 0.06 ± 11% perf-profile.self.cycles-pp.nr_iowait_cpu 0.00 +0.1 0.06 perf-profile.self.cycles-pp.find_next_bit 0.00 +0.1 0.06 ± 6% perf-profile.self.cycles-pp.__next_timer_interrupt 0.00 +0.1 0.06 ± 13% perf-profile.self.cycles-pp.uncharge_page 0.00 +0.1 0.06 ± 6% perf-profile.self.cycles-pp.vm_normal_page 0.00 +0.1 0.07 ± 7% perf-profile.self.cycles-pp.pmd_devmap_trans_unstable 0.00 +0.1 0.07 ± 17% perf-profile.self.cycles-pp.mem_cgroup_from_task 0.00 +0.1 0.07 ± 16% perf-profile.self.cycles-pp.stack_trace_consume_entry_nosched 0.00 +0.1 0.07 ± 12% perf-profile.self.cycles-pp.read_tsc 0.00 +0.1 0.07 ± 10% perf-profile.self.cycles-pp.__update_load_avg_se 0.00 +0.1 0.08 ± 6% perf-profile.self.cycles-pp.stack_trace_save_tsk 0.03 ±100% +0.1 0.10 ± 8% perf-profile.self.cycles-pp.native_write_msr 0.00 +0.1 0.08 ± 15% perf-profile.self.cycles-pp.__update_load_avg_cfs_rq 0.00 +0.1 0.08 ± 5% perf-profile.self.cycles-pp.mem_cgroup_update_lru_size 0.00 +0.1 0.08 ± 5% perf-profile.self.cycles-pp.fpregs_assert_state_consistent 0.00 +0.1 0.08 ± 13% perf-profile.self.cycles-pp.update_cfs_group 0.00 +0.1 0.09 ± 9% perf-profile.self.cycles-pp.get_vma_policy 0.00 +0.1 0.09 ± 9% perf-profile.self.cycles-pp.free_unref_page_list 0.00 +0.1 0.09 ± 11% perf-profile.self.cycles-pp.perf_exclude_event 0.00 +0.1 0.09 ± 20% perf-profile.self.cycles-pp.select_task_rq_fair 0.00 +0.1 0.09 ± 7% perf-profile.self.cycles-pp.mem_cgroup_charge_statistics 0.00 +0.1 0.09 ± 7% perf-profile.self.cycles-pp.mem_cgroup_page_lruvec 0.00 +0.1 0.10 ± 5% perf-profile.self.cycles-pp.mem_cgroup_try_charge 0.00 +0.1 0.10 ± 9% perf-profile.self.cycles-pp._raw_spin_unlock_irqrestore 0.00 +0.1 0.10 ± 11% perf-profile.self.cycles-pp.native_sched_clock 0.00 +0.1 0.10 ± 8% perf-profile.self.cycles-pp.free_unref_page_commit 0.00 +0.1 0.10 ± 10% perf-profile.self.cycles-pp.do_idle 0.00 +0.1 0.10 ± 14% perf-profile.self.cycles-pp.orc_find 0.00 +0.1 0.10 ± 10% perf-profile.self.cycles-pp.alloc_pages_vma 0.00 +0.1 0.11 ± 4% perf-profile.self.cycles-pp.__switch_to_asm 0.00 +0.1 0.12 ± 9% perf-profile.self.cycles-pp.cpumask_any_but 0.00 +0.1 0.12 ± 7% perf-profile.self.cycles-pp.lru_cache_add_active_or_unevictable 0.00 +0.1 0.12 ± 5% perf-profile.self.cycles-pp.down_read_trylock 0.00 +0.1 0.12 ± 8% perf-profile.self.cycles-pp.mem_cgroup_commit_charge 0.00 +0.1 0.12 ± 4% perf-profile.self.cycles-pp.pagevec_lru_move_fn 0.00 +0.1 0.13 ± 7% perf-profile.self.cycles-pp.flush_tlb_func_common 0.11 ± 7% +0.1 0.25 ± 7% perf-profile.self.cycles-pp.___might_sleep 0.00 +0.1 0.13 ± 9% perf-profile.self.cycles-pp.set_next_entity 0.00 +0.1 0.14 ± 11% perf-profile.self.cycles-pp.enqueue_entity 0.00 +0.1 0.14 ± 3% perf-profile.self.cycles-pp.__mod_lruvec_state 0.00 +0.1 0.14 ± 11% perf-profile.self.cycles-pp.__switch_to 0.00 +0.1 0.14 ± 8% perf-profile.self.cycles-pp.copy_user_highpage 0.00 +0.1 0.15 ± 7% perf-profile.self.cycles-pp.__orc_find 0.00 +0.1 0.15 ± 3% perf-profile.self.cycles-pp.menu_select 0.00 +0.1 0.15 ± 7% perf-profile.self.cycles-pp.ptep_clear_flush 0.00 +0.1 0.15 ± 7% perf-profile.self.cycles-pp.__lock_page 0.00 +0.1 0.15 ± 5% perf-profile.self.cycles-pp.page_add_new_anon_rmap 0.00 +0.2 0.15 ± 3% perf-profile.self.cycles-pp.__account_scheduler_latency 0.00 +0.2 0.16 ± 9% perf-profile.self.cycles-pp.swapgs_restore_regs_and_return_to_usermode 0.00 +0.2 0.16 ± 5% perf-profile.self.cycles-pp.__mod_zone_page_state 0.00 +0.2 0.17 ± 4% perf-profile.self.cycles-pp.up_read 0.00 +0.2 0.17 ± 8% perf-profile.self.cycles-pp.__lru_cache_add 0.00 +0.2 0.18 ± 6% perf-profile.self.cycles-pp.__might_sleep 0.00 +0.2 0.19 ± 5% perf-profile.self.cycles-pp.free_pages_and_swap_cache 0.00 +0.2 0.20 ± 3% perf-profile.self.cycles-pp.__perf_sw_event 0.00 +0.2 0.20 ± 4% perf-profile.self.cycles-pp.__list_add_valid 0.00 +0.2 0.20 ± 9% perf-profile.self.cycles-pp.__count_memcg_events 0.00 +0.2 0.21 ± 7% perf-profile.self.cycles-pp.unlock_page 0.00 +0.2 0.21 ± 3% perf-profile.self.cycles-pp.mem_cgroup_throttle_swaprate 0.00 +0.2 0.22 ± 3% perf-profile.self.cycles-pp.page_fault 0.00 +0.2 0.22 ± 3% perf-profile.self.cycles-pp.handle_pte_fault 0.00 +0.2 0.23 ± 7% perf-profile.self.cycles-pp.__split_huge_pmd_locked 0.00 +0.2 0.23 perf-profile.self.cycles-pp.vmacache_find 0.06 ± 14% +0.2 0.30 ± 4% perf-profile.self.cycles-pp._raw_spin_lock_irqsave 0.00 +0.2 0.24 ± 6% perf-profile.self.cycles-pp.unwind_next_frame 0.00 +0.2 0.24 ± 6% perf-profile.self.cycles-pp.flush_tlb_mm_range 0.00 +0.2 0.25 ± 4% perf-profile.self.cycles-pp._find_next_bit 0.00 +0.3 0.26 ± 12% perf-profile.self.cycles-pp.__sched_text_start 0.00 +0.3 0.26 ± 3% perf-profile.self.cycles-pp.__mod_memcg_state 0.00 +0.3 0.29 ± 3% perf-profile.self.cycles-pp.put_page 0.00 +0.3 0.32 ± 4% perf-profile.self.cycles-pp.do_page_fault 0.00 +0.3 0.32 ± 3% perf-profile.self.cycles-pp.__mod_node_page_state 0.00 +0.3 0.33 perf-profile.self.cycles-pp.__alloc_pages_nodemask 0.00 +0.3 0.34 ± 3% perf-profile.self.cycles-pp.get_mem_cgroup_from_mm 0.00 +0.4 0.41 ± 3% perf-profile.self.cycles-pp.unmap_page_range 0.00 +0.4 0.43 perf-profile.self.cycles-pp.page_remove_rmap 0.00 +0.4 0.43 ± 2% perf-profile.self.cycles-pp.release_pages 0.01 ±173% +0.4 0.45 ± 2% perf-profile.self.cycles-pp._raw_spin_lock 0.00 +0.4 0.45 ± 3% perf-profile.self.cycles-pp.try_to_wake_up 0.00 +0.5 0.45 perf-profile.self.cycles-pp.wp_page_copy 0.00 +0.5 0.52 ± 3% perf-profile.self.cycles-pp.free_pcppages_bulk 0.00 +0.6 0.57 ± 6% perf-profile.self.cycles-pp.handle_mm_fault 0.66 +0.6 1.24 ± 5% perf-profile.self.cycles-pp.get_page_from_freelist 0.05 ± 8% +0.6 0.64 ± 5% perf-profile.self.cycles-pp.___perf_sw_event 0.00 +0.6 0.62 ± 2% perf-profile.self.cycles-pp.__pagevec_lru_add_fn 0.00 +0.6 0.63 ± 6% perf-profile.self.cycles-pp.mem_cgroup_try_charge_delay 0.00 +0.7 0.67 ± 9% perf-profile.self.cycles-pp.__handle_mm_fault 0.01 ±173% +0.8 0.78 ± 3% perf-profile.self.cycles-pp.__list_del_entry_valid 0.00 +0.9 0.87 perf-profile.self.cycles-pp.do_wp_page 0.00 +1.2 1.19 ± 4% perf-profile.self.cycles-pp.native_flush_tlb_one_user 0.00 +1.4 1.37 ± 4% perf-profile.self.cycles-pp.sync_regs 0.07 ± 10% +1.8 1.84 ± 3% perf-profile.self.cycles-pp.native_irq_return_iret 0.60 ± 29% +2.0 2.58 ± 11% perf-profile.self.cycles-pp.native_queued_spin_lock_slowpath 10.62 ± 2% +2.5 13.09 perf-profile.self.cycles-pp.do_access 1.01 ± 10% +2.7 3.70 ± 9% perf-profile.self.cycles-pp.intel_idle 0.00 +2.9 2.85 ± 8% perf-profile.self.cycles-pp.poll_idle 0.21 ± 5% +13.1 13.30 ± 2% perf-profile.self.cycles-pp.reuse_swap_page vm-scalability.time.user_time 2600 +--------------------------------------------------------------------+ |.+..+.+..+.+..+.+..+.+..+.+..+. .+. .| 2500 |-+ +..+.+.+..+.+. +..+.+..+.+..+.+..+ | 2400 |-+ | | | 2300 |-+ | 2200 |-+ | | | 2100 |-+ | 2000 |-+ | | | 1900 |-+ | 1800 |-+ | | O O O O O O O O O O O O O O O O O O O O O O O O O O | 1700 +--------------------------------------------------------------------+ vm-scalability.time.system_time 2300 +--------------------------------------------------------------------+ | O O O O O O O O O O O | 2250 |-O O O O O O O O O O | 2200 |-+ O O O | | | 2150 |-+ | 2100 |-+ | | | 2050 |-+ | 2000 |-+ | | | 1950 |-+ | 1900 |.+.. .+.. .+..+.+.. .+..+.+.+..+.+..+.+..+.+..+.+.. .+.| | +.+..+ + +.+..+ +.+. | 1850 +--------------------------------------------------------------------+ vm-scalability.time.percent_of_cpu_this_job_got 1480 +--------------------------------------------------------------------+ |.+..+.+..+.+..+.+..+.+..+.+..+. .+..+. .+. .| 1460 |-+ +..+.+.+..+ +. +..+.+..+.+..+ | 1440 |-+ | | | 1420 |-+ | 1400 |-+ | | | 1380 |-+ | 1360 |-+ | | O O O O O O O O O O | 1340 |-O O O O | 1320 |-+ O O O | | O O O O O O O O O | 1300 +--------------------------------------------------------------------+ vm-scalability.time.minor_page_faults 7e+08 +-------------------------------------------------------------------+ | O O O O O O O O O O O O O O O O O O O O O O O O O O | 6e+08 |-+ | | | 5e+08 |-+ | | | 4e+08 |-+ | | | 3e+08 |-+ | | | 2e+08 |-+ | | | 1e+08 |-+ | | | 0 +-------------------------------------------------------------------+ vm-scalability.time.voluntary_context_switches 6e+07 +-------------------------------------------------------------------+ | O O O O O O | 5e+07 |-O O O O O O O O O O O O O O | | O O O O O O | | | 4e+07 |-+ | | | 3e+07 |-+ | | | 2e+07 |-+ | | | | | 1e+07 |-+ | | | 0 +-------------------------------------------------------------------+ vm-scalability.throughput 1.18e+07 +----------------------------------------------------------------+ | | 1.16e+07 |-O O O O O O O O O O O O O | | O O O O O O O O O | 1.14e+07 |-+ O O O O | | | 1.12e+07 |-+ | | | 1.1e+07 |-+ | | | 1.08e+07 |-+ | | | 1.06e+07 |-+ | |.+..+.+.+..+.+.+..+.+.+..+.+.+..+.+.+.+..+.+.+..+.+.+..+.+.+..+.| 1.04e+07 +----------------------------------------------------------------+ vm-scalability.median 730000 +------------------------------------------------------------------+ | O O O O O O O O O O O O O O O | 720000 |-+ O O O O O O O | 710000 |-+ O O O O | | | 700000 |-+ | | | 690000 |-+ | | | 680000 |-+ | 670000 |-+ | | | 660000 |-+ .+.+..+.+.+..+.+..+.+.+..+.+..+.+.+..+.+.+..+.+..+.+.+..+.+..+.| |.+. | 650000 +------------------------------------------------------------------+ [*] bisect-good sample [O] bisect-bad sample Disclaimer: Results have been estimated based on internal Intel analysis and are provided for informational purposes only. Any difference in system hardware or software design or configuration may affect actual performance. Thanks, Rong Chen