79 Commits

Author SHA1 Message Date
Wilson Sung
03513ea269 Merge android-4.14-stable (4.14.248) into android-msm-pixel-4.14-lts
Merge 4.14.248 into android-4.14-stable
Linux 4.14.248
    drm/nouveau/nvkm: Replace -ENOSYS with -ENODEV
    blk-throttle: fix UAF by deleteing timer in blk_throtl_exit()
    pwm: rockchip: Don't modify HW state in .remove() callback
    nilfs2: fix memory leak in nilfs_sysfs_delete_snapshot_group
    nilfs2: fix memory leak in nilfs_sysfs_create_snapshot_group
    nilfs2: fix memory leak in nilfs_sysfs_delete_##name##_group
    nilfs2: fix memory leak in nilfs_sysfs_create_##name##_group
    nilfs2: fix NULL pointer in nilfs_##name##_attr_release
    nilfs2: fix memory leak in nilfs_sysfs_create_device_group
    ceph: lockdep annotations for try_nonblocking_invalidate
    dmaengine: xilinx_dma: Set DMA mask for coherent APIs
  * dmaengine: ioat: depends on !UML
      drivers/dma/Kconfig
    parisc: Move pci_dev_is_behind_card_dino to where it is used
  * Kconfig.debug: drop selecting non-existing HARDLOCKUP_DETECTOR_ARCH
      lib/Kconfig.debug
    pwm: lpc32xx: Don't modify HW state in .probe() after the PWM chip was registered
  * profiling: fix shift-out-of-bounds bugs
      kernel/profile.c
  * prctl: allow to setup brk for et_dyn executables
      kernel/sys.c
    9p/trans_virtio: Remove sysfs file on probe failure
    thermal/drivers/exynos: Fix an error code in exynos_tmu_probe()
    dmaengine: acpi: Avoid comparison GSI with Linux vIRQ
  * sctp: add param size validation for SCTP_PARAM_SET_PRIMARY
      net/sctp/sm_make_chunk.c
  * sctp: validate chunk size in __rcv_asconf_lookup
      net/sctp/input.c
    crypto: talitos - fix max key size for sha384 and sha512
    apparmor: remove duplicate macro list_entry_is_head()
  * rcu: Fix missed wakeup of exp_wq waiters
      kernel/rcu/tree_exp.h
    s390/bpf: Fix optimizing out zero-extensions
    Merge 4.14.247 into android-4.14-stable
Linux 4.14.247
    s390/bpf: Fix 64-bit subtraction of the -0x80000000 constant
    net: renesas: sh_eth: Fix freeing wrong tx descriptor
    qlcnic: Remove redundant unlock in qlcnic_pinit_from_rom
  * netfilter: socket: icmp6: fix use-after-scope
      net/ipv6/netfilter/nf_socket_ipv6.c
    net: dsa: b53: Fix calculating number of switch ports
    ARC: export clear_user_page() for modules
    mtd: rawnand: cafe: Fix a resource leak in the error handling path of 'cafe_nand_probe()'
  * PCI: Sync __pci_register_driver() stub for CONFIG_PCI=n
      include/linux/pci.h
    ethtool: Fix an error code in cxgb2.c
    net: usb: cdc_mbim: avoid altsetting toggling for Telit LN920
  * PCI: Add ACS quirks for Cavium multi-function devices
      drivers/pci/quirks.c
    mfd: Don't use irq_create_mapping() to resolve a mapping
    dt-bindings: mtd: gpmc: Fix the ECC bytes vs. OOB bytes equation
  * mm/memory_hotplug: use "unsigned long" for PFN in zone_for_pfn_range()
      include/linux/memory_hotplug.h
  * tcp: fix tp->undo_retrans accounting in tcp_sacktag_one()
      net/ipv4/tcp_input.c
  * net/af_unix: fix a data-race in unix_dgram_poll
      include/linux/skbuff.h
      net/unix/af_unix.c
  * events: Reuse value read using READ_ONCE instead of re-reading it
      kernel/events/core.c
    tipc: increase timeout in tipc_sk_enqueue()
    r6040: Restore MDIO clock frequency after MAC reset
  * net/l2tp: Fix reference count leak in l2tp_udp_recv_core
      net/l2tp/l2tp_core.c
    dccp: don't duplicate ccid when cloning dccp sock
    ptp: dp83640: don't define PAGE0
    net-caif: avoid user-triggerable WARN_ON(1)
    x86/mm: Fix kern_addr_valid() to cope with existing but not present entries
    PM: base: power: don't try to use non-existing RTC for storing data
    bnx2x: Fix enabling network interfaces without VFs
    xen: reset legacy rtc flag for PV domU
    platform/chrome: cros_ec_proto: Send command again when timeout occurs
    memcg: enable accounting for pids in nested pid namespaces
  * mm/hugetlb: initialize hugetlb_usage in mm_init
      include/linux/hugetlb.h
      kernel/fork.c
    cpufreq: powernv: Fix init_chip_info initialization in numa=off
    scsi: qla2xxx: Sync queue idx with queue_pair_map idx
    scsi: BusLogic: Fix missing pr_cont() use
    parisc: fix crash with signals and alloca
    net: w5100: check return value after calling platform_get_resource()
    net: fix NULL pointer reference in cipso_v4_doi_free
    ath9k: fix sleeping in atomic context
    ath9k: fix OOB read ar9300_eeprom_restore_internal
    parport: remove non-zero check on count
    ASoC: rockchip: i2s: Fixup config for DAIFMT_DSP_A/B
    ASoC: rockchip: i2s: Fix regmap_ops hang
    usbip:vhci_hcd USB port can get stuck in the disabled state
    usbip: give back URBs for unsent unlink requests during cleanup
    usb: musb: musb_dsps: request_irq() after initializing musb
  * Revert "USB: xhci: fix U1/U2 handling for hardware with XHCI_INTEL_HOST quirk set"
      drivers/usb/host/xhci.c
    cifs: fix wrong release in sess_alloc_buffer() failed path
    selftests/bpf: Enlarge select() timeout for test_maps
    mmc: rtsx_pci: Fix long reads when clock is prescaled
    mmc: sdhci-of-arasan: Check return value of non-void funtions
    gfs2: Don't call dlm after protocol is unmounted
    staging: rts5208: Fix get_ms_information() heap buffer size
    rpc: fix gss_svc_init cleanup on failure
    ARM: tegra: tamonten: Fix UART pad setting
    gpu: drm: amd: amdgpu: amdgpu_i2c: fix possible uninitialized-variable access in amdgpu_i2c_router_select_ddc_port()
  * Bluetooth: avoid circular locks in sco_sock_connect
      net/bluetooth/sco.c
    net: ethernet: stmmac: Do not use unreachable() in ipq806x_gmac_probe()
    arm64: dts: qcom: sdm660: use reg value for memory node
  * media: v4l2-dv-timings.c: fix wrong condition in two for-loops
      drivers/media/v4l2-core/v4l2-dv-timings.c
    ASoC: Intel: bytcr_rt5640: Move "Platform Clock" routes to the maps for the matching in-/output
  * Bluetooth: skip invalid hci_sync_conn_complete_evt
      net/bluetooth/hci_event.c
    ata: sata_dwc_460ex: No need to call phy_exit() befre phy_init()
    staging: ks7010: Fix the initialization of the 'sleep_status' structure
    serial: 8250_pci: make setup_port() parameters explicitly unsigned
    hvsi: don't panic on tty_register_driver failure
    xtensa: ISS: don't panic in rs_init
    serial: 8250: Define RX trigger levels for OxSemi 950 devices
    s390/jump_label: print real address in a case of a jump label bug
  * flow_dissector: Fix out-of-bounds warnings
      net/core/flow_dissector.c
  * ipv4: ip_output.c: Fix out-of-bounds warning in ip_copy_addrs()
      net/ipv4/ip_output.c
    video: fbdev: riva: Error out if 'pixclock' equals zero
    video: fbdev: kyro: Error out if 'pixclock' equals zero
    video: fbdev: asiliantfb: Error out if 'pixclock' equals zero
    bpf/tests: Do not PASS tests without actually testing the result
    bpf/tests: Fix copy-and-paste error in double word test
    tty: serial: jsm: hold port lock when reporting modem line changes
    staging: board: Fix uninitialized spinlock when attaching genpd
  * usb: gadget: composite: Allow bMaxPower=0 if self-powered
      drivers/usb/gadget/composite.c
  * usb: gadget: u_ether: fix a potential null pointer dereference
      drivers/usb/gadget/function/u_ether.c
    usb: host: fotg210: fix the actual_length of an iso packet
    usb: host: fotg210: fix the endpoint's transactional opportunities calculation
    Smack: Fix wrong semantics in smk_access_entry()
  * netlink: Deal with ESRCH error in nlmsg_notify()
      net/netlink/af_netlink.c
    video: fbdev: kyro: fix a DoS bug by restricting user input
    ARM: dts: qcom: apq8064: correct clock names
    iio: dac: ad5624r: Fix incorrect handling of an optional regulator.
  * PCI: Use pci_update_current_state() in pci_enable_device_flags()
      drivers/pci/pci.c
    crypto: mxs-dcp - Use sg_mapping_iter to copy data
    media: dib8000: rewrite the init prbs logic
    MIPS: Malta: fix alignment of the devicetree buffer
    scsi: qedi: Fix error codes in qedi_alloc_global_queues()
    pinctrl: single: Fix error return code in pcs_parse_bits_in_pinctrl_entry()
    openrisc: don't printk() unconditionally
  * vfio: Use config not menuconfig for VFIO_NOIOMMU
      drivers/vfio/Kconfig
    pinctrl: samsung: Fix pinctrl bank pin count
    docs: Fix infiniband uverbs minor number
    RDMA/iwcm: Release resources if iw_cm module initialization fails
  * HID: input: do not report stylus battery state as "full"
      drivers/hid/hid-input.c
    PCI: aardvark: Fix masking and unmasking legacy INTx interrupts
    PCI: aardvark: Increase polling delay to 1.5s while waiting for PIO response
    PCI: xilinx-nwl: Enable the clock through CCF
  * PCI: Return ~0 data on pciconfig_read() CAP_SYS_ADMIN failure
      drivers/pci/syscall.c
  * PCI: Restrict ASMedia ASM1062 SATA Max Payload Size Supported
      drivers/pci/quirks.c
    ARM: 9105/1: atags_to_fdt: don't warn about stack size
    libata: add ATA_HORKAGE_NO_NCQ_TRIM for Samsung 860 and 870 SSDs
    media: rc-loopback: return number of emitters rather than error
  * media: uvc: don't do DMA on stack
      drivers/media/usb/uvc/uvc_v4l2.c
    VMCI: fix NULL pointer dereference when unmapping queue pair
  * dm crypt: Avoid percpu_counter spinlock contention in crypt_page_alloc()
      drivers/md/dm-crypt.c
    power: supply: max17042: handle fails of reading status register
    block: bfq: fix bfq_set_next_ioprio_data()
  * crypto: public_key: fix overflow during implicit conversion
      include/crypto/public_key.h
    soc: aspeed: lpc-ctrl: Fix boundary check for mmap
    9p/xen: Fix end of loop tests for list_for_each_entry
  * include/linux/list.h: add a macro to test if entry is pointing to the head
      include/linux/list.h
    xen: fix setting of max_pfn in shared_info
    powerpc/perf/hv-gpci: Fix counter value parsing
  * PCI/MSI: Skip masking MSI-X on Xen PV
      drivers/pci/msi.c
    blk-zoned: allow BLKREPORTZONE without CAP_SYS_ADMIN
    blk-zoned: allow zone management send operations without CAP_SYS_ADMIN
    rtc: tps65910: Correct driver module alias
  * fbmem: don't allow too huge resolutions
      drivers/video/fbdev/core/fbmem.c
    clk: kirkwood: Fix a clocking boot regression
    backlight: pwm_bl: Improve bootloader/kernel device handover
    IMA: remove -Wmissing-prototypes warning
    KVM: x86: Update vCPU's hv_clock before back to guest when tsc_offset is adjusted
    x86/resctrl: Fix a maybe-uninitialized build warning treated as error
  * tty: Fix data race between tiocsti() and flush_to_ldisc()
      drivers/tty/tty_io.c
  * netns: protect netns ID lookups with RCU
      net/core/net_namespace.c
    net: qualcomm: fix QCA7000 checksum handling
    net: sched: Fix qdisc_rate_table refcount leak when get tcf_block failed
  * ipv4: make exception cache less predictible
      net/ipv4/route.c
    bcma: Fix memory leak for internally-handled cores
    ath6kl: wmi: fix an error code in ath6kl_wmi_sync_point()
    tty: serial: fsl_lpuart: fix the wrong mapbase value
    usb: bdc: Fix an error handling path in 'bdc_probe()' when no suitable DMA config is available
    usb: ehci-orion: Handle errors of clk_prepare_enable() in probe
    i2c: mt65xx: fix IRQ check
    CIFS: Fix a potencially linear read overflow
    mmc: moxart: Fix issue with uninitialized dma_slave_config
    mmc: dw_mmc: Fix issue with uninitialized dma_slave_config
    i2c: s3c2410: fix IRQ check
    i2c: iop3xx: fix deferred probing
  * Bluetooth: add timeout sanity check to hci_inquiry
      net/bluetooth/hci_core.c
    usb: gadget: mv_u3d: request_irq() after initializing UDC
    mac80211: Fix insufficient headroom issue for AMSDU
    usb: phy: tahvo: add IRQ check
    usb: host: ohci-tmio: add IRQ check
  * Bluetooth: Move shutdown callback before flushing tx and rx queue
      net/bluetooth/hci_core.c
    usb: phy: twl6030: add IRQ checks
    usb: phy: fsl-usb: add IRQ check
    usb: gadget: udc: at91: add IRQ check
    drm/msm/dsi: Fix some reference counted resource leaks
  * Bluetooth: fix repeated calls to sco_sock_kill
      net/bluetooth/sco.c
    arm64: dts: exynos: correct GIC CPU interfaces address range on Exynos7
    Bluetooth: increase BTNAMSIZ to 21 chars to fix potential buffer overflow
    soc: qcom: smsm: Fix missed interrupts if state changes while masked
  * PCI: PM: Enable PME if it can be signaled from D3cold
      drivers/pci/pci.c
  * PCI: PM: Avoid forcing PCI_D0 for wakeup reasons inconsistently
      drivers/pci/pci.c
    media: em28xx-input: fix refcount bug in em28xx_usb_disconnect
    i2c: highlander: add IRQ check
    net: cipso: fix warnings in netlbl_cipsov4_add_std
  * tcp: seq_file: Avoid skipping sk during tcp_seek_last_pos
      net/ipv4/tcp_ipv4.c
  * Bluetooth: sco: prevent information leak in sco_conn_defer_accept()
      net/bluetooth/sco.c
    media: go7007: remove redundant initialization
    media: dvb-usb: fix uninit-value in vp702x_read_mac_addr
    media: dvb-usb: fix uninit-value in dvb_usb_adapter_dvb_init
  * soc: rockchip: ROCKCHIP_GRF should not default to y, unconditionally
      drivers/soc/rockchip/Kconfig
  * certs: Trigger creation of RSA module signing key if it's not an RSA key
      certs/Makefile
    crypto: qat - use proper type for vf_mask
    clocksource/drivers/sh_cmt: Fix wrong setting if don't request IRQ for clock source channel
    spi: spi-pic32: Fix issue with uninitialized dma_slave_config
    spi: spi-fsl-dspi: Fix issue with uninitialized dma_slave_config
    m68k: emu: Fix invalid free in nfeth_cleanup()
    udf_get_extendedattr() had no boundary checks.
    crypto: qat - do not export adf_iov_putmsg()
    crypto: qat - fix naming for init/shutdown VF to PF notifications
    crypto: qat - fix reuse of completion variable
    crypto: qat - handle both source of interrupt in VF ISR
    crypto: qat - do not ignore errors from enable_vf2pf_comms()
    libata: fix ata_host_start()
    s390/cio: add dev_busid sysfs entry for each subchannel
    power: supply: max17042_battery: fix typo in MAx17042_TOFF
    nvme-rdma: don't update queue count when failing to set io queues
    isofs: joliet: Fix iocharset=utf8 mount option
    udf: Check LVID earlier
    crypto: omap-sham - clear dma flags only after omap_sham_update_dma_stop()
    power: supply: axp288_fuel_gauge: Report register-address on readb / writeb errors
    crypto: mxs-dcp - Check for DMA mapping errors
  * regmap: fix the offset of register error log
      drivers/base/regmap/regmap.c
  * PCI: Call Max Payload Size-related fixup quirks early
      drivers/pci/quirks.c
    x86/reboot: Limit Dell Optiplex 990 quirk to early BIOS versions
    usb: host: xhci-rcar: Don't reload firmware after the completion
    Revert "btrfs: compression: don't try to compress if we don't have enough pages"
  * mm/page_alloc: speed up the iteration of max_order
      mm/page_alloc.c
    net: ll_temac: Remove left-over debug message
    powerpc/boot: Delete unneeded .globl _zimage_start
    powerpc/module64: Fix comment in R_PPC64_ENTRY handling
    crypto: talitos - reduce max key size for SEC1
    mm/kmemleak.c: make cond_resched() rate-limiting more efficient
    s390/disassembler: correct disassembly lines alignment
  * ipv4/icmp: l3mdev: Perform icmp error route lookup on source device routing table (v2)
      net/ipv4/icmp.c
    ath10k: fix recent bandwidth conversion bug
  * f2fs: fix potential overflow
      fs/f2fs/segment.c
    USB: serial: mos7720: improve OOM-handling in read_mos_reg()
  * igmp: Add ip_mc_list lock in ip_check_mc_rcu
      net/ipv4/igmp.c
    media: stkwebcam: fix memory leak in stk_camera_probe
  * clk: fix build warning for orphan_list
      drivers/clk/clk.c
  * ALSA: pcm: fix divide error in snd_pcm_lib_ioctl
      sound/core/pcm_lib.c
    ARM: 8918/2: only build return_address() if needed
  * cryptoloop: add a deprecation warning
      drivers/block/Kconfig
    perf/x86/amd/ibs: Work around erratum #1197
    perf/x86/intel/pt: Fix mask of num_address_ranges
    qede: Fix memset corruption
    net: macb: Add a NULL check on desc_ptp
    qed: Fix the VF msix vectors flow
    xtensa: fix kconfig unmet dependency warning for HAVE_FUTEX_CMPXCHG
  * ext4: fix race writing to an inline_data file while its xattrs are changing
      fs/ext4/inline.c
    Merge 4.14.246 into android-4.14-stable
Linux 4.14.246
    Revert "floppy: reintroduce O_NDELAY fix"
    KVM: X86: MMU: Use the correct inherited permissions to get shadow page
    KVM: x86/mmu: Treat NX as used (not reserved) for all !TDP shadow MMUs
  * fbmem: add margin check to fb_check_caps()
      drivers/video/fbdev/core/fbmem.c
    vt_kdsetmode: extend console locking
    net/rds: dma_map_sg is entitled to merge entries
    drm/nouveau/disp: power down unused DP links during init
  * drm: Copy drm_wait_vblank to user before returning
      drivers/gpu/drm/drm_ioc32.c
    vringh: Use wiov->used to check for read/write desc order
    virtio: Improve vq->broken access to avoid any compiler optimization
  * opp: remove WARN when no valid OPPs remain
      drivers/base/power/opp/of.c
    usb: gadget: u_audio: fix race condition on endpoint stop
    net: marvell: fix MVNETA_TX_IN_PRGRS bit number
    xgene-v2: Fix a resource leak in the error handling path of 'xge_probe()'
    ip_gre: add validation for csum_start
    e1000e: Fix the max snoop/no-snoop latency for 10M
    IB/hfi1: Fix possible null-pointer dereference in _extend_sdma_tx_descs()
  * usb: dwc3: gadget: Stop EP0 transfers during pullup disable
      drivers/usb/dwc3/gadget.c
  * usb: dwc3: gadget: Fix dwc3_calc_trbs_left()
      drivers/usb/dwc3/gadget.c
    USB: serial: option: add new VID/PID to support Fibocom FG150
    Revert "USB: serial: ch341: fix character loss at high transfer rates"
    can: usb: esd_usb2: esd_usb2_rx_event(): fix the interchange of the CAN RX and TX error counters
    ARC: Fix CONFIG_STACKDEPOT

Bug: 201722185
Change-Id: I2c4ee49c83eba8a514111e798114a2779c029203
Signed-off-by: Wilson Sung <wilsonsung@google.com>
2021-10-27 08:44:02 +00:00
David Hildenbrand
699bbc8bca mm/memory_hotplug: use "unsigned long" for PFN in zone_for_pfn_range()
commit 7cf209ba8a86410939a24cb1aeb279479a7e0ca6 upstream.

Patch series "mm/memory_hotplug: preparatory patches for new online policy and memory"

These are all cleanups and one fix previously sent as part of [1]:
[PATCH v1 00/12] mm/memory_hotplug: "auto-movable" online policy and memory
groups.

These patches make sense even without the other series, therefore I pulled
them out to make the other series easier to digest.

[1] https://lkml.kernel.org/r/20210607195430.48228-1-david@redhat.com

This patch (of 4):

Checkpatch complained on a follow-up patch that we are using "unsigned"
here, which defaults to "unsigned int" and checkpatch is correct.

As we will search for a fitting zone using the wrong pfn, we might end
up onlining memory to one of the special kernel zones, such as ZONE_DMA,
which can end badly as the onlined memory does not satisfy properties of
these zones.

Use "unsigned long" instead, just as we do in other places when handling
PFNs.  This can bite us once we have physical addresses in the range of
multiple TB.

Link: https://lkml.kernel.org/r/20210712124052.26491-2-david@redhat.com
Fixes: e5e6893026 ("mm, memory_hotplug: display allowed zones in the preferred ordering")
Signed-off-by: David Hildenbrand <david@redhat.com>
Reviewed-by: Pankaj Gupta <pankaj.gupta@ionos.com>
Reviewed-by: Muchun Song <songmuchun@bytedance.com>
Reviewed-by: Oscar Salvador <osalvador@suse.de>
Cc: David Hildenbrand <david@redhat.com>
Cc: Vitaly Kuznetsov <vkuznets@redhat.com>
Cc: "Michael S. Tsirkin" <mst@redhat.com>
Cc: Jason Wang <jasowang@redhat.com>
Cc: Pankaj Gupta <pankaj.gupta.linux@gmail.com>
Cc: Wei Yang <richard.weiyang@linux.alibaba.com>
Cc: Michal Hocko <mhocko@kernel.org>
Cc: Dan Williams <dan.j.williams@intel.com>
Cc: Anshuman Khandual <anshuman.khandual@arm.com>
Cc: Dave Hansen <dave.hansen@linux.intel.com>
Cc: Vlastimil Babka <vbabka@suse.cz>
Cc: Mike Rapoport <rppt@kernel.org>
Cc: "Rafael J. Wysocki" <rjw@rjwysocki.net>
Cc: Len Brown <lenb@kernel.org>
Cc: Pavel Tatashin <pasha.tatashin@soleen.com>
Cc: Heiko Carstens <hca@linux.ibm.com>
Cc: Michael Ellerman <mpe@ellerman.id.au>
Cc: Catalin Marinas <catalin.marinas@arm.com>
Cc: virtualization@lists.linux-foundation.org
Cc: Andy Lutomirski <luto@kernel.org>
Cc: "Aneesh Kumar K.V" <aneesh.kumar@linux.ibm.com>
Cc: Anton Blanchard <anton@ozlabs.org>
Cc: Ard Biesheuvel <ardb@kernel.org>
Cc: Baoquan He <bhe@redhat.com>
Cc: Benjamin Herrenschmidt <benh@kernel.crashing.org>
Cc: Borislav Petkov <bp@alien8.de>
Cc: Christian Borntraeger <borntraeger@de.ibm.com>
Cc: Christophe Leroy <christophe.leroy@c-s.fr>
Cc: Dave Jiang <dave.jiang@intel.com>
Cc: "H. Peter Anvin" <hpa@zytor.com>
Cc: Ingo Molnar <mingo@redhat.com>
Cc: Jia He <justin.he@arm.com>
Cc: Joe Perches <joe@perches.com>
Cc: Kefeng Wang <wangkefeng.wang@huawei.com>
Cc: Laurent Dufour <ldufour@linux.ibm.com>
Cc: Michel Lespinasse <michel@lespinasse.org>
Cc: Nathan Lynch <nathanl@linux.ibm.com>
Cc: Nicholas Piggin <npiggin@gmail.com>
Cc: Paul Mackerras <paulus@samba.org>
Cc: Peter Zijlstra <peterz@infradead.org>
Cc: Pierre Morel <pmorel@linux.ibm.com>
Cc: "Rafael J. Wysocki" <rafael.j.wysocki@intel.com>
Cc: Rich Felker <dalias@libc.org>
Cc: Scott Cheloha <cheloha@linux.ibm.com>
Cc: Sergei Trofimovich <slyfox@gentoo.org>
Cc: Thiago Jung Bauermann <bauerman@linux.ibm.com>
Cc: Thomas Gleixner <tglx@linutronix.de>
Cc: Vasily Gorbik <gor@linux.ibm.com>
Cc: Vishal Verma <vishal.l.verma@intel.com>
Cc: Will Deacon <will@kernel.org>
Cc: Yoshinori Sato <ysato@users.sourceforge.jp>
Cc: <stable@vger.kernel.org>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
Signed-off-by: David Hildenbrand <david@redhat.com>
Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
2021-09-22 11:45:34 +02:00
lucaswei
c897920710 Merge android-4.14 (4.14.160) into android-msm-floral-4.14-lts
Merge 4.14.160 into android-4.14
Linux 4.14.160
    net: stmmac: don't stop NAPI processing when dropping a packet
    net: stmmac: use correct DMA buffer size in the RX descriptor
  * xhci: fix USB3 device initiated resume race with roothub autosuspend
      drivers/usb/host/xhci-hub.c
      drivers/usb/host/xhci-ring.c
    drm/radeon: fix r1xx/r2xx register checker for POT textures
    scsi: iscsi: Fix a potential deadlock in the timeout handler
    dm btree: increase rebalance threshold in __rebalance2()
  * dma-buf: Fix memory leak in sync_file_merge()
      drivers/dma-buf/sync_file.c
    vfio/pci: call irq_bypass_unregister_producer() before freeing irq
    ARM: tegra: Fix FLOW_CTLR_HALT register clobbering by tegra_resume()
    ARM: dts: s3c64xx: Fix init order of clock providers
    CIFS: Respect O_SYNC and O_DIRECT flags during reconnect
  * rpmsg: glink: Free pending deferred work on remove
      drivers/rpmsg/qcom_glink_native.c
  * rpmsg: glink: Don't send pending rx_done during remove
      drivers/rpmsg/qcom_glink_native.c
  * rpmsg: glink: Fix rpmsg_register_device err handling
      drivers/rpmsg/qcom_glink_native.c
  * rpmsg: glink: Put an extra reference during cleanup
      drivers/rpmsg/qcom_glink_native.c
  * rpmsg: glink: Fix use after free in open_ack TIMEOUT case
      drivers/rpmsg/qcom_glink_native.c
  * rpmsg: glink: Fix reuse intents memory leak issue
      drivers/rpmsg/qcom_glink_native.c
  * rpmsg: glink: Set tail pointer to 0 at end of FIFO
      drivers/rpmsg/qcom_glink_smem.c
    xtensa: fix TLB sanity checker
  * PCI: Apply Cavium ACS quirk to ThunderX2 and ThunderX3
      drivers/pci/quirks.c
  * PCI/MSI: Fix incorrect MSI-X masking on resume
      drivers/pci/msi.c
  * PCI: Fix Intel ACS quirk UPDCR register address
      drivers/pci/quirks.c
  * PCI/PM: Always return devices to D0 when thawing
      drivers/pci/pci-driver.c
  * Revert "regulator: Defer init completion for a while after late_initcall"
      drivers/regulator/core.c
    nvme: host: core: fix precedence of ternary operator
  * inet: protect against too small mtu values.
      include/linux/netdevice.h
      include/net/ip.h
      net/core/dev.c
      net/ipv4/devinet.c
      net/ipv4/ip_output.c
  * tcp: Protect accesses to .ts_recent_stamp with {READ,WRITE}_ONCE()
      include/net/tcp.h
  * tcp: tighten acceptance of ACKs not matching a child socket
      include/net/tcp.h
  * tcp: fix rejected syncookies due to stale timestamps
      include/linux/time.h
      include/net/tcp.h
    tipc: fix ordering of tipc module init and exit routine
  * tcp: md5: fix potential overestimation of TCP option space
      net/ipv4/tcp_output.c
    openvswitch: support asymmetric conntrack
    net: thunderx: start phy before starting autonegotiation
    net: ethernet: ti: cpsw: fix extra rx interrupt
  * net: dsa: fix flow dissection on Tx path
      net/core/flow_dissector.c
  * net: bridge: deny dev_set_mac_address() when unregistering
      net/bridge/br_device.c
    ANDROID: cuttlefish_defconfig: Enable CONFIG_GNSS_CMDLINE_SERIAL
  * ANDROID: gnss: Add command line test driver
      drivers/gnss/Kconfig
  * ANDROID: serdev: add platform device support
      drivers/tty/serdev/core.c
    ANDROID: cuttlefish_defconfig: set BINFMT_MISC
  * UPSTREAM: binder: fix incorrect calculation for num_valid
      drivers/android/binder.c
  * ANDROID: kbuild: disable clang-specific configs with other compilers
      Makefile
    Merge 4.14.159 into android-4.14
Linux 4.14.159
    of: unittest: fix memory leak in attach_node_and_children
    raid5: need to set STRIPE_HANDLE for batch head
    gpiolib: acpi: Add Terra Pad 1061 to the run_edge_events_on_boot_blacklist
  * kernel/module.c: wakeup processes in module_wq on module unload
      kernel/module.c
    gfs2: fix glock reference problem in gfs2_trans_remove_revoke
    net/mlx5e: Fix SFF 8472 eeprom length
    sunrpc: fix crash when cache_head become valid before update
  * workqueue: Fix missing kfree(rescuer) in destroy_workqueue()
      kernel/workqueue.c
  * blk-mq: make sure that line break can be printed
      block/blk-mq-sysfs.c
    mfd: rk808: Fix RK818 ID template
  * ext4: fix a bug in ext4_wait_for_tail_page_commit
      fs/ext4/inode.c
  * mm/shmem.c: cast the type of unmap_start to u64
      mm/shmem.c
  * firmware: qcom: scm: Ensure 'a0' status code is treated as signed
      drivers/firmware/qcom_scm-64.c
  * ext4: work around deleting a file with i_nlink == 0 safely
      fs/ext4/namei.c
    powerpc: Fix vDSO clock_getres()
    powerpc: Avoid clang warnings around setjmp and longjmp
    ath10k: fix fw crash by moving chip reset after napi disabled
    media: vimc: fix component match compare
    mlxsw: spectrum_router: Refresh nexthop neighbour when it becomes dead
    power: supply: cpcap-battery: Fix signed counter sample register
    x86/MCE/AMD: Carve out the MC4_MISC thresholding quirk
    x86/MCE/AMD: Turn off MC4_MISC thresholding on all family 0x15 models
    e100: Fix passing zero to 'PTR_ERR' warning in e100_load_ucode_wait
    drbd: Change drbd_request_detach_interruptible's return type to int
    scsi: lpfc: Correct code setting non existent bits in sli4 ABORT WQE
    scsi: lpfc: Cap NPIV vports to 256
    omap: pdata-quirks: remove openpandora quirks for mmc3 and wl1251
    phy: renesas: rcar-gen3-usb2: Fix sysfs interface of "role"
    iio: adis16480: Add debugfs_reg_access entry
  * xhci: make sure interrupts are restored to correct state
      drivers/usb/host/xhci-hub.c
  * xhci: Fix memory leak in xhci_add_in_port()
      drivers/usb/host/xhci-mem.c
    scsi: qla2xxx: Fix message indicating vectors used by driver
    scsi: qla2xxx: Always check the qla2x00_wait_for_hba_online() return value
    scsi: qla2xxx: Fix qla24xx_process_bidir_cmd()
    scsi: qla2xxx: Fix session lookup in qlt_abort_work()
    scsi: qla2xxx: Fix DMA unmap leak
    scsi: zfcp: trace channel log even for FCP command responses
  * block: fix single range discard merge
      block/blk-merge.c
    reiserfs: fix extended attributes on the root directory
  * ext4: Fix credit estimate for final inode freeing
      fs/ext4/inode.c
  * quota: fix livelock in dquot_writeback_dquots
      fs/quota/dquot.c
    ext2: check err when partial != NULL
  * quota: Check that quota is not dirty before release
      fs/quota/dquot.c
      include/linux/quotaops.h
  * video/hdmi: Fix AVI bar unpack
      drivers/video/hdmi.c
    powerpc/xive: Skip ioremap() of ESB pages for LSI interrupts
    powerpc: Allow flush_icache_range to work across ranges >4GB
    powerpc/xive: Prevent page fault issues in the machine crash handler
    powerpc: Allow 64bit VDSO __kernel_sync_dicache to work across ranges >4GB
    ppdev: fix PPGETTIME/PPSETTIME ioctls
    ARM: dts: omap3-tao3530: Fix incorrect MMC card detection GPIO polarity
    mmc: host: omap_hsmmc: add code for special init of wl1251 to get rid of pandora_wl1251_init_card
    pinctrl: samsung: Fix device node refcount leaks in S3C64xx wakeup controller init
    pinctrl: samsung: Fix device node refcount leaks in init code
    pinctrl: samsung: Fix device node refcount leaks in S3C24xx wakeup controller init
    pinctrl: samsung: Add of_node_put() before return in error path
    ACPI: PM: Avoid attaching ACPI PM domain to certain devices
    ACPI: bus: Fix NULL pointer check in acpi_bus_get_private_data()
    ACPI: OSL: only free map once in osl.c
    cpufreq: powernv: fix stack bloat and hard limit on number of CPUs
  * PM / devfreq: Lock devfreq in trans_stat_show
      drivers/devfreq/devfreq.c
    intel_th: pci: Add Tiger Lake CPU support
    intel_th: pci: Add Ice Lake CPU support
    intel_th: Fix a double put_device() in error path
  * cpuidle: Do not unset the driver if it is there already
      drivers/cpuidle/driver.c
  * media: cec.h: CEC_OP_REC_FLAG_ values were swapped
      include/uapi/linux/cec.h
    media: radio: wl1273: fix interrupt masking on release
    media: bdisp: fix memleak on release
    s390/mm: properly clear _PAGE_NOEXEC bit when it is not supported
    ar5523: check NULL before memcpy() in ar5523_cmd()
    cgroup: pids: use atomic64_t for pids->limit
  * blk-mq: avoid sysfs buffer overflow with too many CPU cores
      block/blk-mq-sysfs.c
  * ASoC: Jack: Fix NULL pointer dereference in snd_soc_jack_report
      sound/soc/soc-jack.c
  * workqueue: Fix pwq ref leak in rescuer_thread()
      kernel/workqueue.c
  * workqueue: Fix spurious sanity check failures in destroy_workqueue()
      kernel/workqueue.c
    dm zoned: reduce overhead of backing device checks
    hwrng: omap - Fix RNG wait loop timeout
    watchdog: aspeed: Fix clock behaviour for ast2600
    md/raid0: Fix an error message in raid0_make_request()
    ALSA: hda - Fix pending unsol events at shutdown
  * ovl: relax WARN_ON() on rename to self
      fs/overlayfs/dir.c
    lib: raid6: fix awk build warnings
    rtlwifi: rtl8192de: Fix missing enable interrupt flag
    rtlwifi: rtl8192de: Fix missing callback that tests for hw release of buffer
    rtlwifi: rtl8192de: Fix missing code to retrieve RX buffer address
    btrfs: record all roots for rename exchange on a subvol
    Btrfs: send, skip backreference walking for extents with many references
    btrfs: Remove btrfs_bio::flags member
    Btrfs: fix negative subv_writers counter and data space leak after buffered write
    btrfs: use refcount_inc_not_zero in kill_all_nodes
    btrfs: check page->mapping when loading free space cache
  * usb: dwc3: ep0: Clear started flag on completion
      drivers/usb/dwc3/ep0.c
    virtio-balloon: fix managed page counts when migrating pages between zones
    mtd: spear_smi: Fix Write Burst mode
    tpm: add check after commands attribs tab allocation
    usb: mon: Fix a deadlock in usbmon between mmap and read
  * usb: core: urb: fix URB structure initialization function
      drivers/usb/core/urb.c
    USB: adutux: fix interface sanity check
    USB: serial: io_edgeport: fix epic endpoint lookup
    USB: idmouse: fix interface sanity checks
    USB: atm: ueagle-atm: add missing endpoint check
    iio: humidity: hdc100x: fix IIO_HUMIDITYRELATIVE channel reporting
    ARM: dts: pandora-common: define wl1251 as child node of mmc3
  * xhci: handle some XHCI_TRUST_TX_LENGTH quirks cases as default behaviour.
      drivers/usb/host/xhci-ring.c
  * xhci: Increase STS_HALT timeout in xhci_suspend()
      drivers/usb/host/xhci.c
  * usb: xhci: only set D3hot for pci device
      drivers/usb/host/xhci-pci.c
      drivers/usb/host/xhci.c
      drivers/usb/host/xhci.h
    staging: gigaset: add endpoint-type sanity check
    staging: gigaset: fix illegal free on probe errors
    staging: gigaset: fix general protection fault on probe
    staging: rtl8712: fix interface sanity check
    staging: rtl8188eu: fix interface sanity check
  * usb: Allow USB device to be warm reset in suspended state
      drivers/usb/core/hub.c
    USB: documentation: flags on usb-storage versus UAS
    USB: uas: heed CAPACITY_HEURISTICS
    USB: uas: honor flag to avoid CAPACITY16
    media: venus: remove invalid compat_ioctl32 handler
    scsi: qla2xxx: Fix driver unload hang
    usb: gadget: pch_udc: fix use after free
  * usb: gadget: configfs: Fix missing spin_lock_init()
      drivers/usb/gadget/configfs.c
    appletalk: Set error code if register_snap_client failed
  * appletalk: Fix potential NULL pointer dereference in unregister_snap_client
      include/linux/atalk.h
    KVM: x86: fix out-of-bounds write in KVM_GET_EMULATED_CPUID (CVE-2019-19332)
    ASoC: rsnd: fixup MIX kctrl registration
  * binder: Handle start==NULL in binder_update_page_range()
      drivers/android/binder_alloc.c
  * thermal: Fix deadlock in thermal thermal_zone_device_check
      drivers/thermal/thermal_core.c
    iomap: Fix pipe page leakage during splicing
    RDMA/qib: Validate ->show()/store() callbacks before calling them
    spi: atmel: Fix CS high support
    crypto: user - fix memory leak in crypto_report
  * crypto: ecdh - fix big endian bug in ECC library
      crypto/ecc.c
    crypto: ccp - fix uninitialized list head
    crypto: af_alg - cast ki_complete ternary op to int
    crypto: crypto4xx - fix double-free in crypto4xx_destroy_sdr
    KVM: x86: fix presentation of TSX feature in ARCH_CAPABILITIES
    KVM: x86: do not modify masked bits of shared MSRs
    KVM: arm/arm64: vgic: Don't rely on the wrong pending table
    drm/i810: Prevent underflow in ioctl
  * jbd2: Fix possible overflow in jbd2_log_space_left()
      include/linux/jbd2.h
  * kernfs: fix ino wrap-around detection
      fs/kernfs/dir.c
      include/linux/kernfs.h
    can: slcan: Fix use-after-free Read in slcan_open
  * tty: vt: keyboard: reject invalid keycodes
      drivers/tty/vt/keyboard.c
    CIFS: Fix SMB2 oplock break processing
    CIFS: Fix NULL-pointer dereference in smb2_push_mandatory_locks
    x86/PCI: Avoid AMD FCH XHCI USB PME# from D0 defect
    Input: Fix memory leak in psxpad_spi_probe
    coresight: etm4x: Fix input validation for sysfs.
    Input: goodix - add upside-down quirk for Teclast X89 tablet
    Input: synaptics-rmi4 - don't increment rmiaddr for SMBus transfers
    Input: synaptics-rmi4 - re-enable IRQs in f34v7_do_reflash
    Input: synaptics - switch another X1 Carbon 6 to RMI/SMbus
    ALSA: hda - Add mute led support for HP ProBook 645 G4
    ALSA: pcm: oss: Avoid potential buffer overflows
    ALSA: hda/realtek - Dell headphone has noise on unmute for ALC236
  * fuse: verify attributes
      fs/fuse/dir.c
      fs/fuse/fuse_i.h
  * fuse: verify nlink
      fs/fuse/dir.c
  * sched/fair: Scale bandwidth quota and period without losing quota/period ratio precision
      kernel/sched/fair.c
  * tcp: exit if nothing to retransmit on RTO timeout
      net/ipv4/tcp_timer.c
    net: aquantia: fix RSS table and key sizes
    media: vimc: fix start stream when link is disabled
    ARM: dts: sunxi: Fix PMU compatible strings
    usb: mtu3: fix dbginfo in qmu_tx_zlp_error_handler
    mlx4: Use snprintf instead of complicated strcpy
    IB/hfi1: Close VNIC sdma_progress sleep window
    IB/hfi1: Ignore LNI errors before DC8051 transitions to Polling state
    mlxsw: spectrum_router: Relax GRE decap matching check
  * firmware: qcom: scm: fix compilation error when disabled
      include/linux/qcom_scm.h
    media: stkwebcam: Bugfix for wrong return values
  * tty: Don't block on IO when ldisc change is pending
      drivers/tty/n_tty.c
      drivers/tty/tty_ldisc.c
      include/linux/tty.h
    nfsd: Return EPERM, not EACCES, in some SETATTR cases
    MIPS: OCTEON: cvmx_pko_mem_debug8: use oldest forward compatible definition
    clk: renesas: r8a77995: Correct parent clock of DU
    powerpc/math-emu: Update macros from GCC
  * pstore/ram: Avoid NULL deref in ftrace merging failure path
      fs/pstore/ram.c
    net/mlx4_core: Fix return codes of unsupported operations
    dlm: fix invalid cluster name warning
    ARM: dts: realview: Fix some more duplicate regulator nodes
    clk: sunxi-ng: h3/h5: Fix CSI_MCLK parent
    ARM: dts: pxa: clean up USB controller nodes
    mtd: fix mtd_oobavail() incoherent returned value
  * kbuild: fix single target build for external module
      Makefile
  * modpost: skip ELF local symbols during section mismatch check
      scripts/mod/modpost.c
  * tcp: fix SNMP TCP timeout under-estimation
      net/ipv4/tcp_timer.c
  * tcp: fix SNMP under-estimation on failed retransmission
      net/ipv4/tcp_output.c
  * tcp: fix off-by-one bug on aborting window-probing socket
      net/ipv4/tcp_timer.c
    ARM: dts: realview-pbx: Fix duplicate regulator nodes
    ARM: dts: mmp2: fix the gpio interrupt cell number
    net/x25: fix null_x25_address handling
    net/x25: fix called/calling length calculation in x25_parse_address_block
    arm64: dts: meson-gxl-khadas-vim: fix GPIO lines names
    arm64: dts: meson-gxbb-odroidc2: fix GPIO lines names
    arm64: dts: meson-gxbb-nanopi-k2: fix GPIO lines names
    arm64: dts: meson-gxl-libretech-cc: fix GPIO lines names
    ARM: OMAP1/2: fix SoC name printing
    ASoC: au8540: use 64-bit arithmetic instead of 32-bit
    nfsd: fix a warning in __cld_pipe_upcall()
    ARM: debug: enable UART1 for socfpga Cyclone5
    dlm: NULL check before kmem_cache_destroy is not needed
    ARM: dts: sun8i: v3s: Change pinctrl nodes to avoid warning
    ARM: dts: sun5i: a10s: Fix HDMI output DTC warning
    ASoC: rsnd: tidyup registering method for rsnd_kctrl_new()
    lockd: fix decoding of TEST results
    i2c: imx: don't print error message on probe defer
    serial: imx: fix error handling in console_setup
    altera-stapl: check for a null key before strcasecmp'ing it
  * dma-mapping: fix return type of dma_set_max_seg_size()
      include/linux/dma-mapping.h
    sparc: Correct ctx->saw_frame_pointer logic.
  * f2fs: fix to allow node segment for GC by ioctl path
      fs/f2fs/gc.c
    ARM: dts: rockchip: Assign the proper GPIO clocks for rv1108
    ARM: dts: rockchip: Fix the PMU interrupt number for rv1108
  * f2fs: change segment to section in f2fs_ioc_gc_range
      fs/f2fs/file.c
  * f2fs: fix count of seg_freed to make sec_freed correct
      fs/f2fs/gc.c
  * ACPI: fix acpi_find_child_device() invocation in acpi_preset_companion()
      include/linux/acpi.h
  * usb: dwc3: don't log probe deferrals; but do log other error codes
      drivers/usb/dwc3/core.c
  * usb: dwc3: debugfs: Properly print/set link state for HS
      drivers/usb/dwc3/debug.h
      drivers/usb/dwc3/debugfs.c
    dmaengine: dw-dmac: implement dma protection control setting
    dmaengine: coh901318: Remove unused variable
    dmaengine: coh901318: Fix a double-lock bug
    media: cec: report Vendor ID after initialization
    media: pulse8-cec: return 0 when invalidating the logical address
    ARM: dts: exynos: Use Samsung SoC specific compatible for DWC2 module
    rtc: dt-binding: abx80x: fix resistance scale
    rtc: max8997: Fix the returned value in case of error in 'max8997_rtc_read_alarm()'
    math-emu/soft-fp.h: (_FP_ROUND_ZERO) cast 0 to void to fix warning
    net/smc: use after free fix in smc_wr_tx_put_slot()
    MIPS: OCTEON: octeon-platform: fix typing
    iomap: sub-block dio needs to zeroout beyond EOF
    net-next/hinic:fix a bug in set mac address
  * regulator: Fix return value of _set_load() stub
      include/linux/regulator/consumer.h
    clk: rockchip: fix ID of 8ch clock of I2S1 for rk3328
    clk: rockchip: fix I2S1 clock gate register for rk3328
  * mm/vmstat.c: fix NUMA statistics updates
      mm/vmstat.c
    Staging: iio: adt7316: Fix i2c data reading, set the data field
    pinctrl: qcom: ssbi-gpio: fix gpio-hog related boot issues
    crypto: bcm - fix normal/non key hash algorithm failure
  * crypto: ecc - check for invalid values in the key verification test
      crypto/ecc.c
    scsi: zfcp: drop default switch case which might paper over missing case
    net: dsa: mv88e6xxx: Work around mv886e6161 SERDES missing MII_PHYSID2
    MIPS: SiByte: Enable ZONE_DMA32 for LittleSur
    dlm: fix missing idr_destroy for recover_idr
    ARM: dts: rockchip: Fix rk3288-rock2 vcc_flash name
    clk: rockchip: fix rk3188 sclk_mac_lbtest parameter ordering
    clk: rockchip: fix rk3188 sclk_smc gate data
    i40e: don't restart nway if autoneg not supported
    rtc: s3c-rtc: Avoid using broken ALMYEAR register
    net: ethernet: ti: cpts: correct debug for expired txq skb
    extcon: max8997: Fix lack of path setting in USB device mode
    dlm: fix possible call to kfree() for non-initialized pointer
    clk: sunxi-ng: a64: Fix gate bit of DSI DPHY
    net/mlx5: Release resource on error flow
    ARM: 8813/1: Make aligned 2-byte getuser()/putuser() atomic on ARMv6+
    iwlwifi: mvm: Send non offchannel traffic via AP sta
    iwlwifi: mvm: synchronize TID queue removal
    cxgb4vf: fix memleak in mac_hlist initialization
  * serial: core: Allow processing sysrq at port unlock time
      include/linux/serial_core.h
  * i2c: core: fix use after free in of_i2c_notify
      drivers/i2c/i2c-core-of.c
    net: ep93xx_eth: fix mismatch of request_mem_region in remove
    rsxx: add missed destroy_workqueue calls in remove
  * ALSA: pcm: Fix stream lock usage in snd_pcm_period_elapsed()
      sound/core/pcm_lib.c
  * sched/core: Avoid spurious lock dependencies
      kernel/sched/core.c
    Input: cyttsp4_core - fix use after free bug
  * xfrm: release device reference for invalid state
      net/xfrm/xfrm_input.c
    NFC: nxp-nci: Fix NULL pointer dereference after I2C communication error
    audit_get_nd(): don't unlock parent too early
  * exportfs_decode_fh(): negative pinned may become positive without the parent locked
      fs/exportfs/expfs.c
    iwlwifi: pcie: don't consider IV len in A-MSDU
    RDMA/hns: Correct the value of HNS_ROCE_HEM_CHUNK_LEN
    autofs: fix a leak in autofs_expire_indirect()
    serial: ifx6x60: add missed pm_runtime_disable
  * serial: serial_core: Perform NULL checks for break_ctl ops
      drivers/tty/serial/serial_core.c
    serial: pl011: Fix DMA ->flush_buffer()
    tty: serial: msm_serial: Fix flow control
    tty: serial: fsl_lpuart: use the sg count from dma_map_sg
    usb: gadget: u_serial: add missing port entry locking
    arm64: tegra: Fix 'active-low' warning for Jetson TX1 regulator
    rsi: release skb if rsi_prepare_beacon fails
  * ANDROID: staging: android: ion: Fix build when CONFIG_ION_SYSTEM_HEAP=n
      drivers/staging/android/ion/ion.h
    Merge remote-tracking branch 'origin/upstream-f2fs-stable-linux-4.14.y/v5.5-rc1' into android-4.14
  * ANDROID: staging: android: ion: Expose total heap and pool sizes via sysfs
      drivers/staging/android/ion/ion.c
      drivers/staging/android/ion/ion.h
      drivers/staging/android/ion/ion_page_pool.c
  * UPSTREAM: include/linux/slab.h: fix sparse warning in kmalloc_type()
      include/linux/slab.h
  * UPSTREAM: mm, slab: shorten kmalloc cache names for large sizes
      mm/slab_common.c
  * UPSTREAM: mm, proc: add KReclaimable to /proc/meminfo
      fs/proc/meminfo.c
  * BACKPORT: mm: rename and change semantics of nr_indirectly_reclaimable_bytes
      drivers/staging/android/ion/ion_page_pool.c
      include/linux/mmzone.h
      mm/page_alloc.c
      mm/util.c
      mm/vmstat.c
  * UPSTREAM: dcache: allocate external names from reclaimable kmalloc caches
      fs/dcache.c
  * BACKPORT: mm, slab/slub: introduce kmalloc-reclaimable caches
      include/linux/slab.h
      mm/slab_common.c
  * UPSTREAM: mm, slab: combine kmalloc_caches and kmalloc_dma_caches
      include/linux/slab.h
      mm/slab_common.c
      mm/slub.c
  * ANDROID: kbuild: disable SCS by default in allmodconfig
      arch/Kconfig
      arch/arm64/Kconfig
    ANDROID: arm64: cuttlefish_defconfig: enable LTO, CFI, and SCS
  * BACKPORT: FROMLIST: arm64: implement Shadow Call Stack
      arch/arm64/Kconfig
      arch/arm64/include/asm/scs.h
      arch/arm64/include/asm/stacktrace.h
      arch/arm64/include/asm/thread_info.h
      arch/arm64/kernel/Makefile
      arch/arm64/kernel/asm-offsets.c
      arch/arm64/kernel/entry.S
      arch/arm64/kernel/head.S
      arch/arm64/kernel/irq.c
      arch/arm64/kernel/process.c
      arch/arm64/kernel/scs.c
      arch/arm64/kernel/smp.c
    FROMLIST: arm64: disable SCS for hypervisor code
  * BACKPORT: FROMLIST: arm64: vdso: disable Shadow Call Stack
      arch/arm64/kernel/vdso/Makefile
  * FROMLIST: arm64: preserve x18 when CPU is suspended
      arch/arm64/include/asm/suspend.h
      arch/arm64/mm/proc.S
  * FROMLIST: arm64: reserve x18 from general allocation with SCS
      arch/arm64/Makefile
  * FROMLIST: arm64: disable function graph tracing with SCS
      arch/arm64/Kconfig
  * FROMLIST: scs: add support for stack usage debugging
      kernel/scs.c
  * FROMLIST: scs: add accounting
      fs/proc/meminfo.c
      include/linux/mmzone.h
      kernel/scs.c
      mm/page_alloc.c
      mm/vmstat.c
  * FROMLIST: add support for Clang's Shadow Call Stack (SCS)
      Makefile
      arch/Kconfig
      include/linux/compiler-clang.h
      include/linux/compiler_types.h
      include/linux/scs.h
      init/init_task.c
      kernel/Makefile
      kernel/fork.c
      kernel/sched/core.c
      kernel/scs.c
    FROMLIST: arm64: kernel: avoid x18 in __cpu_soft_restart
    FROMLIST: arm64: kvm: stop treating register x18 as caller save
  * FROMLIST: arm64/lib: copy_page: avoid x18 register in assembler code
      arch/arm64/lib/copy_page.S
  * FROMLIST: arm64: mm: avoid x18 in idmap_kpti_install_ng_mappings
      arch/arm64/mm/proc.S
  * ANDROID: use non-canonical CFI jump tables
      Makefile
  * ANDROID: arm64: add __nocfi to __apply_alternatives
      arch/arm64/kernel/alternative.c
  * ANDROID: arm64: add __pa_function
      arch/arm64/include/asm/memory.h
      arch/arm64/include/asm/mmu_context.h
      arch/arm64/kernel/cpufeature.c
      arch/arm64/kernel/psci.c
      arch/arm64/kernel/smp_spin_table.c
  * ANDROID: arm64: allow ThinLTO to be selected
      arch/arm64/Kconfig
  * ANDROID: soc/tegra: disable ARCH_TEGRA_210_SOC with LTO
      drivers/soc/tegra/Kconfig
  * FROMLIST: arm64: fix alternatives with LLVM's integrated assembler
      arch/arm64/include/asm/alternative.h
  * ANDROID: irqchip/gic-v3: rename gic_of_init to work around a ThinLTO+CFI bug
      drivers/irqchip/irq-gic-v3.c
  * ANDROID: kbuild: limit LTO inlining
      Makefile
  * ANDROID: kbuild: merge module sections with LTO
      Makefile
  * ANDROID: init: ensure initcall ordering with LTO
      include/linux/init.h
      scripts/generate_initcall_order.pl
      scripts/link-vmlinux.sh
  * Revert "ANDROID: HACK: init: ensure initcall ordering with LTO"
      include/linux/init.h
  * ANDROID: add support for ThinLTO
      Makefile
      arch/Kconfig
      scripts/Makefile.build
    ANDROID: Switch to LLD
    ANDROID: clang: update to 10.0.1
  * ANDROID: arm64: add atomic_ll_sc.o to obj-y if using lld
      arch/arm64/lib/Makefile
  * ANDROID: enable ARM64_ERRATUM_843419 by default with LTO_CLANG
      arch/arm64/Kconfig
  * ANDROID: kbuild: allow lld to be used with CONFIG_LTO_CLANG
      Makefile
  * ANDROID: Makefile: set -Qunused-arguments sooner
      Makefile
  * BACKPORT: FROMLIST: Makefile: lld: tell clang to use lld
      Makefile
  * BACKPORT: FROMLIST: Makefile: lld: set -O2 linker flag when linking with LLD
      Makefile
  * ANDROID: scripts/Kbuild: add ld-name support for ld.lld
      scripts/Kbuild.include
  * UPSTREAM: bpf: permit multiple bpf attachments for a single perf event
      include/linux/bpf.h
      include/linux/trace_events.h
      include/trace/perf.h
      kernel/bpf/core.c
      kernel/events/core.c
      kernel/trace/bpf_trace.c
      kernel/trace/trace_uprobe.c
  * UPSTREAM: bpf: use the same condition in perf event set/free bpf handler
      kernel/events/core.c
  * UPSTREAM: bpf: multi program support for cgroup+bpf
      include/linux/bpf-cgroup.h
      include/linux/bpf.h
      include/linux/filter.h
      include/uapi/linux/bpf.h
      kernel/bpf/cgroup.c
      kernel/bpf/core.c
      kernel/bpf/syscall.c
      kernel/cgroup/cgroup.c
  * BACKPORT: serdev: make synchronous write return bytes written
      drivers/tty/serdev/core.c
    UPSTREAM: gnss: serial: fix synchronous write timeout
    UPSTREAM: gnss: fix potential error pointer dereference
    BACKPORT: gnss: add receiver type support
    UPSTREAM: dt-bindings: add generic gnss binding
  * UPSTREAM: gnss: add generic serial driver
      drivers/gnss/Kconfig
    ANDROID: cuttlefish_defconfig: Enable CONFIG_SERIAL_DEV_BUS
    ANDROID: cuttlefish_defconfig: Enable CONFIG_GNSS
  * BACKPORT: gnss: add GNSS receiver subsystem
      drivers/Kconfig
      drivers/Makefile
      drivers/gnss/Kconfig
  * UPSTREAM: arm64: Validate tagged addresses in access_ok() called from kernel threads
      arch/arm64/include/asm/uaccess.h
    BACKPORT: ARM: 8905/1: Emit __gnu_mcount_nc when using Clang 10.0.0 or newer
    Merge 4.14.158 into android-4.14
Linux 4.14.158
    net: fec: fix clock count mis-match
    platform/x86: hp-wmi: Fix ACPI errors caused by passing 0 as input size
    platform/x86: hp-wmi: Fix ACPI errors caused by too small buffer
    ASoC: stm32: i2s: fix IRQ clearing
    ASoC: stm32: i2s: fix 16 bit format support
    ASoC: stm32: i2s: fix dma configuration
    pinctrl: stm32: fix memory leak issue
    mailbox: mailbox-test: fix null pointer if no mmio
    hwrng: stm32 - fix unbalanced pm_runtime_enable
    media: stm32-dcmi: fix DMA corruption when stopping streaming
    crypto: stm32/hash - Fix hmac issue more than 256 bytes
  * HID: core: check whether Usage Page item is after Usage ID items
      drivers/hid/hid-core.c
  * futex: Prevent exit livelock
      kernel/futex.c
  * futex: Provide distinct return value when owner is exiting
      kernel/futex.c
  * futex: Add mutex around futex exit
      include/linux/futex.h
      include/linux/sched.h
      kernel/futex.c
  * futex: Provide state handling for exec() as well
      kernel/futex.c
  * futex: Sanitize exit state handling
      kernel/futex.c
  * futex: Mark the begin of futex exit explicitly
      include/linux/futex.h
      kernel/exit.c
      kernel/futex.c
  * futex: Set task::futex_state to DEAD right after handling futex exit
      kernel/exit.c
      kernel/futex.c
  * futex: Split futex_mm_release() for exit/exec
      include/linux/futex.h
      kernel/fork.c
      kernel/futex.c
  * exit/exec: Seperate mm_release()
      fs/exec.c
      include/linux/sched/mm.h
      kernel/exit.c
      kernel/fork.c
  * futex: Replace PF_EXITPIDONE with a state
      include/linux/futex.h
      include/linux/sched.h
      kernel/exit.c
      kernel/futex.c
  * futex: Move futex exit handling into futex code
      include/linux/compat.h
      include/linux/futex.h
      kernel/fork.c
      kernel/futex.c
  * futex: Prevent robust futex exit race
      kernel/futex.c
  * y2038: futex: Move compat implementation into futex.c
      include/linux/futex.h
      kernel/Makefile
      kernel/futex.c
    mtd: spi-nor: cast to u64 to avoid uint overflows
    mtd: rawnand: atmel: fix possible object reference leak
    mtd: rawnand: atmel: Fix spelling mistake in error message
    net: macb driver, check for SKBTX_HW_TSTAMP
    net: macb: Fix SUBNS increment and increase resolution
    watchdog: sama5d4: fix WDD value to be always set to max
  * ext4: add more paranoia checking in ext4_expand_extra_isize handling
      fs/ext4/inode.c
      fs/ext4/super.c
  * net: sched: fix `tc -s class show` no bstats on class with nolock subqueues
      net/sched/sch_mq.c
      net/sched/sch_multiq.c
      net/sched/sch_prio.c
  * sctp: cache netns in sctp_ep_common
      include/net/sctp/structs.h
    tipc: fix link name length check
    openvswitch: remove another BUG_ON()
    openvswitch: drop unneeded BUG_ON() in ovs_flow_cmd_build_info()
    slip: Fix use-after-free Read in slip_open
    openvswitch: fix flow command message size
    net: psample: fix skb_over_panic
    macvlan: schedule bc_work even if error
    media: atmel: atmel-isc: fix INIT_WORK misplacement
    media: atmel: atmel-isc: fix asd memory allocation
  * pwm: Clear chip_data in pwm_put()
      drivers/pwm/core.c
    net: macb: fix error format in dev_err()
  * media: v4l2-ctrl: fix flags for DO_WHITE_BALANCE
      drivers/media/v4l2-core/v4l2-ctrls.c
  * xfrm: Fix memleak on xfrm state destroy
      net/xfrm/xfrm_state.c
    mei: bus: prefix device names on bus with the bus name
    USB: serial: ftdi_sio: add device IDs for U-Blox C099-F9P
    staging: rtl8723bs: Add 024c:0525 to the list of SDIO device-ids
    staging: rtl8723bs: Drop ACPI device ids
    staging: rtl8192e: fix potential use after free
    clk: at91: generated: set audio_pll_allowed in at91_clk_register_generated()
    clk: at91: fix update bit maps on CFG_MOR write
    mm, gup: add missing refcount overflow checks on s390
    mtd: Remove a debug trace in mtdpart.c
    powerpc/pseries/dlpar: Fix a missing check in dlpar_parse_cc_property()
    scsi: libsas: Check SMP PHY control function result
    ACPI / APEI: Switch estatus pool to use vmalloc memory
    ACPI / APEI: Don't wait to serialise with oops messages when panic()ing
    scsi: libsas: Support SATA PHY connection rate unmatch fixing during discovery
    apparmor: delete the dentry in aafs_remove() to avoid a leak
    iommu/amd: Fix NULL dereference bug in match_hid_uid
    net: hns3: Change fw error code NOT_EXEC to NOT_SUPPORTED
  * bpf: drop refcount if bpf_map_new_fd() fails in map_create()
      kernel/bpf/syscall.c
    kvm: properly check debugfs dentry before using it
  * net: dev: Use unsigned integer as an argument to left-shift
      include/linux/netdevice.h
  * bpf: decrease usercnt if bpf_map_new_fd() fails in bpf_map_get_fd_by_id()
      kernel/bpf/syscall.c
    sctp: don't compare hb_timer expire date before starting it
  * net: fix possible overflow in __sk_mem_raise_allocated()
      include/net/sock.h
      net/core/sock.c
    sfc: initialise found bitmap in efx_ef10_mtd_probe
    tipc: fix skb may be leaky in tipc_link_input
  * blktrace: Show requests without sector
      include/linux/blktrace_api.h
    net/smc: prevent races between smc_lgr_terminate() and smc_conn_free()
    decnet: fix DN_IFREQ_SIZE
  * ip_tunnel: Make none-tunnel-dst tunnel port work with lwtunnel
      net/ipv4/ip_tunnel.c
    sfc: suppress duplicate nvmem partition types in efx_ef10_mtd_probe
    gpu: ipu-v3: pre: don't trigger update if buffer address doesn't change
    serial: 8250: Fix serial8250 initialization crash
  * net/core/neighbour: fix kmemleak minimal reference count for hash tables
      net/core/neighbour.c
  * PCI/MSI: Return -ENOSPC from pci_alloc_irq_vectors_affinity()
      drivers/pci/msi.c
  * net/core/neighbour: tell kmemleak about hash tables
      net/core/neighbour.c
    tipc: fix memory leak in tipc_nl_compat_publ_dump
    mtd: Check add_mtd_device() ret code
  * lib/genalloc.c: include vmalloc.h
      lib/genalloc.c
  * drivers/base/platform.c: kmemleak ignore a known leak
      drivers/base/platform.c
  * fork: fix some -Wmissing-prototypes warnings
      include/linux/sched/task.h
      init/main.c
      kernel/fork.c
  * lib/genalloc.c: use vzalloc_node() to allocate the bitmap
      lib/genalloc.c
  * lib/genalloc.c: fix allocation of aligned buffer from non-aligned chunk
      include/linux/genalloc.h
      lib/genalloc.c
  * vmscan: return NODE_RECLAIM_NOSCAN in node_reclaim() when CONFIG_NUMA is n
      include/linux/swap.h
      mm/internal.h
    ocfs2: clear journal dirty flag after shutdown journal
    net/wan/fsl_ucc_hdlc: Avoid double free in ucc_hdlc_probe()
    tipc: fix a missing check of genlmsg_put
    atl1e: checking the status of atl1e_write_phy_reg
    net: dsa: bcm_sf2: Propagate error value from mdio_write
    net: stmicro: fix a missing check of clk_prepare
    net: (cpts) fix a missing check of clk_prepare
    um: Make GCOV depend on !KCOV
  * f2fs: fix to dirty inode synchronously
      fs/f2fs/file.c
  * net/net_namespace: Check the return value of register_pernet_subsys()
      net/core/net_namespace.c
    net/netlink_compat: Fix a missing check of nla_parse_nested
    pwm: clps711x: Fix period calculation
    crypto: mxc-scc - fix build warnings on ARM64
    powerpc/pseries: Fix node leak in update_lmb_associativity_index()
    powerpc/83xx: handle machine check caused by watchdog timer
    regulator: tps65910: fix a missing check of return value
    IB/rxe: Make counters thread safe
    drbd: fix print_st_err()'s prototype to match the definition
    drbd: do not block when adjusting "disk-options" while IO is frozen
    drbd: reject attach of unsuitable uuids even if connected
    drbd: ignore "all zero" peer volume sizes in handshake
    powerpc/powernv/eeh/npu: Fix uninitialized variables in opal_pci_eeh_freeze_status
    vfio/spapr_tce: Get rid of possible infinite loop
    powerpc/44x/bamboo: Fix PCI range
    powerpc/mm: Make NULL pointer deferences explicit on bad page faults.
    powerpc/prom: fix early DEBUG messages
    powerpc/perf: Fix unit_sel/cache_sel checks
    ath6kl: Fix off by one error in scan completion
    ath6kl: Only use match sets when firmware supports it
    scsi: csiostor: fix incorrect dma device in case of vport
    scsi: qla2xxx: deadlock by configfs_depend_item
    RDMA/srp: Propagate ib_post_send() failures to the SCSI mid-layer
    openrisc: Fix broken paths to arch/or32
    serial: max310x: Fix tx_empty() callback
    Bluetooth: hci_bcm: Handle specific unknown packets after firmware loading
    drivers/regulator: fix a missing check of return value
    powerpc/xmon: fix dump_segments()
    powerpc/book3s/32: fix number of bats in p/v_block_mapped()
    vxlan: Fix error path in __vxlan_dev_create()
    clocksource/drivers/fttmr010: Fix invalid interrupt register access
    IB/qib: Fix an error code in qib_sdma_verbs_send()
    xfs: Fix bulkstat compat ioctls on x32 userspace.
    xfs: Align compat attrlist_by_handle with native implementation.
    gfs2: take jdata unstuff into account in do_grow
    dm flakey: Properly corrupt multi-page bios.
    HID: doc: fix wrong data structure reference for UHID_OUTPUT
    pinctrl: sh-pfc: sh7734: Fix shifted values in IPSR10
    pinctrl: sh-pfc: sh7264: Fix PFCR3 and PFCR0 register configuration
    KVM: s390: unregister debug feature on failing arch init
    bnxt_en: query force speeds before disabling autoneg mode.
    bnxt_en: Return linux standard errors in bnxt_ethtool.c
    exofs_mount(): fix leaks on failure exits
    net/mlx5: Continue driver initialization despite debugfs failure
    pinctrl: xway: fix gpio-hog related boot issues
    vfio-mdev/samples: Use u8 instead of char for handle functions
    xen/pciback: Check dev_data before using it
    kprobes/x86/xen: blacklist non-attachable xen interrupt functions
    serial: 8250: Rate limit serial port rx interrupts during input overruns
    HID: intel-ish-hid: fixes incorrect error handling
    btrfs: only track ref_heads in delayed_ref_updates
    mtd: rawnand: sunxi: Write pageprog related opcodes to WCMD_SET
    mmc: meson-gx: make sure the descriptor is stopped on errors
    VSOCK: bind to random port for VMADDR_PORT_ANY
    kvm: vmx: Set IA32_TSC_AUX for legacy mode guests
  * gpiolib: Fix return value of gpio_to_desc() stub if !GPIOLIB
      include/linux/gpio/consumer.h
    iwlwifi: move iwl_nvm_check_version() into dvm
    microblaze: move "... is ready" messages to arch/microblaze/Makefile
    microblaze: adjust the help to the real behavior
    ubi: Do not drop UBI device reference before using
    ubi: Put MTD device after it is not used
    xfs: require both realtime inodes to mount
    rtl818x: fix potential use after free
    mwifiex: debugfs: correct histogram spacing, formatting
    mwifiex: fix potential NULL dereference and use after free
    crypto: user - support incremental algorithm dumps
    scsi: lpfc: Enable Management features for IF_TYPE=6
    ACPI / LPSS: Ignore acpi_device_fix_up_power() return value
    ARM: ks8695: fix section mismatch warning
    PM / AVS: SmartReflex: NULL check before some freeing functions is not needed
    RDMA/vmw_pvrdma: Use atomic memory allocation in create AH
    ARM: OMAP1: fix USB configuration for device-only setups
  * arm64: smp: Handle errors reported by the firmware
      arch/arm64/kernel/smp.c
  * arm64: mm: Prevent mismatched 52-bit VA support
      arch/arm64/kernel/head.S
      arch/arm64/kernel/smp.c
    parisc: Fix HP SDC hpa address output
    parisc: Fix serio address output
    ARM: dts: imx53-voipac-dmm-668: Fix memory node duplication
    ARM: debug-imx: only define DEBUG_IMX_UART_PORT if needed
    ARM: dts: Fix up SQ201 flash access
    scsi: lpfc: Fix dif and first burst use in write commands
    scsi: lpfc: Fix kernel Oops due to null pring pointers
    pwm: bcm-iproc: Prevent unloading the driver module while in use
    block: drbd: remove a stray unlock in __drbd_send_protocol()
    mac80211: fix station inactive_time shortly after boot
    ceph: return -EINVAL if given fsc mount option on kernel w/o support
    net: bcmgenet: reapply manual settings to the PHY
    scripts/gdb: fix debugging modules compiled with hot/cold partitioning
    watchdog: meson: Fix the wrong value of left time
    can: rx-offload: can_rx_offload_irq_offload_fifo(): continue on error
    can: rx-offload: can_rx_offload_irq_offload_timestamp(): continue on error
    can: rx-offload: can_rx_offload_offload_one(): use ERR_PTR() to propagate error value in case of errors
    can: rx-offload: can_rx_offload_offload_one(): increment rx_fifo_errors on queue overflow or OOM
    can: rx-offload: can_rx_offload_offload_one(): do not increase the skb_queue beyond skb_queue_len_max
    can: rx-offload: can_rx_offload_queue_tail(): fix error handling, avoid skb mem leak
    can: c_can: D_CAN: c_can_chip_config(): perform a sofware reset on open
    can: peak_usb: report bus recovery as well
    bridge: ebtables: don't crash when using dnat target in output chains
    net: fec: add missed clk_disable_unprepare in remove
    clk: ti: dra7-atl-clock: Remove ti_clk_add_alias call
    x86/resctrl: Prevent NULL pointer dereference when reading mondata
  * idr: Fix idr_alloc_u32 on 32-bit systems
      lib/radix-tree.c
    clk: sunxi-ng: a80: fix the zero'ing of bits 16 and 18
    clk: at91: avoid sleeping early
  * reset: fix reset_control_ops kerneldoc comment
      include/linux/reset-controller.h
    clk: samsung: exynos5420: Preserve PLL configuration during suspend/resume
    ASoC: kirkwood: fix external clock probe defer
  * reset: Fix memory leak in reset_control_array_put()
      drivers/reset/core.c
  * ASoC: compress: fix unsigned integer overflow check
      sound/core/compress_offload.c
    ASoC: msm8916-wcd-analog: Fix RX1 selection in RDAC2 MUX
    clk: meson: gxbb: let sar_adc_clk_div set the parent clock rate
    Revert "KVM: nVMX: reset cache/shadows when switching loaded VMCS"
  * fs/lock: skip lock owner pid translation in case we are in init_pid_ns
      fs/locks.c
    UPSTREAM: dt-bindings: arm: coresight: Add support for coresight-loses-context-with-cpu
  * BACKPORT: coresight: etm4x: Save/restore state across CPU low power states
      include/linux/coresight.h
  * f2fs: stop GC when the victim becomes fully valid
      fs/f2fs/gc.c
  * f2fs: expose main_blkaddr in sysfs
      fs/f2fs/sysfs.c
  * f2fs: choose hardlimit when softlimit is larger than hardlimit in f2fs_statfs_project()
      fs/f2fs/super.c
  * f2fs: Fix deadlock in f2fs_gc() context during atomic files handling
      fs/f2fs/f2fs.h
      fs/f2fs/file.c
      fs/f2fs/segment.c
  * f2fs: show f2fs instance in printk_ratelimited
      fs/f2fs/checkpoint.c
      fs/f2fs/data.c
      fs/f2fs/dir.c
      fs/f2fs/f2fs.h
      fs/f2fs/file.c
      fs/f2fs/gc.c
      fs/f2fs/inode.c
      fs/f2fs/node.c
      fs/f2fs/segment.c
  * f2fs: fix potential overflow
      fs/f2fs/data.c
      fs/f2fs/file.c
  * f2fs: fix to update dir's i_pino during cross_rename
      fs/f2fs/namei.c
  * f2fs: support aligned pinned file
      fs/f2fs/f2fs.h
      fs/f2fs/file.c
      fs/f2fs/recovery.c
      fs/f2fs/segment.c
      fs/f2fs/segment.h
      fs/f2fs/super.c
      fs/f2fs/sysfs.c
  * f2fs: avoid kernel panic on corruption test
      fs/f2fs/node.c
    f2fs: fix wrong description in document
  * f2fs: cache global IPU bio
      fs/f2fs/data.c
      fs/f2fs/f2fs.h
      fs/f2fs/segment.c
      fs/f2fs/super.c
  * f2fs: fix to avoid memory leakage in f2fs_listxattr
      fs/f2fs/xattr.c
  * f2fs: check total_segments from devices in raw_super
      fs/f2fs/super.c
  * f2fs: update multi-dev metadata in resize_fs
      fs/f2fs/gc.c
  * f2fs: mark recovery flag correctly in read_raw_super_block()
      fs/f2fs/super.c
  * f2fs: fix to update time in lazytime mode
      fs/f2fs/f2fs.h
      fs/f2fs/inode.c
  * vfs: don't allow writes to swap files
      fs/block_dev.c
      include/linux/fs.h
      mm/filemap.c
      mm/memory.c
      mm/mmap.c
      mm/swapfile.c
  * mm: set S_SWAPFILE on blockdev swap devices
      mm/swapfile.c
    BACKPORT: ARM: 8900/1: UNWINDER_FRAME_POINTER implementation for Clang
    Merge 4.14.157 into android-4.14
Linux 4.14.157
    x86/hyperv: mark hyperv_init as __init function
    KVM: PPC: Book3S HV: Flush link stack on guest exit to host kernel
    powerpc/book3s64: Fix link stack flush on context switch
    powerpc/64s: support nospectre_v2 cmdline option
    staging: comedi: usbduxfast: usbduxfast_ai_cmdtest rounding error
    USB: serial: option: add support for Foxconn T77W968 LTE modules
    USB: serial: option: add support for DW5821e with eSIM support
    USB: serial: mos7840: fix remote wakeup
    USB: serial: mos7720: fix remote wakeup
    USB: serial: mos7840: add USB ID to support Moxa UPort 2210
    appledisplay: fix error handling in the scheduled work
    USB: chaoskey: fix error case of a timeout
    usb-serial: cp201x: support Mark-10 digital force gauge
    usbip: Fix uninitialized symbol 'nents' in stub_recv_cmd_submit()
    usbip: tools: fix fd leakage in the function of read_attr_usbip_status
    virtio_ring: fix return code on DMA mapping fails
    media: imon: invalid dereference in imon_touch_event
    media: cxusb: detect cxusb_ctrl_msg error in query
    media: b2c2-flexcop-usb: add sanity checking
    media: uvcvideo: Fix error path in control parsing failure
  * cpufreq: Add NULL checks to show() and store() methods of cpufreq
      drivers/cpufreq/cpufreq.c
    media: usbvision: Fix races among open, close, and disconnect
    media: vivid: Fix wrong locking that causes race conditions on streaming stop
    media: vivid: Set vid_cap_streaming and vid_out_streaming to true
    nfc: port100: handle command failure cleanly
    nbd: prevent memory leak
    x86/speculation: Fix redundant MDS mitigation message
    x86/speculation: Fix incorrect MDS/TAA mitigation status
    x86/insn: Fix awk regexp warnings
    ARC: perf: Accommodate big-endian CPU
    ARM: 8904/1: skip nomap memblocks while finding the lowmem/highmem boundary
    ocfs2: remove ocfs2_is_o2cb_active()
  * cpufreq: Skip cpufreq resume if it's not suspended
      drivers/cpufreq/cpufreq.c
  * arm64: fix for bad_mode() handler to always result in panic
      arch/arm64/kernel/traps.c
    net: phy: dp83867: increase SGMII autoneg timer duration
    net: phy: dp83867: fix speed 10 in sgmii mode
    mm/memory_hotplug: don't access uninitialized memmaps in shrink_zone_span()
    md/raid10: prevent access of uninitialized resync_pages offset
    ath9k_hw: fix uninitialized variable data
    ath10k: Fix a NULL-ptr-deref bug in ath10k_usb_alloc_urb_from_pipe
  * KVM: MMU: Do not treat ZONE_DEVICE pages as being reserved
      include/linux/kvm_host.h
    Bluetooth: Fix invalid-free in bcsp_close()
  * cfg80211: call disconnect_wk when AP stops
      net/wireless/ap.c
      net/wireless/core.h
      net/wireless/sme.c
  * ipv6: Fix handling of LLA with VRF and sockets bound to VRF
      net/ipv6/tcp_ipv6.c
    mm/memory_hotplug: Do not unlock when fails to take the device_hotplug_lock
    i2c: uniphier-f: fix timeout error after reading 8 bytes
    spi: omap2-mcspi: Fix DMA and FIFO event trigger size mismatch
    PCI: keystone: Use quirk to limit MRRS for K2G
    pinctrl: zynq: Use define directive for PIN_CONFIG_IO_STANDARD
    pinctrl: lpc18xx: Use define directive for PIN_CONFIG_GPIO_PIN_INT
  * pinctrl: qcom: spmi-gpio: fix gpio-hog related boot issues
      drivers/pinctrl/qcom/pinctrl-spmi-gpio.c
  * cfg80211: Prevent regulatory restore during STA disconnect in concurrent interfaces
      net/wireless/sme.c
    of: unittest: allow base devicetree to have symbol metadata
    net: bcmgenet: return correct value 'ret' from bcmgenet_power_down
    ACPICA: Use %d for signed int print formatting instead of %u
    vrf: mark skb for multicast or link-local as enslaved to VRF
    dlm: don't leak kernel pointer to userspace
    dlm: fix invalid free
    scsi: lpfc: Correct loss of fc4 type on remote port address change
    scsi: lpfc: fcoe: Fix link down issue after 1000+ link bounces
    scsi: megaraid_sas: Fix goto labels in error handling
    scsi: megaraid_sas: Fix msleep granularity
    scsi: mpt3sas: Fix driver modifying persistent data in Manufacturing page11
    scsi: mpt3sas: Don't modify EEDPTagMode field setting on SAS3.5 HBA devices
    scsi: mpt3sas: Fix Sync cache command failure during driver unload
    net: dsa: bcm_sf2: Turn on PHY to allow successful registration
    rtlwifi: rtl8192de: Fix misleading REG_MCUFWDL information
    wireless: airo: potential buffer overflow in sprintf()
    brcmsmac: never log "tid x is not agg'able" by default
    rtl8xxxu: Fix missing break in switch
    wlcore: Fix the return value in case of error in 'wlcore_vendor_cmd_smart_config_start()'
    wil6210: fix locking in wmi_call
    btrfs: avoid link error with CONFIG_NO_AUTO_INLINE
    audit: print empty EXECVE args
    clk: sunxi-ng: enable so-said LDOs for A64 SoC's pll-mipi clock
    openvswitch: fix linking without CONFIG_NF_CONNTRACK_LABELS
  * sched/fair: Don't increase sd->balance_interval on newidle balance
      kernel/sched/fair.c
  * sched/topology: Fix off by one bug
      kernel/sched/topology.c
  * net: do not abort bulk send on BQL status
      net/core/dev.c
    ocfs2: fix clusters leak in ocfs2_defrag_extent()
    ocfs2: don't put and assigning null to bh allocated outside
  * arm64: makefile fix build of .i file in external module case
      arch/arm64/Makefile
    ntb: intel: fix return value for ndev_vec_mask()
    ntb_netdev: fix sleep time mismatch
    net: hns3: bugfix for buffer not free problem during resetting
    igb: shorten maximum PHC timecounter update interval
  * mm/memory_hotplug: make add_memory() take the device_hotplug_lock
      include/linux/memory_hotplug.h
    fs/hfs/extent.c: fix array out of bounds read of array extent
    hfs: update timestamp on truncate()
    hfsplus: update timestamps on truncate()
    hfs: fix return value of hfs_get_block()
    hfsplus: fix return value of hfsplus_get_block()
    hfs: prevent btree data loss on ENOSPC
    hfsplus: prevent btree data loss on ENOSPC
    hfs: fix BUG on bnode parent update
    hfsplus: fix BUG on bnode parent update
  * linux/bitmap.h: fix type of nbits in bitmap_shift_right()
      include/linux/bitmap.h
  * linux/bitmap.h: handle constant zero-size bitmaps correctly
      include/linux/bitmap.h
    selftests/powerpc/cache_shape: Fix out-of-tree build
    selftests/powerpc/switch_endian: Fix out-of-tree build
    selftests/powerpc/signal: Fix out-of-tree build
    powerpc/xmon: Relax frame size for clang
  * vfs: avoid problematic remapping requests into partial EOF block
      fs/read_write.c
    um: Make line/tty semantics use true write IRQ
    i2c: uniphier-f: fix race condition when IRQ is cleared
    i2c: uniphier-f: fix occasional timeout error
    i2c: uniphier-f: make driver robust against concurrency
  * block: fix the DISCARD request merge
      block/blk-merge.c
    macsec: let the administrator set UP state even if lowerdev is down
    macsec: update operstate when lower device changes
  * mm/page-writeback.c: fix range_cyclic writeback vs writepages deadlock
      mm/page-writeback.c
    fs/ocfs2/dlm/dlmdebug.c: fix a sleep-in-atomic-context bug in dlm_print_one_mle()
  * arm64: lib: use C string functions with KASAN enabled
      arch/arm64/include/asm/string.h
      arch/arm64/kernel/arm64ksyms.c
      arch/arm64/lib/memchr.S
      arch/arm64/lib/memcmp.S
      arch/arm64/lib/strchr.S
      arch/arm64/lib/strcmp.S
      arch/arm64/lib/strlen.S
      arch/arm64/lib/strncmp.S
      arch/arm64/lib/strnlen.S
      arch/arm64/lib/strrchr.S
    sparc64: Rework xchg() definition to avoid warnings.
    powerpc/process: Fix flush_all_to_thread for SPE
  * bpf: devmap: fix wrong interface selection in notifier_call
      kernel/bpf/devmap.c
    thermal: rcar_thermal: Prevent hardware access during system suspend
    selftests: watchdog: Fix error message.
    selftests: watchdog: fix message when /dev/watchdog open fails
    selftests/ftrace: Fix to test kprobe $comm arg only if available
    mfd: max8997: Enale irq-wakeup unconditionally
    mfd: intel_soc_pmic_bxtwc: Chain power button IRQs as well
    mfd: mc13xxx-core: Fix PMIC shutdown when reading ADC values
    mfd: arizona: Correct calling of runtime_put_sync
    net: ethernet: ti: cpsw: unsync mcast entries while switch promisc mode
    qlcnic: fix a return in qlcnic_dcb_get_capability()
    mISDN: Fix type of switch control variable in ctrl_teimanager
  * f2fs: fix to spread clear_cold_data()
      fs/f2fs/data.c
      fs/f2fs/dir.c
      fs/f2fs/segment.c
    rtc: s35390a: Change buf's type to u8 in s35390a_init
    ceph: fix dentry leak in ceph_readdir_prepopulate
    powerpc/pseries: Export raw per-CPU VPA data via debugfs
    sparc: Fix parport build warnings.
    spi: omap2-mcspi: Set FIFO DMA trigger level to word length
    s390/perf: Return error when debug_register fails
    atm: zatm: Fix empty body Clang warnings
    sunrpc: safely reallow resvport min/max inversion
    SUNRPC: Fix a compile warning for cmpxchg64()
    dm raid: avoid bitmap with raid4/5/6 journal device
    usbip: tools: fix atoi() on non-null terminated string
    USB: misc: appledisplay: fix backlight update_status return code
    PCI: vmd: Detach resources after stopping root bus
    macintosh/windfarm_smu_sat: Fix debug output
    ALSA: i2c/cs8427: Fix int to char conversion
  * PM / Domains: Deal with multiple states but no governor in genpd
      drivers/base/power/domain.c
    kprobes, x86/ptrace.h: Make regs_get_kernel_stack_nth() not fault on bad stack
    xfs: fix use-after-free race in xfs_buf_rele
  * net: ena: Fix Kconfig dependency on X86
      drivers/net/ethernet/amazon/Kconfig
  * net: fix warning in af_unix
      net/unix/af_unix.c
    net: dsa: mv88e6xxx: Fix 88E6141/6341 2500mbps SERDES speed
    scsi: dc395x: fix DMA API usage in sg_update_list
    scsi: dc395x: fix dma API usage in srb_done
    ASoC: tegra_sgtl5000: fix device_node refcounting
    clk: at91: audio-pll: fix audio pmc type
    clk: mmp2: fix the clock id for sdh2_clk and sdh3_clk
    nvmet-fcloop: suppress a compiler warning
    crypto: ccree - avoid implicit enum conversion
    scsi: iscsi_tcp: Explicitly cast param in iscsi_sw_tcp_host_get_param
    scsi: isci: Change sci_controller_start_task's return type to sci_status
    scsi: isci: Use proper enumerated type in atapi_d2h_reg_frame_handler
    KVM/x86: Fix invvpid and invept register operand size in 64-bit mode
    KVM: nVMX: reset cache/shadows when switching loaded VMCS
    scsi: ips: fix missing break in switch
    qed: Align local and global PTT to propagate through the APIs.
    amiflop: clean up on errors during setup
    pwm: lpss: Only set update bit if we are actually changing the settings
    pinctrl: sunxi: Fix a memory leak in 'sunxi_pinctrl_build_state()'
    RDMA/bnxt_re: Fix qp async event reporting
    m68k: fix command-line parsing when passed from u-boot
    w1: IAD Register is yet readable trough iad sys file. Fix snprintf (%u for unsigned, count for max size).
    misc: mic: fix a DMA pool free failure
    gsmi: Fix bug in append_to_eventlog sysfs handler
    btrfs: handle error of get_old_root
    mmc: mediatek: fix cannot receive new request when msdc_cmd_is_ready fail
    spi: sh-msiof: fix deferred probing
    cdrom: don't attempt to fiddle with cdo->capability
    skd: fixup usage of legacy IO API
    ath10k: allocate small size dma memory in ath10k_pci_diag_write_mem
    brcmsmac: AP mode: update beacon when TIM changes
    EDAC, thunderx: Fix memory leak in thunderx_l2c_threaded_isr()
    powerpc/eeh: Fix use of EEH_PE_KEEP on wrong field
    powerpc/boot: Disable vector instructions
    powerpc: Fix signedness bug in update_flash_db()
    synclink_gt(): fix compat_ioctl()
  * pty: fix compat ioctls
      drivers/tty/pty.c
    gfs2: Fix marking bitmaps non-full
  * printk: fix integer overflow in setup_log_buf()
      kernel/printk/printk.c
    ALSA: isight: fix leak of reference to firewire unit in error path of .probe callback
    mwifiex: Fix NL80211_TX_POWER_LIMITED
    platform/x86: asus-wmi: Only Tell EC the OS will handle display hotkeys from asus_nb_wmi
    platform/x86: asus-nb-wmi: Support ALS on the Zenbook UX430UQ
    drm/i915/userptr: Try to acquire the page lock around set_page_dirty()
    mm/ksm.c: don't WARN if page is still mapped in remove_stable_node()
    Revert "fs: ocfs2: fix possible null-pointer dereferences in ocfs2_xa_prepare_entry()"
    virtio_console: allocate inbufs in add_port() only if it is needed
    nbd:fix memory leak in nbd_get_socket()
    tools: gpio: Correctly add make dependencies for gpio_utils
    gpio: max77620: Fixup debounce delays
    vhost/vsock: split packets to send using multiple buffers
    net/sched: act_pedit: fix WARN() in the traffic path
    net/mlxfw: Verify FSM error code translation doesn't exceed array size
    net/mlx5e: Fix set vf link state error flow
    sfc: Only cancel the PPS workqueue if it exists
  * net: rtnetlink: prevent underflows in do_setvfinfo()
      net/core/rtnetlink.c
    net/mlx4_en: fix mlx4 ethtool -N insertion
    ANDROID: removed CONFIG_PM_WAKELOCKS
    Merge 4.14.156 into android-4.14
Linux 4.14.156
    mmc: tmio: fix SCC error handling to avoid false positive CRC error
    powerpc/time: Fix clockevent_decrementer initalisation for PR KVM
    tools: PCI: Fix broken pcitest compilation
    ARM: dts: omap5: Fix dual-role mode on Super-Speed port
    mlxsw: spectrum_switchdev: Check notification relevance based on upper device
    spi: rockchip: initialize dma_slave_config properly
    mac80211: minstrel: fix sampling/reporting of CCK rates in HT mode
    mac80211: minstrel: fix CCK rate group streams value
    mac80211: minstrel: fix using short preamble CCK rates on HT clients
    misc: cxl: Fix possible null pointer dereference
    netfilter: nft_compat: do not dump private area
    hwmon: (ina3221) Fix INA3221_CONFIG_MODE macros
    hwmon: (pwm-fan) Silence error on probe deferral
    pinctrl: gemini: Fix up TVC clock group
    orangefs: rate limit the client not running info message
    ARM: 8802/1: Call syscall_trace_exit even when system call skipped
  * spi: spidev: Fix OF tree warning logic
      drivers/spi/spidev.c
    pinctrl: gemini: Mask and set properly
    spi: fsl-lpspi: Prevent FIFO under/overrun by default
    gpio: syscon: Fix possible NULL ptr usage
    x86/kexec: Correct KEXEC_BACKUP_SRC_END off-by-one error
    media: cx231xx: fix potential sign-extension overflow on large shift
    GFS2: Flush the GFS2 delete workqueue before stopping the kernel threads
    media: isif: fix a NULL pointer dereference bug
  * printk: Give error on attempt to set log buffer length to over 2G
      kernel/printk/printk.c
    mfd: ti_am335x_tscadc: Keep ADC interface on if child is wakeup capable
    backlight: lm3639: Unconditionally call led_classdev_unregister
    proc/vmcore: Fix i386 build error of missing copy_oldmem_page_encrypted()
    s390/kasan: avoid vdso instrumentation
    media: dw9714: Fix error handling in probe function
    bcache: recal cached_dev_sectors on detach
  * reset: Fix potential use-after-free in __of_reset_control_get()
      drivers/reset/core.c
    fbdev: sbuslib: integer overflow in sbusfb_ioctl_helper()
    fbdev: sbuslib: use checked version of put_user()
    mmc: tmio: Fix SCC error detection
    x86/fsgsbase/64: Fix ptrace() to read the FS/GS base accurately
    iwlwifi: mvm: don't send keys when entering D3
    ACPI / SBS: Fix rare oops when removing modules
  * xfrm: use correct size to initialise sp->ovec
      net/xfrm/xfrm_input.c
    crypto: mxs-dcp - Fix AES issues
    crypto: mxs-dcp - Fix SHA null hashes and output length
    dmaengine: rcar-dmac: set scatter/gather max segment size
    x86/olpc: Fix build error with CONFIG_MFD_CS5535=m
    kexec: Allocate decrypted control pages for kdump if SME is enabled
    remoteproc: Check for NULL firmwares in sysfs interface
    Input: silead - try firmware reload after unsuccessful resume
    Input: st1232 - set INPUT_PROP_DIRECT property
    media: cec-gpio: select correct Signal Free Time
    dmaengine: ioat: fix prototype of ioat_enumerate_channels
    NFSv4.x: fix lock recovery during delegation recall
  * i2c: brcmstb: Allow enabling the driver on DSL SoCs
      drivers/i2c/busses/Kconfig
    clk: samsung: Use clk_hw API for calling clk framework from clk notifiers
    clk: samsung: exynos5420: Define CLK_SECKEY gate clock only or Exynos5420
    qtnfmac: drop error reports for out-of-bounds key indexes
    qtnfmac: pass sgi rate info flag to wireless core
    brcmfmac: fix full timeout waiting for action frame on-channel tx
    brcmfmac: reduce timeout for action frame scan
  * cpu/SMT: State SMT is disabled even with nosmt and without "=force"
      kernel/cpu.c
    mtd: physmap_of: Release resources on error
    USB: serial: cypress_m8: fix interrupt-out transfer length
    KVM: PPC: Book3S PR: Exiting split hack mode needs to fixup both PC and LR
    ALSA: hda/sigmatel - Disable automute for Elo VuPoint
    media: i2c: adv748x: Support probing a single output
    media: pxa_camera: Fix check for pdev->dev.of_node
    media: rc: ir-rc6-decoder: enable toggle bit for Kathrein RCU-676 remote
    ata: ep93xx: Use proper enums for directions
    powerpc/time: Use clockevents_register_device(), fixing an issue with large decrementer
    ACPICA: Never run _REG on system_memory and system_IO
  * IB/mlx4: Avoid implicit enumerated type conversion
      include/rdma/ib_verbs.h
    IB/mthca: Fix error return code in __mthca_init_one()
    ixgbe: Fix crash with VFs and flow director on interface flap
    i40e: Use proper enum in i40e_ndo_set_vf_link_state
    ixgbe: Fix ixgbe TX hangs with XDP_TX beyond queue limit
    md: allow metadata updates while suspending an array - fix
    clocksource/drivers/sh_cmt: Fix clocksource width for 32-bit machines
    clocksource/drivers/sh_cmt: Fixup for 64-bit machines
    tools: PCI: Fix compilation warnings
    PM / hibernate: Check the success of generating md5 digest before hibernation
    mtd: rawnand: sh_flctl: Use proper enum for flctl_dma_fifo0_transfer
    ARM: dts: at91: at91sam9x5cm: fix addressable nand flash size
    ARM: dts: at91: sama5d4_xplained: fix addressable nand flash size
    powerpc/xive: Move a dereference below a NULL test
    powerpc/pseries: Fix how we iterate over the DTL entries
    powerpc/pseries: Fix DTL buffer registration
    cxgb4: Use proper enum in IEEE_FAUX_SYNC
    cxgb4: Use proper enum in cxgb4_dcb_handle_fw_update
    mei: samples: fix a signedness bug in amt_host_if_call()
    sunrpc: Fix connect metrics
  * clk: keystone: Enable TISCI clocks if K3_ARCH
      drivers/clk/Makefile
      drivers/clk/keystone/Kconfig
  * ext4: fix build error when DX_DEBUG is defined
      fs/ext4/namei.c
    dmaengine: timb_dma: Use proper enum in td_prep_slave_sg
    dmaengine: ep93xx: Return proper enum in ep93xx_dma_chan_direction
    KVM: PPC: Inform the userspace about TCE update failures
    watchdog: w83627hf_wdt: Support NCT6796D, NCT6797D, NCT6798D
    irqchip/irq-mvebu-icu: Fix wrong private data retrieval
  * nl80211: Fix a GET_KEY reply attribute
      net/wireless/nl80211.c
  * usb: dwc3: gadget: Check ENBLSLPM before sending ep command
      drivers/usb/dwc3/gadget.c
    usb: gadget: udc: fotg210-udc: Fix a sleep-in-atomic-context bug in fotg210_get_status()
    ath9k: fix reporting calculated new FFT upper max
  * ata: ahci_brcm: Allow using driver or DSL SoCs
      drivers/ata/Kconfig
    ath10k: fix vdev-start timeout on error
    arm64/numa: Report correct memblock range for the dummy node
    kvm: arm/arm64: Fix stage2_flush_memslot for 4 level page table
  * iommu/io-pgtable-arm: Fix race handling in split_blk_unmap()
      drivers/iommu/io-pgtable-arm.c
    IB/hfi1: Ensure ucast_dlid access doesnt exceed bounds
  * SUNRPC: Fix priority queue fairness
      include/linux/sunrpc/sched.h
    ARM: dts: sun8i: h3-h5: ir register size should be the whole memory block
  * f2fs: return correct errno in f2fs_gc
      fs/f2fs/gc.c
    net: hns3: Fix for netdev not up problem when setting mtu
    ARM: dts: omap5: enable OTG role for DWC3 controller
    ARM: dts: dra7: Enable workaround for errata i870 in PCIe host mode
    net: xen-netback: fix return type of ndo_start_xmit function
    net: ovs: fix return type of ndo_start_xmit function
  * fbdev: Ditch fb_edid_add_monspecs
      drivers/video/fbdev/core/fbmon.c
      drivers/video/fbdev/core/modedb.c
      include/linux/fb.h
  * arm64: uaccess: Ensure PAN is re-enabled after unhandled uaccess fault
      arch/arm64/lib/clear_user.S
      arch/arm64/lib/copy_from_user.S
      arch/arm64/lib/copy_in_user.S
      arch/arm64/lib/copy_to_user.S
    mm/memory_hotplug: fix updating the node span
    mm/memory_hotplug: don't access uninitialized memmaps in shrink_pgdat_span()
  * idr: Fix idr_get_next race with idr_remove
      lib/idr.c
  * net: cdc_ncm: Signedness bug in cdc_ncm_set_dgram_size()
      drivers/net/usb/cdc_ncm.c
  * Revert "OPP: Protect dev_list with opp_table lock"
      drivers/base/power/opp/core.c
      drivers/base/power/opp/cpu.c
      drivers/base/power/opp/opp.h
    tee: optee: add missing of_node_put after of_device_is_available
    spi: mediatek: use correct mata->xfer_len when in fifo transfer
  * FROMGIT: pinctrl: devicetree: Avoid taking direct reference to device name string
      drivers/pinctrl/devicetree.c

Change-Id: I9c31a0c869961bd1866ef774568945fa54f0313d
Signed-off-by: Robin Peng <robinpeng@google.com>
2020-01-06 19:02:28 +08:00
David Hildenbrand
5cb8388a68 mm/memory_hotplug: make add_memory() take the device_hotplug_lock
[ Upstream commit 8df1d0e4a265f25dc1e7e7624ccdbcb4a6630c89 ]

add_memory() currently does not take the device_hotplug_lock, however
is aleady called under the lock from
	arch/powerpc/platforms/pseries/hotplug-memory.c
	drivers/acpi/acpi_memhotplug.c
to synchronize against CPU hot-remove and similar.

In general, we should hold the device_hotplug_lock when adding memory to
synchronize against online/offline request (e.g.  from user space) - which
already resulted in lock inversions due to device_lock() and
mem_hotplug_lock - see 30467e0b3b ("mm, hotplug: fix concurrent memory
hot-add deadlock").  add_memory()/add_memory_resource() will create memory
block devices, so this really feels like the right thing to do.

Holding the device_hotplug_lock makes sure that a memory block device
can really only be accessed (e.g. via .online/.state) from user space,
once the memory has been fully added to the system.

The lock is not held yet in
	drivers/xen/balloon.c
	arch/powerpc/platforms/powernv/memtrace.c
	drivers/s390/char/sclp_cmd.c
	drivers/hv/hv_balloon.c
So, let's either use the locked variants or take the lock.

Don't export add_memory_resource(), as it once was exported to be used by
XEN, which is never built as a module.  If somebody requires it, we also
have to export a locked variant (as device_hotplug_lock is never
exported).

Link: http://lkml.kernel.org/r/20180925091457.28651-3-david@redhat.com
Signed-off-by: David Hildenbrand <david@redhat.com>
Reviewed-by: Pavel Tatashin <pavel.tatashin@microsoft.com>
Reviewed-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com>
Reviewed-by: Rashmica Gupta <rashmica.g@gmail.com>
Reviewed-by: Oscar Salvador <osalvador@suse.de>
Cc: Benjamin Herrenschmidt <benh@kernel.crashing.org>
Cc: Paul Mackerras <paulus@samba.org>
Cc: Michael Ellerman <mpe@ellerman.id.au>
Cc: "Rafael J. Wysocki" <rjw@rjwysocki.net>
Cc: Len Brown <lenb@kernel.org>
Cc: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
Cc: Boris Ostrovsky <boris.ostrovsky@oracle.com>
Cc: Juergen Gross <jgross@suse.com>
Cc: Nathan Fontenot <nfont@linux.vnet.ibm.com>
Cc: John Allen <jallen@linux.vnet.ibm.com>
Cc: Michal Hocko <mhocko@suse.com>
Cc: Dan Williams <dan.j.williams@intel.com>
Cc: Joonsoo Kim <iamjoonsoo.kim@lge.com>
Cc: Vlastimil Babka <vbabka@suse.cz>
Cc: Mathieu Malaterre <malat@debian.org>
Cc: Pavel Tatashin <pavel.tatashin@microsoft.com>
Cc: YASUAKI ISHIMATSU <yasu.isimatu@gmail.com>
Cc: Balbir Singh <bsingharora@gmail.com>
Cc: Haiyang Zhang <haiyangz@microsoft.com>
Cc: Heiko Carstens <heiko.carstens@de.ibm.com>
Cc: Jonathan Corbet <corbet@lwn.net>
Cc: Kate Stewart <kstewart@linuxfoundation.org>
Cc: "K. Y. Srinivasan" <kys@microsoft.com>
Cc: Martin Schwidefsky <schwidefsky@de.ibm.com>
Cc: Michael Neuling <mikey@neuling.org>
Cc: Philippe Ombredanne <pombredanne@nexb.com>
Cc: Stephen Hemminger <sthemmin@microsoft.com>
Cc: Thomas Gleixner <tglx@linutronix.de>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
Signed-off-by: Sasha Levin <sashal@kernel.org>
2019-12-01 09:13:57 +01:00
Sudarshan Rajagopalan
0800fc3a04 mm/memory_hotplug: Reduce migration re-try timeout value
Currently, the timeout value for migration re-tries during
memory offlining is 120 seconds. If migration encounters a
pinned page, it is guaranteed to fail no matter how many
re-tries are attempted, and there is much CPU cycles spent
on the re-tries with this timeout, which would contribute
to power. Hence, reduce the timeout to half, 60 seconds.

Change-Id: Ib300c4c1c3c4f5d10f5d8652d04a783c0f459fc1
Signed-off-by: Sudarshan Rajagopalan <sudaraja@codeaurora.org>
2018-08-22 14:46:23 -07:00
Liam Mark
7742a977b5 mm, oom: Try to online memory block before killing
Before killing a process first try to see if there are any offlined
memory blocks, if there are try to online one of them.

Change-Id: Ie97c8b69f0ba173c202a891a38a5c914869ddaae
Signed-off-by: Liam Mark <lmark@codeaurora.org>
2018-06-06 17:36:08 -07:00
Arun KS
5768f08b8d arm64: Honor limits set by bootloader
Introduce a variable to save bootloader enforced memory limits and
restricts adding beyond this boundary during a memory hotplug. Also,
export this symbol so that other kernel module have access to it.

Change-Id: I28c100644b7287ec4625c4c018b5fffc865e2e72
Signed-off-by: Arun KS <arunks@codeaurora.org>
[sudaraja@codeaurora.org: check limit with physical address of page]
Signed-off-by: Sudarshan Rajagopalan <sudaraja@codeaurora.org>
2018-05-16 18:09:13 -07:00
Greg Kroah-Hartman
b24413180f License cleanup: add SPDX GPL-2.0 license identifier to files with no license
Many source files in the tree are missing licensing information, which
makes it harder for compliance tools to determine the correct license.

By default all files without license information are under the default
license of the kernel, which is GPL version 2.

Update the files which contain no license information with the 'GPL-2.0'
SPDX license identifier.  The SPDX identifier is a legally binding
shorthand, which can be used instead of the full boiler plate text.

This patch is based on work done by Thomas Gleixner and Kate Stewart and
Philippe Ombredanne.

How this work was done:

Patches were generated and checked against linux-4.14-rc6 for a subset of
the use cases:
 - file had no licensing information it it.
 - file was a */uapi/* one with no licensing information in it,
 - file was a */uapi/* one with existing licensing information,

Further patches will be generated in subsequent months to fix up cases
where non-standard license headers were used, and references to license
had to be inferred by heuristics based on keywords.

The analysis to determine which SPDX License Identifier to be applied to
a file was done in a spreadsheet of side by side results from of the
output of two independent scanners (ScanCode & Windriver) producing SPDX
tag:value files created by Philippe Ombredanne.  Philippe prepared the
base worksheet, and did an initial spot review of a few 1000 files.

The 4.13 kernel was the starting point of the analysis with 60,537 files
assessed.  Kate Stewart did a file by file comparison of the scanner
results in the spreadsheet to determine which SPDX license identifier(s)
to be applied to the file. She confirmed any determination that was not
immediately clear with lawyers working with the Linux Foundation.

Criteria used to select files for SPDX license identifier tagging was:
 - Files considered eligible had to be source code files.
 - Make and config files were included as candidates if they contained >5
   lines of source
 - File already had some variant of a license header in it (even if <5
   lines).

All documentation files were explicitly excluded.

The following heuristics were used to determine which SPDX license
identifiers to apply.

 - when both scanners couldn't find any license traces, file was
   considered to have no license information in it, and the top level
   COPYING file license applied.

   For non */uapi/* files that summary was:

   SPDX license identifier                            # files
   ---------------------------------------------------|-------
   GPL-2.0                                              11139

   and resulted in the first patch in this series.

   If that file was a */uapi/* path one, it was "GPL-2.0 WITH
   Linux-syscall-note" otherwise it was "GPL-2.0".  Results of that was:

   SPDX license identifier                            # files
   ---------------------------------------------------|-------
   GPL-2.0 WITH Linux-syscall-note                        930

   and resulted in the second patch in this series.

 - if a file had some form of licensing information in it, and was one
   of the */uapi/* ones, it was denoted with the Linux-syscall-note if
   any GPL family license was found in the file or had no licensing in
   it (per prior point).  Results summary:

   SPDX license identifier                            # files
   ---------------------------------------------------|------
   GPL-2.0 WITH Linux-syscall-note                       270
   GPL-2.0+ WITH Linux-syscall-note                      169
   ((GPL-2.0 WITH Linux-syscall-note) OR BSD-2-Clause)    21
   ((GPL-2.0 WITH Linux-syscall-note) OR BSD-3-Clause)    17
   LGPL-2.1+ WITH Linux-syscall-note                      15
   GPL-1.0+ WITH Linux-syscall-note                       14
   ((GPL-2.0+ WITH Linux-syscall-note) OR BSD-3-Clause)    5
   LGPL-2.0+ WITH Linux-syscall-note                       4
   LGPL-2.1 WITH Linux-syscall-note                        3
   ((GPL-2.0 WITH Linux-syscall-note) OR MIT)              3
   ((GPL-2.0 WITH Linux-syscall-note) AND MIT)             1

   and that resulted in the third patch in this series.

 - when the two scanners agreed on the detected license(s), that became
   the concluded license(s).

 - when there was disagreement between the two scanners (one detected a
   license but the other didn't, or they both detected different
   licenses) a manual inspection of the file occurred.

 - In most cases a manual inspection of the information in the file
   resulted in a clear resolution of the license that should apply (and
   which scanner probably needed to revisit its heuristics).

 - When it was not immediately clear, the license identifier was
   confirmed with lawyers working with the Linux Foundation.

 - If there was any question as to the appropriate license identifier,
   the file was flagged for further research and to be revisited later
   in time.

In total, over 70 hours of logged manual review was done on the
spreadsheet to determine the SPDX license identifiers to apply to the
source files by Kate, Philippe, Thomas and, in some cases, confirmation
by lawyers working with the Linux Foundation.

Kate also obtained a third independent scan of the 4.13 code base from
FOSSology, and compared selected files where the other two scanners
disagreed against that SPDX file, to see if there was new insights.  The
Windriver scanner is based on an older version of FOSSology in part, so
they are related.

Thomas did random spot checks in about 500 files from the spreadsheets
for the uapi headers and agreed with SPDX license identifier in the
files he inspected. For the non-uapi files Thomas did random spot checks
in about 15000 files.

In initial set of patches against 4.14-rc6, 3 files were found to have
copy/paste license identifier errors, and have been fixed to reflect the
correct identifier.

Additionally Philippe spent 10 hours this week doing a detailed manual
inspection and review of the 12,461 patched files from the initial patch
version early this week with:
 - a full scancode scan run, collecting the matched texts, detected
   license ids and scores
 - reviewing anything where there was a license detected (about 500+
   files) to ensure that the applied SPDX license was correct
 - reviewing anything where there was no detection but the patch license
   was not GPL-2.0 WITH Linux-syscall-note to ensure that the applied
   SPDX license was correct

This produced a worksheet with 20 files needing minor correction.  This
worksheet was then exported into 3 different .csv files for the
different types of files to be modified.

These .csv files were then reviewed by Greg.  Thomas wrote a script to
parse the csv files and add the proper SPDX tag to the file, in the
format that the file expected.  This script was further refined by Greg
based on the output to detect more types of files automatically and to
distinguish between header and source .c files (which need different
comment types.)  Finally Greg ran the script using the .csv files to
generate the patches.

Reviewed-by: Kate Stewart <kstewart@linuxfoundation.org>
Reviewed-by: Philippe Ombredanne <pombredanne@nexb.com>
Reviewed-by: Thomas Gleixner <tglx@linutronix.de>
Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
2017-11-02 11:10:55 +01:00
Michal Hocko
3072e413e3 mm/memory_hotplug: introduce add_pages
There are new users of memory hotplug emerging.  Some of them require
different subset of arch_add_memory.  There are some which only require
allocation of struct pages without mapping those pages to the kernel
address space.  We currently have __add_pages for that purpose.  But this
is rather lowlevel and not very suitable for the code outside of the
memory hotplug.  E.g.  x86_64 wants to update max_pfn which should be done
by the caller.  Introduce add_pages() which should care about those
details if they are needed.  Each architecture should define its
implementation and select CONFIG_ARCH_HAS_ADD_PAGES.  All others use the
currently existing __add_pages.

Link: http://lkml.kernel.org/r/20170817000548.32038-7-jglisse@redhat.com
Signed-off-by: Michal Hocko <mhocko@suse.com>
Signed-off-by: Jérôme Glisse <jglisse@redhat.com>
Acked-by: Balbir Singh <bsingharora@gmail.com>
Cc: Aneesh Kumar <aneesh.kumar@linux.vnet.ibm.com>
Cc: Benjamin Herrenschmidt <benh@kernel.crashing.org>
Cc: Dan Williams <dan.j.williams@intel.com>
Cc: David Nellans <dnellans@nvidia.com>
Cc: Evgeny Baskakov <ebaskakov@nvidia.com>
Cc: Johannes Weiner <hannes@cmpxchg.org>
Cc: John Hubbard <jhubbard@nvidia.com>
Cc: Kirill A. Shutemov <kirill.shutemov@linux.intel.com>
Cc: Mark Hairgrove <mhairgrove@nvidia.com>
Cc: Paul E. McKenney <paulmck@linux.vnet.ibm.com>
Cc: Ross Zwisler <ross.zwisler@linux.intel.com>
Cc: Sherry Cheung <SCheung@nvidia.com>
Cc: Subhash Gutti <sgutti@nvidia.com>
Cc: Vladimir Davydov <vdavydov.dev@gmail.com>
Cc: Bob Liu <liubo95@huawei.com>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
2017-09-08 18:26:46 -07:00
Michal Hocko
e5e6893026 mm, memory_hotplug: display allowed zones in the preferred ordering
Prior to commit f1dd2cd13c ("mm, memory_hotplug: do not associate
hotadded memory to zones until online") we used to allow to change the
valid zone types of a memory block if it is adjacent to a different zone
type.

This fact was reflected in memoryNN/valid_zones by the ordering of
printed zones.  The first one was default (echo online > memoryNN/state)
and the other one could be onlined explicitly by online_{movable,kernel}.

This behavior was removed by the said patch and as such the ordering was
not all that important.  In most cases a kernel zone would be default
anyway.  The only exception is movable_node handled by "mm,
memory_hotplug: support movable_node for hotpluggable nodes".

Let's reintroduce this behavior again because later patch will remove
the zone overlap restriction and so user will be allowed to online
kernel resp.  movable block regardless of its placement.  Original
behavior will then become significant again because it would be
non-trivial for users to see what is the default zone to online into.

Implementation is really simple.  Pull out zone selection out of
move_pfn_range into zone_for_pfn_range helper and use it in
show_valid_zones to display the zone for default onlining and then both
kernel and movable if they are allowed.  Default online zone is not
duplicated.

Link: http://lkml.kernel.org/r/20170714121233.16861-2-mhocko@kernel.org
Signed-off-by: Michal Hocko <mhocko@suse.com>
Acked-by: Joonsoo Kim <iamjoonsoo.kim@lge.com>
Acked-by: Vlastimil Babka <vbabka@suse.cz>
Cc: Mel Gorman <mgorman@suse.de>
Cc: Andrea Arcangeli <aarcange@redhat.com>
Cc: Reza Arbab <arbab@linux.vnet.ibm.com>
Cc: Yasuaki Ishimatsu <yasu.isimatu@gmail.com>
Cc: Xishi Qiu <qiuxishi@huawei.com>
Cc: Kani Toshimitsu <toshi.kani@hpe.com>
Cc: <slaoub@gmail.com>
Cc: Daniel Kiper <daniel.kiper@oracle.com>
Cc: Igor Mammedov <imammedo@redhat.com>
Cc: Vitaly Kuznetsov <vkuznets@redhat.com>
Cc: Wei Yang <richard.weiyang@gmail.com>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
2017-09-06 17:27:25 -07:00
Michal Hocko
4932381ee2 mm, memory_hotplug: move movable_node to the hotplug proper
movable_node_is_enabled is defined in memblock proper while it is
initialized from the memory hotplug proper.  This is quite messy and it
makes a dependency between the two so move movable_node along with the
helper functions to memory_hotplug.

To make it more entertaining the kernel parameter is ignored unless
CONFIG_HAVE_MEMBLOCK_NODE_MAP=y because we do not have the node
information for each memblock otherwise.  So let's warn when the option
is disabled.

Link: http://lkml.kernel.org/r/20170529114141.536-4-mhocko@kernel.org
Signed-off-by: Michal Hocko <mhocko@suse.com>
Acked-by: Reza Arbab <arbab@linux.vnet.ibm.com>
Acked-by: Vlastimil Babka <vbabka@suse.cz>
Cc: Mel Gorman <mgorman@suse.de>
Cc: Andrea Arcangeli <aarcange@redhat.com>
Cc: Jerome Glisse <jglisse@redhat.com>
Cc: Yasuaki Ishimatsu <yasu.isimatu@gmail.com>
Cc: Xishi Qiu <qiuxishi@huawei.com>
Cc: Kani Toshimitsu <toshi.kani@hpe.com>
Cc: Chen Yucong <slaoub@gmail.com>
Cc: Joonsoo Kim <js1304@gmail.com>
Cc: Andi Kleen <ak@linux.intel.com>
Cc: David Rientjes <rientjes@google.com>
Cc: Daniel Kiper <daniel.kiper@oracle.com>
Cc: Igor Mammedov <imammedo@redhat.com>
Cc: Vitaly Kuznetsov <vkuznets@redhat.com>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
2017-07-06 16:24:35 -07:00
Michal Hocko
559bfc7d1b mm, memory_hotplug: remove unused cruft after memory hotplug rework
zone_for_memory doesn't have any user anymore as well as the whole zone
shifting infrastructure so drop them all.

This shouldn't introduce any functional changes.

Link: http://lkml.kernel.org/r/20170515085827.16474-15-mhocko@kernel.org
Signed-off-by: Michal Hocko <mhocko@suse.com>
Acked-by: Vlastimil Babka <vbabka@suse.cz>
Cc: Andi Kleen <ak@linux.intel.com>
Cc: Andrea Arcangeli <aarcange@redhat.com>
Cc: Balbir Singh <bsingharora@gmail.com>
Cc: Dan Williams <dan.j.williams@intel.com>
Cc: Daniel Kiper <daniel.kiper@oracle.com>
Cc: David Rientjes <rientjes@google.com>
Cc: Heiko Carstens <heiko.carstens@de.ibm.com>
Cc: Igor Mammedov <imammedo@redhat.com>
Cc: Jerome Glisse <jglisse@redhat.com>
Cc: Joonsoo Kim <js1304@gmail.com>
Cc: Martin Schwidefsky <schwidefsky@de.ibm.com>
Cc: Mel Gorman <mgorman@suse.de>
Cc: Reza Arbab <arbab@linux.vnet.ibm.com>
Cc: Tobias Regnery <tobias.regnery@gmail.com>
Cc: Toshi Kani <toshi.kani@hpe.com>
Cc: Vitaly Kuznetsov <vkuznets@redhat.com>
Cc: Xishi Qiu <qiuxishi@huawei.com>
Cc: Yasuaki Ishimatsu <isimatu.yasuaki@jp.fujitsu.com>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
2017-07-06 16:24:32 -07:00
Michal Hocko
3d79a728f9 mm, memory_hotplug: replace for_device by want_memblock in arch_add_memory
arch_add_memory gets for_device argument which then controls whether we
want to create memblocks for created memory sections.  Simplify the
logic by telling whether we want memblocks directly rather than going
through pointless negation.  This also makes the api easier to
understand because it is clear what we want rather than nothing telling
for_device which can mean anything.

This shouldn't introduce any functional change.

Link: http://lkml.kernel.org/r/20170515085827.16474-13-mhocko@kernel.org
Signed-off-by: Michal Hocko <mhocko@suse.com>
Tested-by: Dan Williams <dan.j.williams@intel.com>
Acked-by: Vlastimil Babka <vbabka@suse.cz>
Cc: Andi Kleen <ak@linux.intel.com>
Cc: Andrea Arcangeli <aarcange@redhat.com>
Cc: Balbir Singh <bsingharora@gmail.com>
Cc: Daniel Kiper <daniel.kiper@oracle.com>
Cc: David Rientjes <rientjes@google.com>
Cc: Heiko Carstens <heiko.carstens@de.ibm.com>
Cc: Igor Mammedov <imammedo@redhat.com>
Cc: Jerome Glisse <jglisse@redhat.com>
Cc: Joonsoo Kim <js1304@gmail.com>
Cc: Martin Schwidefsky <schwidefsky@de.ibm.com>
Cc: Mel Gorman <mgorman@suse.de>
Cc: Reza Arbab <arbab@linux.vnet.ibm.com>
Cc: Tobias Regnery <tobias.regnery@gmail.com>
Cc: Toshi Kani <toshi.kani@hpe.com>
Cc: Vitaly Kuznetsov <vkuznets@redhat.com>
Cc: Xishi Qiu <qiuxishi@huawei.com>
Cc: Yasuaki Ishimatsu <isimatu.yasuaki@jp.fujitsu.com>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
2017-07-06 16:24:32 -07:00
Michal Hocko
c246a213f5 mm, memory_hotplug: do not assume ZONE_NORMAL is default kernel zone
Heiko Carstens has noticed that he can generate overlapping zones for
ZONE_DMA and ZONE_NORMAL:

  DMA      [mem 0x0000000000000000-0x000000007fffffff]
  Normal   [mem 0x0000000080000000-0x000000017fffffff]

  $ cat /sys/devices/system/memory/block_size_bytes
  10000000
  $ cat /sys/devices/system/memory/memory5/valid_zones
  DMA
  $ echo 0 > /sys/devices/system/memory/memory5/online
  $ cat /sys/devices/system/memory/memory5/valid_zones
  Normal
  $ echo 1 > /sys/devices/system/memory/memory5/online
  Normal

  $ cat /proc/zoneinfo
  Node 0, zone      DMA
  spanned  524288        <-----
  present  458752
  managed  455078
  start_pfn:           0 <-----

  Node 0, zone   Normal
  spanned  720896
  present  589824
  managed  571648
  start_pfn:           327680 <-----

The reason is that we assume that the default zone for kernel onlining
is ZONE_NORMAL.  This was a simplification introduced by the memory
hotplug rework and it is easily fixable by checking the range overlap in
the zone order and considering the first matching zone as the default
one.  If there is no such zone then assume ZONE_NORMAL as we have been
doing so far.

Fixes: "mm, memory_hotplug: do not associate hotadded memory to zones until online"
Link: http://lkml.kernel.org/r/20170601083746.4924-3-mhocko@kernel.org
Signed-off-by: Michal Hocko <mhocko@suse.com>
Reported-by: Heiko Carstens <heiko.carstens@de.ibm.com>
Tested-by: Heiko Carstens <heiko.carstens@de.ibm.com>
Acked-by: Vlastimil Babka <vbabka@suse.cz>
Cc: Dan Williams <dan.j.williams@intel.com>
Cc: Reza Arbab <arbab@linux.vnet.ibm.com>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
2017-07-06 16:24:32 -07:00
Michal Hocko
f1dd2cd13c mm, memory_hotplug: do not associate hotadded memory to zones until online
The current memory hotplug implementation relies on having all the
struct pages associate with a zone/node during the physical hotplug
phase (arch_add_memory->__add_pages->__add_section->__add_zone).  In the
vast majority of cases this means that they are added to ZONE_NORMAL.
This has been so since 9d99aaa31f ("[PATCH] x86_64: Support memory
hotadd without sparsemem") and it wasn't a big deal back then because
movable onlining didn't exist yet.

Much later memory hotplug wanted to (ab)use ZONE_MOVABLE for movable
onlining 511c2aba8f ("mm, memory-hotplug: dynamic configure movable
memory and portion memory") and then things got more complicated.
Rather than reconsidering the zone association which was no longer
needed (because the memory hotplug already depended on SPARSEMEM) a
convoluted semantic of zone shifting has been developed.  Only the
currently last memblock or the one adjacent to the zone_movable can be
onlined movable.  This essentially means that the online type changes as
the new memblocks are added.

Let's simulate memory hot online manually
  $ echo 0x100000000 > /sys/devices/system/memory/probe
  $ grep . /sys/devices/system/memory/memory32/valid_zones
  Normal Movable

  $ echo $((0x100000000+(128<<20))) > /sys/devices/system/memory/probe
  $ grep . /sys/devices/system/memory/memory3?/valid_zones
  /sys/devices/system/memory/memory32/valid_zones:Normal
  /sys/devices/system/memory/memory33/valid_zones:Normal Movable

  $ echo $((0x100000000+2*(128<<20))) > /sys/devices/system/memory/probe
  $ grep . /sys/devices/system/memory/memory3?/valid_zones
  /sys/devices/system/memory/memory32/valid_zones:Normal
  /sys/devices/system/memory/memory33/valid_zones:Normal
  /sys/devices/system/memory/memory34/valid_zones:Normal Movable

  $ echo online_movable > /sys/devices/system/memory/memory34/state
  $ grep . /sys/devices/system/memory/memory3?/valid_zones
  /sys/devices/system/memory/memory32/valid_zones:Normal
  /sys/devices/system/memory/memory33/valid_zones:Normal Movable
  /sys/devices/system/memory/memory34/valid_zones:Movable Normal

This is an awkward semantic because an udev event is sent as soon as the
block is onlined and an udev handler might want to online it based on
some policy (e.g.  association with a node) but it will inherently race
with new blocks showing up.

This patch changes the physical online phase to not associate pages with
any zone at all.  All the pages are just marked reserved and wait for
the onlining phase to be associated with the zone as per the online
request.  There are only two requirements

	- existing ZONE_NORMAL and ZONE_MOVABLE cannot overlap

	- ZONE_NORMAL precedes ZONE_MOVABLE in physical addresses

the latter one is not an inherent requirement and can be changed in the
future.  It preserves the current behavior and made the code slightly
simpler.  This is subject to change in future.

This means that the same physical online steps as above will lead to the
following state: Normal Movable

  /sys/devices/system/memory/memory32/valid_zones:Normal Movable
  /sys/devices/system/memory/memory33/valid_zones:Normal Movable

  /sys/devices/system/memory/memory32/valid_zones:Normal Movable
  /sys/devices/system/memory/memory33/valid_zones:Normal Movable
  /sys/devices/system/memory/memory34/valid_zones:Normal Movable

  /sys/devices/system/memory/memory32/valid_zones:Normal Movable
  /sys/devices/system/memory/memory33/valid_zones:Normal Movable
  /sys/devices/system/memory/memory34/valid_zones:Movable

Implementation:
The current move_pfn_range is reimplemented to check the above
requirements (allow_online_pfn_range) and then updates the respective
zone (move_pfn_range_to_zone), the pgdat and links all the pages in the
pfn range with the zone/node.  __add_pages is updated to not require the
zone and only initializes sections in the range.  This allowed to
simplify the arch_add_memory code (s390 could get rid of quite some of
code).

devm_memremap_pages is the only user of arch_add_memory which relies on
the zone association because it only hooks into the memory hotplug only
half way.  It uses it to associate the new memory with ZONE_DEVICE but
doesn't allow it to be {on,off}lined via sysfs.  This means that this
particular code path has to call move_pfn_range_to_zone explicitly.

The original zone shifting code is kept in place and will be removed in
the follow up patch for an easier review.

Please note that this patch also changes the original behavior when
offlining a memory block adjacent to another zone (Normal vs.  Movable)
used to allow to change its movable type.  This will be handled later.

[richard.weiyang@gmail.com: simplify zone_intersects()]
  Link: http://lkml.kernel.org/r/20170616092335.5177-1-richard.weiyang@gmail.com
[richard.weiyang@gmail.com: remove duplicate call for set_page_links]
  Link: http://lkml.kernel.org/r/20170616092335.5177-2-richard.weiyang@gmail.com
[akpm@linux-foundation.org: remove unused local `i']
Link: http://lkml.kernel.org/r/20170515085827.16474-12-mhocko@kernel.org
Signed-off-by: Michal Hocko <mhocko@suse.com>
Signed-off-by: Wei Yang <richard.weiyang@gmail.com>
Tested-by: Dan Williams <dan.j.williams@intel.com>
Tested-by: Reza Arbab <arbab@linux.vnet.ibm.com>
Acked-by: Heiko Carstens <heiko.carstens@de.ibm.com> # For s390 bits
Acked-by: Vlastimil Babka <vbabka@suse.cz>
Cc: Martin Schwidefsky <schwidefsky@de.ibm.com>
Cc: Andi Kleen <ak@linux.intel.com>
Cc: Andrea Arcangeli <aarcange@redhat.com>
Cc: Balbir Singh <bsingharora@gmail.com>
Cc: Daniel Kiper <daniel.kiper@oracle.com>
Cc: David Rientjes <rientjes@google.com>
Cc: Igor Mammedov <imammedo@redhat.com>
Cc: Jerome Glisse <jglisse@redhat.com>
Cc: Joonsoo Kim <js1304@gmail.com>
Cc: Mel Gorman <mgorman@suse.de>
Cc: Tobias Regnery <tobias.regnery@gmail.com>
Cc: Toshi Kani <toshi.kani@hpe.com>
Cc: Vitaly Kuznetsov <vkuznets@redhat.com>
Cc: Xishi Qiu <qiuxishi@huawei.com>
Cc: Yasuaki Ishimatsu <isimatu.yasuaki@jp.fujitsu.com>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
2017-07-06 16:24:32 -07:00
Michal Hocko
2d070eab2e mm: consider zone which is not fully populated to have holes
__pageblock_pfn_to_page has two users currently, set_zone_contiguous
which checks whether the given zone contains holes and
pageblock_pfn_to_page which then carefully returns a first valid page
from the given pfn range for the given zone.  This doesn't handle zones
which are not fully populated though.  Memory pageblocks can be offlined
or might not have been onlined yet.  In such a case the zone should be
considered to have holes otherwise pfn walkers can touch and play with
offline pages.

Current callers of pageblock_pfn_to_page in compaction seem to work
properly right now because they only isolate PageBuddy
(isolate_freepages_block) or PageLRU resp.  __PageMovable
(isolate_migratepages_block) which will be always false for these pages.
It would be safer to skip these pages altogether, though.

In order to do this patch adds a new memory section state
(SECTION_IS_ONLINE) which is set in memory_present (during boot time) or
in online_pages_range during the memory hotplug.  Similarly
offline_mem_sections clears the bit and it is called when the memory
range is offlined.

pfn_to_online_page helper is then added which check the mem section and
only returns a page if it is onlined already.

Use the new helper in __pageblock_pfn_to_page and skip the whole page
block in such a case.

[mhocko@suse.com: check valid section number in pfn_to_online_page (Vlastimil),
 mark sections online after all struct pages are initialized in
 online_pages_range (Vlastimil)]
  Link: http://lkml.kernel.org/r/20170518164210.GD18333@dhcp22.suse.cz
Link: http://lkml.kernel.org/r/20170515085827.16474-8-mhocko@kernel.org
Signed-off-by: Michal Hocko <mhocko@suse.com>
Acked-by: Vlastimil Babka <vbabka@suse.cz>
Cc: Andi Kleen <ak@linux.intel.com>
Cc: Andrea Arcangeli <aarcange@redhat.com>
Cc: Balbir Singh <bsingharora@gmail.com>
Cc: Dan Williams <dan.j.williams@intel.com>
Cc: Daniel Kiper <daniel.kiper@oracle.com>
Cc: David Rientjes <rientjes@google.com>
Cc: Heiko Carstens <heiko.carstens@de.ibm.com>
Cc: Igor Mammedov <imammedo@redhat.com>
Cc: Jerome Glisse <jglisse@redhat.com>
Cc: Joonsoo Kim <js1304@gmail.com>
Cc: Martin Schwidefsky <schwidefsky@de.ibm.com>
Cc: Mel Gorman <mgorman@suse.de>
Cc: Reza Arbab <arbab@linux.vnet.ibm.com>
Cc: Tobias Regnery <tobias.regnery@gmail.com>
Cc: Toshi Kani <toshi.kani@hpe.com>
Cc: Vitaly Kuznetsov <vkuznets@redhat.com>
Cc: Xishi Qiu <qiuxishi@huawei.com>
Cc: Yasuaki Ishimatsu <isimatu.yasuaki@jp.fujitsu.com>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
2017-07-06 16:24:32 -07:00
Michal Hocko
1b862aecfb mm, memory_hotplug: get rid of is_zone_device_section
Device memory hotplug hooks into regular memory hotplug only half way.
It needs memory sections to track struct pages but there is no
need/desire to associate those sections with memory blocks and export
them to the userspace via sysfs because they cannot be onlined anyway.

This is currently expressed by for_device argument to arch_add_memory
which then makes sure to associate the given memory range with
ZONE_DEVICE.  register_new_memory then relies on is_zone_device_section
to distinguish special memory hotplug from the regular one.  While this
works now, later patches in this series want to move __add_zone outside
of arch_add_memory path so we have to come up with something else.

Add want_memblock down the __add_pages path and use it to control
whether the section->memblock association should be done.
arch_add_memory then just trivially want memblock for everything but
for_device hotplug.

remove_memory_section doesn't need is_zone_device_section either.  We
can simply skip all the memblock specific cleanup if there is no
memblock for the given section.

This shouldn't introduce any functional change.

Link: http://lkml.kernel.org/r/20170515085827.16474-5-mhocko@kernel.org
Signed-off-by: Michal Hocko <mhocko@suse.com>
Tested-by: Dan Williams <dan.j.williams@intel.com>
Acked-by: Vlastimil Babka <vbabka@suse.cz>
Cc: Andi Kleen <ak@linux.intel.com>
Cc: Andrea Arcangeli <aarcange@redhat.com>
Cc: Balbir Singh <bsingharora@gmail.com>
Cc: Daniel Kiper <daniel.kiper@oracle.com>
Cc: David Rientjes <rientjes@google.com>
Cc: Heiko Carstens <heiko.carstens@de.ibm.com>
Cc: Igor Mammedov <imammedo@redhat.com>
Cc: Jerome Glisse <jglisse@redhat.com>
Cc: Joonsoo Kim <js1304@gmail.com>
Cc: Martin Schwidefsky <schwidefsky@de.ibm.com>
Cc: Mel Gorman <mgorman@suse.de>
Cc: Reza Arbab <arbab@linux.vnet.ibm.com>
Cc: Tobias Regnery <tobias.regnery@gmail.com>
Cc: Toshi Kani <toshi.kani@hpe.com>
Cc: Vitaly Kuznetsov <vkuznets@redhat.com>
Cc: Xishi Qiu <qiuxishi@huawei.com>
Cc: Yasuaki Ishimatsu <isimatu.yasuaki@jp.fujitsu.com>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
2017-07-06 16:24:32 -07:00
Toshi Kani
a96dfddbcc base/memory, hotplug: fix a kernel oops in show_valid_zones()
Reading a sysfs "memoryN/valid_zones" file leads to the following oops
when the first page of a range is not backed by struct page.
show_valid_zones() assumes that 'start_pfn' is always valid for
page_zone().

 BUG: unable to handle kernel paging request at ffffea017a000000
 IP: show_valid_zones+0x6f/0x160

This issue may happen on x86-64 systems with 64GiB or more memory since
their memory block size is bumped up to 2GiB.  [1] An example of such
systems is desribed below.  0x3240000000 is only aligned by 1GiB and
this memory block starts from 0x3200000000, which is not backed by
struct page.

 BIOS-e820: [mem 0x0000003240000000-0x000000603fffffff] usable

Since test_pages_in_a_zone() already checks holes, fix this issue by
extending this function to return 'valid_start' and 'valid_end' for a
given range.  show_valid_zones() then proceeds with the valid range.

[1] 'Commit bdee237c03 ("x86: mm: Use 2GB memory block size on
    large-memory x86-64 systems")'

Link: http://lkml.kernel.org/r/20170127222149.30893-3-toshi.kani@hpe.com
Signed-off-by: Toshi Kani <toshi.kani@hpe.com>
Cc: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
Cc: Zhang Zhen <zhenzhang.zhang@huawei.com>
Cc: Reza Arbab <arbab@linux.vnet.ibm.com>
Cc: David Rientjes <rientjes@google.com>
Cc: Dan Williams <dan.j.williams@intel.com>
Cc: <stable@vger.kernel.org>	[4.4+]

Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
2017-02-03 14:13:19 -08:00
Yasuaki Ishimatsu
8a1f780e7f memory_hotplug: make zone_can_shift() return a boolean value
online_{kernel|movable} is used to change the memory zone to
ZONE_{NORMAL|MOVABLE} and online the memory.

To check that memory zone can be changed, zone_can_shift() is used.
Currently the function returns minus integer value, plus integer
value and 0. When the function returns minus or plus integer value,
it means that the memory zone can be changed to ZONE_{NORNAL|MOVABLE}.

But when the function returns 0, there are two meanings.

One of the meanings is that the memory zone does not need to be changed.
For example, when memory is in ZONE_NORMAL and onlined by online_kernel
the memory zone does not need to be changed.

Another meaning is that the memory zone cannot be changed. When memory
is in ZONE_NORMAL and onlined by online_movable, the memory zone may
not be changed to ZONE_MOVALBE due to memory online limitation(see
Documentation/memory-hotplug.txt). In this case, memory must not be
onlined.

The patch changes the return type of zone_can_shift() so that memory
online operation fails when memory zone cannot be changed as follows:

Before applying patch:
   # grep -A 35 "Node 2" /proc/zoneinfo
   Node 2, zone   Normal
   <snip>
      node_scanned  0
           spanned  8388608
           present  7864320
           managed  7864320
   # echo online_movable > memory4097/state
   # grep -A 35 "Node 2" /proc/zoneinfo
   Node 2, zone   Normal
   <snip>
      node_scanned  0
           spanned  8388608
           present  8388608
           managed  8388608

   online_movable operation succeeded. But memory is onlined as
   ZONE_NORMAL, not ZONE_MOVABLE.

After applying patch:
   # grep -A 35 "Node 2" /proc/zoneinfo
   Node 2, zone   Normal
   <snip>
      node_scanned  0
           spanned  8388608
           present  7864320
           managed  7864320
   # echo online_movable > memory4097/state
   bash: echo: write error: Invalid argument
   # grep -A 35 "Node 2" /proc/zoneinfo
   Node 2, zone   Normal
   <snip>
      node_scanned  0
           spanned  8388608
           present  7864320
           managed  7864320

   online_movable operation failed because of failure of changing
   the memory zone from ZONE_NORMAL to ZONE_MOVABLE

Fixes: df429ac039 ("memory-hotplug: more general validation of zone during online")
Link: http://lkml.kernel.org/r/2f9c3837-33d7-b6e5-59c0-6ca4372b2d84@gmail.com
Signed-off-by: Yasuaki Ishimatsu <isimatu.yasuaki@jp.fujitsu.com>
Reviewed-by: Reza Arbab <arbab@linux.vnet.ibm.com>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
2017-01-24 16:26:14 -08:00
Reza Arbab
df429ac039 memory-hotplug: more general validation of zone during online
When memory is onlined, we are only able to rezone from ZONE_MOVABLE to
ZONE_KERNEL, or from (ZONE_MOVABLE - 1) to ZONE_MOVABLE.

To be more flexible, use the following criteria instead; to online
memory from zone X into zone Y,

* Any zones between X and Y must be unused.
* If X is lower than Y, the onlined memory must lie at the end of X.
* If X is higher than Y, the onlined memory must lie at the start of X.

Add zone_can_shift() to make this determination.

Link: http://lkml.kernel.org/r/1462816419-4479-3-git-send-email-arbab@linux.vnet.ibm.com
Signed-off-by: Reza Arbab <arbab@linux.vnet.ibm.com>
Reviewd-by: Yasuaki Ishimatsu <isimatu.yasuaki@jp.fujitsu.com>
Cc: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
Cc: Daniel Kiper <daniel.kiper@oracle.com>
Cc: Dan Williams <dan.j.williams@intel.com>
Cc: Vlastimil Babka <vbabka@suse.cz>
Cc: Tang Chen <tangchen@cn.fujitsu.com>
Cc: Joonsoo Kim <iamjoonsoo.kim@lge.com>
Cc: David Vrabel <david.vrabel@citrix.com>
Cc: Vitaly Kuznetsov <vkuznets@redhat.com>
Cc: David Rientjes <rientjes@google.com>
Cc: Andrew Banman <abanman@sgi.com>
Cc: Chen Yucong <slaoub@gmail.com>
Cc: Yasunori Goto <y-goto@jp.fujitsu.com>
Cc: Zhang Zhen <zhenzhang.zhang@huawei.com>
Cc: Shaohua Li <shaohua.li@intel.com>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
2016-07-26 16:19:19 -07:00
Linus Torvalds
7ded384a12 mm: fix section mismatch warning
The register_page_bootmem_info_node() function needs to be marked __init
in order to avoid a new warning introduced by commit f65e91df25 ("mm:
use early_pfn_to_nid in register_page_bootmem_info_node").

Otherwise you'll get a warning about how a non-init function calls
early_pfn_to_nid (which is __meminit)

Cc: Yang Shi <yang.shi@linaro.org>
Cc: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
2016-05-27 15:23:32 -07:00
Yaowei Bai
c98940f6fa mm/memory_hotplug: is_mem_section_removable() can return bool
Make is_mem_section_removable() return bool to improve readability due
to this particular function only using either one or zero as its return
value.

Signed-off-by: Yaowei Bai <baiyaowei@cmss.chinamobile.com>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
2016-05-19 19:12:14 -07:00
Joonsoo Kim
7cf91a98e6 mm/compaction: speed up pageblock_pfn_to_page() when zone is contiguous
There is a performance drop report due to hugepage allocation and in
there half of cpu time are spent on pageblock_pfn_to_page() in
compaction [1].

In that workload, compaction is triggered to make hugepage but most of
pageblocks are un-available for compaction due to pageblock type and
skip bit so compaction usually fails.  Most costly operations in this
case is to find valid pageblock while scanning whole zone range.  To
check if pageblock is valid to compact, valid pfn within pageblock is
required and we can obtain it by calling pageblock_pfn_to_page().  This
function checks whether pageblock is in a single zone and return valid
pfn if possible.  Problem is that we need to check it every time before
scanning pageblock even if we re-visit it and this turns out to be very
expensive in this workload.

Although we have no way to skip this pageblock check in the system where
hole exists at arbitrary position, we can use cached value for zone
continuity and just do pfn_to_page() in the system where hole doesn't
exist.  This optimization considerably speeds up in above workload.

Before vs After
  Max: 1096 MB/s vs 1325 MB/s
  Min: 635 MB/s 1015 MB/s
  Avg: 899 MB/s 1194 MB/s

Avg is improved by roughly 30% [2].

[1]: http://www.spinics.net/lists/linux-mm/msg97378.html
[2]: https://lkml.org/lkml/2015/12/9/23

[akpm@linux-foundation.org: don't forget to restore zone->contiguous on error path, per Vlastimil]
Signed-off-by: Joonsoo Kim <iamjoonsoo.kim@lge.com>
Reported-by: Aaron Lu <aaron.lu@intel.com>
Acked-by: Vlastimil Babka <vbabka@suse.cz>
Tested-by: Aaron Lu <aaron.lu@intel.com>
Cc: Mel Gorman <mgorman@suse.de>
Cc: Rik van Riel <riel@redhat.com>
Cc: David Rientjes <rientjes@google.com>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
2016-03-15 16:55:16 -07:00
Vitaly Kuznetsov
31bc3858ea memory-hotplug: add automatic onlining policy for the newly added memory
Currently, all newly added memory blocks remain in 'offline' state
unless someone onlines them, some linux distributions carry special udev
rules like:

  SUBSYSTEM=="memory", ACTION=="add", ATTR{state}=="offline", ATTR{state}="online"

to make this happen automatically.  This is not a great solution for
virtual machines where memory hotplug is being used to address high
memory pressure situations as such onlining is slow and a userspace
process doing this (udev) has a chance of being killed by the OOM killer
as it will probably require to allocate some memory.

Introduce default policy for the newly added memory blocks in
/sys/devices/system/memory/auto_online_blocks file with two possible
values: "offline" which preserves the current behavior and "online"
which causes all newly added memory blocks to go online as soon as
they're added.  The default is "offline".

Signed-off-by: Vitaly Kuznetsov <vkuznets@redhat.com>
Reviewed-by: Daniel Kiper <daniel.kiper@oracle.com>
Cc: Jonathan Corbet <corbet@lwn.net>
Cc: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
Cc: Daniel Kiper <daniel.kiper@oracle.com>
Cc: Dan Williams <dan.j.williams@intel.com>
Cc: Tang Chen <tangchen@cn.fujitsu.com>
Cc: David Vrabel <david.vrabel@citrix.com>
Acked-by: David Rientjes <rientjes@google.com>
Cc: Naoya Horiguchi <n-horiguchi@ah.jp.nec.com>
Cc: Xishi Qiu <qiuxishi@huawei.com>
Cc: Mel Gorman <mgorman@techsingularity.net>
Cc: "K. Y. Srinivasan" <kys@microsoft.com>
Cc: Igor Mammedov <imammedo@redhat.com>
Cc: Kay Sievers <kay@vrfy.org>
Cc: Konrad Rzeszutek Wilk <konrad.wilk@oracle.com>
Cc: Boris Ostrovsky <boris.ostrovsky@oracle.com>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
2016-03-15 16:55:16 -07:00
Dan Williams
4b94ffdc41 x86, mm: introduce vmem_altmap to augment vmemmap_populate()
In support of providing struct page for large persistent memory
capacities, use struct vmem_altmap to change the default policy for
allocating memory for the memmap array.  The default vmemmap_populate()
allocates page table storage area from the page allocator.  Given
persistent memory capacities relative to DRAM it may not be feasible to
store the memmap in 'System Memory'.  Instead vmem_altmap represents
pre-allocated "device pages" to satisfy vmemmap_alloc_block_buf()
requests.

Signed-off-by: Dan Williams <dan.j.williams@intel.com>
Reported-by: kbuild test robot <lkp@intel.com>
Cc: Thomas Gleixner <tglx@linutronix.de>
Cc: Ingo Molnar <mingo@redhat.com>
Cc: "H. Peter Anvin" <hpa@zytor.com>
Cc: Dave Hansen <dave.hansen@linux.intel.com>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
2016-01-15 17:56:32 -08:00
David Vrabel
62cedb9f13 mm: memory hotplug with an existing resource
Add add_memory_resource() to add memory using an existing "System RAM"
resource.  This is useful if the memory region is being located by
finding a free resource slot with allocate_resource().

Xen guests will make use of this in their balloon driver to hotplug
arbitrary amounts of memory in response to toolstack requests.

Signed-off-by: David Vrabel <david.vrabel@citrix.com>
Reviewed-by: Daniel Kiper <daniel.kiper@oracle.com>
Reviewed-by: Tang Chen <tangchen@cn.fujitsu.com>
2015-10-23 14:19:58 +01:00
Dan Williams
033fbae988 mm: ZONE_DEVICE for "device memory"
While pmem is usable as a block device or via DAX mappings to userspace
there are several usage scenarios that can not target pmem due to its
lack of struct page coverage. In preparation for "hot plugging" pmem
into the vmemmap add ZONE_DEVICE as a new zone to tag these pages
separately from the ones that are subject to standard page allocations.
Importantly "device memory" can be removed at will by userspace
unbinding the driver of the device.

Having a separate zone prevents allocation and otherwise marks these
pages that are distinct from typical uniform memory.  Device memory has
different lifetime and performance characteristics than RAM.  However,
since we have run out of ZONES_SHIFT bits this functionality currently
depends on sacrificing ZONE_DMA.

Cc: H. Peter Anvin <hpa@zytor.com>
Cc: Ingo Molnar <mingo@redhat.com>
Cc: Dave Hansen <dave.hansen@linux.intel.com>
Cc: Rik van Riel <riel@redhat.com>
Cc: Mel Gorman <mgorman@suse.de>
Cc: Jerome Glisse <j.glisse@gmail.com>
[hch: various simplifications in the arch interface]
Signed-off-by: Christoph Hellwig <hch@lst.de>
Signed-off-by: Dan Williams <dan.j.williams@intel.com>
2015-08-27 19:40:58 -04:00
David Rientjes
30467e0b3b mm, hotplug: fix concurrent memory hot-add deadlock
There's a deadlock when concurrently hot-adding memory through the probe
interface and switching a memory block from offline to online.

When hot-adding memory via the probe interface, add_memory() first takes
mem_hotplug_begin() and then device_lock() is later taken when registering
the newly initialized memory block.  This creates a lock dependency of (1)
mem_hotplug.lock (2) dev->mutex.

When switching a memory block from offline to online, dev->mutex is first
grabbed in device_online() when the write(2) transitions an existing
memory block from offline to online, and then online_pages() will take
mem_hotplug_begin().

This creates a lock inversion between mem_hotplug.lock and dev->mutex.
Vitaly reports that this deadlock can happen when kworker handling a probe
event races with systemd-udevd switching a memory block's state.

This patch requires the state transition to take mem_hotplug_begin()
before dev->mutex.  Hot-adding memory via the probe interface creates a
memory block while holding mem_hotplug_begin(), there is no way to take
dev->mutex first in this case.

online_pages() and offline_pages() are only called when transitioning
memory block state.  We now require that mem_hotplug_begin() is taken
before calling them -- this requires exporting the mem_hotplug_begin() and
mem_hotplug_done() to generic code.  In all hot-add and hot-remove cases,
mem_hotplug_begin() is done prior to device_online().  This is all that is
needed to avoid the deadlock.

Signed-off-by: David Rientjes <rientjes@google.com>
Reported-by: Vitaly Kuznetsov <vkuznets@redhat.com>
Tested-by: Vitaly Kuznetsov <vkuznets@redhat.com>
Cc: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
Cc: "Rafael J. Wysocki" <rjw@rjwysocki.net>
Cc: "K. Y. Srinivasan" <kys@microsoft.com>
Cc: Yasuaki Ishimatsu <isimatu.yasuaki@jp.fujitsu.com>
Cc: Tang Chen <tangchen@cn.fujitsu.com>
Cc: Vlastimil Babka <vbabka@suse.cz>
Cc: Zhang Zhen <zhenzhang.zhang@huawei.com>
Cc: Vladimir Davydov <vdavydov@parallels.com>
Cc: Wang Nan <wangnan0@huawei.com>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
2015-04-14 16:49:00 -07:00
Zhang Zhen
ed2f240094 memory-hotplug: add sysfs valid_zones attribute
Currently memory-hotplug has two limits:

1. If the memory block is in ZONE_NORMAL, you can change it to
   ZONE_MOVABLE, but this memory block must be adjacent to ZONE_MOVABLE.

2. If the memory block is in ZONE_MOVABLE, you can change it to
   ZONE_NORMAL, but this memory block must be adjacent to ZONE_NORMAL.

With this patch, we can easy to know a memory block can be onlined to
which zone, and don't need to know the above two limits.

Updated the related Documentation.

[akpm@linux-foundation.org: use conventional comment layout]
[akpm@linux-foundation.org: fix build with CONFIG_MEMORY_HOTREMOVE=n]
[akpm@linux-foundation.org: remove unused local zone_prev]
Signed-off-by: Zhang Zhen <zhenzhang.zhang@huawei.com>
Cc: Dave Hansen <dave.hansen@intel.com>
Cc: David Rientjes <rientjes@google.com>
Cc: Toshi Kani <toshi.kani@hp.com>
Cc: Yasuaki Ishimatsu <isimatu.yasuaki@jp.fujitsu.com>
Cc: Naoya Horiguchi <n-horiguchi@ah.jp.nec.com>
Cc: Wang Nan <wangnan0@huawei.com>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
2014-10-09 22:25:52 -04:00
Wang Nan
6326440077 memory-hotplug: add zone_for_memory() for selecting zone for new memory
This series of patches fixes a problem when adding memory in bad manner.
For example: for a x86_64 machine booted with "mem=400M" and with 2GiB
memory installed, following commands cause problem:

  # echo 0x40000000 > /sys/devices/system/memory/probe
 [   28.613895] init_memory_mapping: [mem 0x40000000-0x47ffffff]
  # echo 0x48000000 > /sys/devices/system/memory/probe
 [   28.693675] init_memory_mapping: [mem 0x48000000-0x4fffffff]
  # echo online_movable > /sys/devices/system/memory/memory9/state
  # echo 0x50000000 > /sys/devices/system/memory/probe
 [   29.084090] init_memory_mapping: [mem 0x50000000-0x57ffffff]
  # echo 0x58000000 > /sys/devices/system/memory/probe
 [   29.151880] init_memory_mapping: [mem 0x58000000-0x5fffffff]
  # echo online_movable > /sys/devices/system/memory/memory11/state
  # echo online> /sys/devices/system/memory/memory8/state
  # echo online> /sys/devices/system/memory/memory10/state
  # echo offline> /sys/devices/system/memory/memory9/state
 [   30.558819] Offlined Pages 32768
  # free
              total       used       free     shared    buffers     cached
 Mem:        780588 18014398509432020     830552          0          0      51180
 -/+ buffers/cache: 18014398509380840     881732
 Swap:            0          0          0

This is because the above commands probe higher memory after online a
section with online_movable, which causes ZONE_HIGHMEM (or ZONE_NORMAL
for systems without ZONE_HIGHMEM) overlaps ZONE_MOVABLE.

After the second online_movable, the problem can be observed from
zoneinfo:

  # cat /proc/zoneinfo
  ...
  Node 0, zone  Movable
    pages free     65491
          min      250
          low      312
          high     375
          scanned  0
          spanned  18446744073709518848
          present  65536
          managed  65536
  ...

This series of patches solve the problem by checking ZONE_MOVABLE when
choosing zone for new memory.  If new memory is inside or higher than
ZONE_MOVABLE, makes it go there instead.

After applying this series of patches, following are free and zoneinfo
result (after offlining memory9):

  bash-4.2# free
                total       used       free     shared    buffers     cached
   Mem:        780956      80112     700844          0          0      51180
   -/+ buffers/cache:      28932     752024
   Swap:            0          0          0

  bash-4.2# cat /proc/zoneinfo

  Node 0, zone      DMA
    pages free     3389
          min      14
          low      17
          high     21
          scanned  0
          spanned  4095
          present  3998
          managed  3977
      nr_free_pages 3389
  ...
    start_pfn:         1
    inactive_ratio:    1
  Node 0, zone    DMA32
    pages free     73724
          min      341
          low      426
          high     511
          scanned  0
          spanned  98304
          present  98304
          managed  92958
      nr_free_pages 73724
    ...
    start_pfn:         4096
    inactive_ratio:    1
  Node 0, zone   Normal
    pages free     32630
          min      120
          low      150
          high     180
          scanned  0
          spanned  32768
          present  32768
          managed  32768
      nr_free_pages 32630
  ...
    start_pfn:         262144
    inactive_ratio:    1
  Node 0, zone  Movable
    pages free     65476
          min      241
          low      301
          high     361
          scanned  0
          spanned  98304
          present  65536
          managed  65536
      nr_free_pages 65476
  ...
    start_pfn:         294912
    inactive_ratio:    1

This patch (of 7):

Introduce zone_for_memory() in arch independent code for
arch_add_memory() use.

Many arch_add_memory() function simply selects ZONE_HIGHMEM or
ZONE_NORMAL and add new memory into it.  However, with the existance of
ZONE_MOVABLE, the selection method should be carefully considered: if
new, higher memory is added after ZONE_MOVABLE is setup, the default
zone and ZONE_MOVABLE may overlap each other.

should_add_memory_movable() checks the status of ZONE_MOVABLE.  If it
has already contain memory, compare the address of new memory and
movable memory.  If new memory is higher than movable, it should be
added into ZONE_MOVABLE instead of default zone.

Signed-off-by: Wang Nan <wangnan0@huawei.com>
Cc: Zhang Yanfei <zhangyanfei@cn.fujitsu.com>
Cc: Dave Hansen <dave.hansen@intel.com>
Cc: Ingo Molnar <mingo@redhat.com>
Cc: Yinghai Lu <yinghai@kernel.org>
Cc: "Mel Gorman" <mgorman@suse.de>
Cc: Thomas Gleixner <tglx@linutronix.de>
Cc: "H. Peter Anvin" <hpa@zytor.com>
Cc: "Luck, Tony" <tony.luck@intel.com>
Cc: Benjamin Herrenschmidt <benh@kernel.crashing.org>
Cc: Paul Mackerras <paulus@samba.org>
Cc: Chris Metcalf <cmetcalf@tilera.com>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
2014-08-06 18:01:21 -07:00
Tang Chen
4f7c6b49c4 mem-hotplug: introduce MMOP_OFFLINE to replace the hard coding -1
In store_mem_state(), we have:

  ...
  334         else if (!strncmp(buf, "offline", min_t(int, count, 7)))
  335                 online_type = -1;
  ...
  355         case -1:
  356                 ret = device_offline(&mem->dev);
  357                 break;
  ...

Here, "offline" is hard coded as -1.

This patch does the following renaming:

 ONLINE_KEEP     ->  MMOP_ONLINE_KEEP
 ONLINE_KERNEL   ->  MMOP_ONLINE_KERNEL
 ONLINE_MOVABLE  ->  MMOP_ONLINE_MOVABLE

and introduces MMOP_OFFLINE = -1 to avoid hard coding.

Signed-off-by: Tang Chen <tangchen@cn.fujitsu.com>
Cc: Hu Tao <hutao@cn.fujitsu.com>
Cc: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
Cc: Lai Jiangshan <laijs@cn.fujitsu.com>
Cc: Yasuaki Ishimatsu <isimatu.yasuaki@jp.fujitsu.com>
Cc: Gu Zheng <guz.fnst@cn.fujitsu.com>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
2014-08-06 18:01:16 -07:00
Vladimir Davydov
bfc8c90139 mem-hotplug: implement get/put_online_mems
kmem_cache_{create,destroy,shrink} need to get a stable value of
cpu/node online mask, because they init/destroy/access per-cpu/node
kmem_cache parts, which can be allocated or destroyed on cpu/mem
hotplug.  To protect against cpu hotplug, these functions use
{get,put}_online_cpus.  However, they do nothing to synchronize with
memory hotplug - taking the slab_mutex does not eliminate the
possibility of race as described in patch 2.

What we need there is something like get_online_cpus, but for memory.
We already have lock_memory_hotplug, which serves for the purpose, but
it's a bit of a hammer right now, because it's backed by a mutex.  As a
result, it imposes some limitations to locking order, which are not
desirable, and can't be used just like get_online_cpus.  That's why in
patch 1 I substitute it with get/put_online_mems, which work exactly
like get/put_online_cpus except they block not cpu, but memory hotplug.

[ v1 can be found at https://lkml.org/lkml/2014/4/6/68.  I NAK'ed it by
  myself, because it used an rw semaphore for get/put_online_mems,
  making them dead lock prune.  ]

This patch (of 2):

{un}lock_memory_hotplug, which is used to synchronize against memory
hotplug, is currently backed by a mutex, which makes it a bit of a
hammer - threads that only want to get a stable value of online nodes
mask won't be able to proceed concurrently.  Also, it imposes some
strong locking ordering rules on it, which narrows down the set of its
usage scenarios.

This patch introduces get/put_online_mems, which are the same as
get/put_online_cpus, but for memory hotplug, i.e.  executing a code
inside a get/put_online_mems section will guarantee a stable value of
online nodes, present pages, etc.

lock_memory_hotplug()/unlock_memory_hotplug() are removed altogether.

Signed-off-by: Vladimir Davydov <vdavydov@parallels.com>
Cc: Christoph Lameter <cl@linux.com>
Cc: Pekka Enberg <penberg@kernel.org>
Cc: Tang Chen <tangchen@cn.fujitsu.com>
Cc: Zhang Yanfei <zhangyanfei@cn.fujitsu.com>
Cc: Toshi Kani <toshi.kani@hp.com>
Cc: Xishi Qiu <qiuxishi@huawei.com>
Cc: Jiang Liu <liuj97@gmail.com>
Cc: Rafael J. Wysocki <rafael.j.wysocki@intel.com>
Cc: David Rientjes <rientjes@google.com>
Cc: Wen Congyang <wency@cn.fujitsu.com>
Cc: Yasuaki Ishimatsu <isimatu.yasuaki@jp.fujitsu.com>
Cc: Lai Jiangshan <laijs@cn.fujitsu.com>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
2014-06-04 16:53:59 -07:00
Zhang Yanfei
85b35feaec mm/sparsemem: use PAGES_PER_SECTION to remove redundant nr_pages parameter
For below functions,

- sparse_add_one_section()
- kmalloc_section_memmap()
- __kmalloc_section_memmap()
- __kfree_section_memmap()

they are always invoked to operate on one memory section, so it is
redundant to always pass a nr_pages parameter, which is the page numbers
in one section.  So we can directly use predefined macro PAGES_PER_SECTION
instead of passing the parameter.

Signed-off-by: Zhang Yanfei <zhangyanfei@cn.fujitsu.com>
Cc: Wen Congyang <wency@cn.fujitsu.com>
Cc: Tang Chen <tangchen@cn.fujitsu.com>
Cc: Toshi Kani <toshi.kani@hp.com>
Cc: Yasuaki Ishimatsu <isimatu.yasuaki@jp.fujitsu.com>
Cc: Yinghai Lu <yinghai@kernel.org>
Cc: Yasunori Goto <y-goto@jp.fujitsu.com>
Cc: Andy Whitcroft <apw@shadowen.org>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
2013-11-13 12:09:06 +09:00
Toshi Kani
01b0f19707 cpu/mem hotplug: add try_online_node() for cpu_up()
cpu_up() has #ifdef CONFIG_MEMORY_HOTPLUG code blocks, which call
mem_online_node() to put its node online if offlined and then call
build_all_zonelists() to initialize the zone list.

These steps are specific to memory hotplug, and should be managed in
mm/memory_hotplug.c.  lock_memory_hotplug() should also be held for the
whole steps.

For this reason, this patch replaces mem_online_node() with
try_online_node(), which performs the whole steps with
lock_memory_hotplug() held.  try_online_node() is named after
try_offline_node() as they have similar purpose.

There is no functional change in this patch.

Signed-off-by: Toshi Kani <toshi.kani@hp.com>
Reviewed-by: Yasuaki Ishimatsu <isimatu.yasuaki@jp.fujitsu.com>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
2013-11-13 12:09:04 +09:00
Rafael J. Wysocki
aba6efc471 Memory hotplug: Move alternative function definitions to header
Move the definitions of offline_pages() and remove_memory()
for CONFIG_MEMORY_HOTREMOVE to memory_hotplug.h, where they belong,
and make them static inline.

Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com>
2013-06-01 22:24:07 +02:00
Rafael J. Wysocki
242831eb15 Memory hotplug / ACPI: Simplify memory removal
Now that the memory offlining should be taken care of by the
companion device offlining code in acpi_scan_hot_remove(), the
ACPI memory hotplug driver doesn't need to offline it in
remove_memory() any more.  Moreover, since the return value of
remove_memory() is not used, it's better to make it be a void
function and trigger a BUG() if the memory scheduled for removal is
not offline.

Change the code in accordance with the above observations.

Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com>
Reviewed-by: Toshi Kani <toshi.kani@hp.com>
2013-06-01 21:37:10 +02:00
Rafael J. Wysocki
ea50be5934 Driver core / MM: Drop offline_memory_block()
Since offline_memory_block(mem) is functionally equivalent to
device_offline(&mem->dev), make the only caller of the former use
the latter instead and drop offline_memory_block() entirely.

Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com>
Acked-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
Acked-by: Toshi Kani <toshi.kani@hp.com>
2013-06-01 21:37:10 +02:00
Rafael J. Wysocki
e2ff39400d ACPI / memhotplug: Bind removable memory blocks to ACPI device nodes
During ACPI memory hotplug configuration bind memory blocks residing
in modules removable through the standard ACPI mechanism to struct
acpi_device objects associated with ACPI namespace objects
representing those modules.  Accordingly, unbind those memory blocks
from the struct acpi_device objects when the memory modules in
question are being removed.

When "offline" operation for devices representing memory blocks is
introduced, this will allow the ACPI core's device hot-remove code to
use it to carry out remove_memory() for those memory blocks and check
the results of that before it actually removes the modules holding
them from the system.

Since walk_memory_range() is used for accessing all memory blocks
corresponding to a given ACPI namespace object, it is exported from
memory_hotplug.c so that the code in acpi_memhotplug.c can use it.

Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com>
Tested-by: Vasilis Liaskovitis <vasilis.liaskovitis@profitbricks.com>
Reviewed-by: Toshi Kani <toshi.kani@hp.com>
2013-05-12 14:14:38 +02:00
David Rientjes
4edd7ceff0 mm, hotplug: avoid compiling memory hotremove functions when disabled
__remove_pages() is only necessary for CONFIG_MEMORY_HOTREMOVE.  PowerPC
pseries will return -EOPNOTSUPP if unsupported.

Adding an #ifdef causes several other functions it depends on to also
become unnecessary, which saves in .text when disabled (it's disabled in
most defconfigs besides powerpc, including x86).  remove_memory_block()
becomes static since it is not referenced outside of
drivers/base/memory.c.

Build tested on x86 and powerpc with CONFIG_MEMORY_HOTREMOVE both enabled
and disabled.

Signed-off-by: David Rientjes <rientjes@google.com>
Acked-by: Toshi Kani <toshi.kani@hp.com>
Cc: Benjamin Herrenschmidt <benh@kernel.crashing.org>
Cc: Paul Mackerras <paulus@samba.org>
Cc: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
Cc: Wen Congyang <wency@cn.fujitsu.com>
Cc: Tang Chen <tangchen@cn.fujitsu.com>
Cc: Yasuaki Ishimatsu <isimatu.yasuaki@jp.fujitsu.com>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
2013-04-29 15:54:37 -07:00
Wen Congyang
90b30cdc1d memory-hotplug: export the function try_offline_node()
try_offline_node() will be needed in the tristate
drivers/acpi/processor_driver.c.

The node will be offlined when all memory/cpu on the node have been
hotremoved.  So we need the function try_offline_node() in cpu-hotplug
path.

If the memory-hotplug is disabled, and cpu-hotplug is enabled

1. no memory no the node
   we don't online the node, and cpu's node is the nearest node.

2. the node contains some memory
   the node has been onlined, and cpu's node is still needed
   to migrate the sleep task on the cpu to the same node.

So we do nothing in try_offline_node() in this case.

[rientjes@google.com: export the function try_offline_node() fix]
Signed-off-by: Wen Congyang <wency@cn.fujitsu.com>
Signed-off-by: Tang Chen <tangchen@cn.fujitsu.com>
Cc: Yasuaki Ishimatsu <isimatu.yasuaki@jp.fujitsu.com>
Cc: David Rientjes <rientjes@google.com>
Cc: Jiang Liu <liuj97@gmail.com>
Cc: Minchan Kim <minchan.kim@gmail.com>
Cc: KOSAKI Motohiro <kosaki.motohiro@jp.fujitsu.com>
Cc: Mel Gorman <mel@csn.ul.ie>
Cc: Thomas Gleixner <tglx@linutronix.de>
Cc: Ingo Molnar <mingo@redhat.com>
Cc: "H. Peter Anvin" <hpa@zytor.com>
Cc: Peter Zijlstra <peterz@infradead.org>
Cc: Len Brown <lenb@kernel.org>
Signed-off-by: David Rientjes <rientjes@google.com>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
2013-02-23 17:50:13 -08:00
Tang Chen
60a5a19e74 memory-hotplug: remove sysfs file of node
Introduce a new function try_offline_node() to remove sysfs file of node
when all memory sections of this node are removed.  If some memory
sections of this node are not removed, this function does nothing.

Signed-off-by: Wen Congyang <wency@cn.fujitsu.com>
Signed-off-by: Tang Chen <tangchen@cn.fujitsu.com>
Cc: KOSAKI Motohiro <kosaki.motohiro@jp.fujitsu.com>
Cc: Jiang Liu <jiang.liu@huawei.com>
Cc: Jianguo Wu <wujianguo@huawei.com>
Cc: Kamezawa Hiroyuki <kamezawa.hiroyu@jp.fujitsu.com>
Cc: Lai Jiangshan <laijs@cn.fujitsu.com>
Cc: Wu Jianguo <wujianguo@huawei.com>
Cc: Yasuaki Ishimatsu <isimatu.yasuaki@jp.fujitsu.com>
Cc: Ingo Molnar <mingo@elte.hu>
Cc: Thomas Gleixner <tglx@linutronix.de>
Cc: "H. Peter Anvin" <hpa@zytor.com>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
2013-02-23 17:50:13 -08:00
Yasuaki Ishimatsu
46723bfa54 memory-hotplug: implement register_page_bootmem_info_section of sparse-vmemmap
For removing memmap region of sparse-vmemmap which is allocated bootmem,
memmap region of sparse-vmemmap needs to be registered by
get_page_bootmem().  So the patch searches pages of virtual mapping and
registers the pages by get_page_bootmem().

NOTE: register_page_bootmem_memmap() is not implemented for ia64,
      ppc, s390, and sparc.  So introduce CONFIG_HAVE_BOOTMEM_INFO_NODE
      and revert register_page_bootmem_info_node() when platform doesn't
      support it.

      It's implemented by adding a new Kconfig option named
      CONFIG_HAVE_BOOTMEM_INFO_NODE, which will be automatically selected
      by memory-hotplug feature fully supported archs(currently only on
      x86_64).

      Since we have 2 config options called MEMORY_HOTPLUG and
      MEMORY_HOTREMOVE used for memory hot-add and hot-remove separately,
      and codes in function register_page_bootmem_info_node() are only
      used for collecting infomation for hot-remove, so reside it under
      MEMORY_HOTREMOVE.

      Besides page_isolation.c selected by MEMORY_ISOLATION under
      MEMORY_HOTPLUG is also such case, move it too.

[mhocko@suse.cz: put register_page_bootmem_memmap inside CONFIG_MEMORY_HOTPLUG_SPARSE]
[linfeng@cn.fujitsu.com: introduce CONFIG_HAVE_BOOTMEM_INFO_NODE and revert register_page_bootmem_info_node()]
[mhocko@suse.cz: remove the arch specific functions without any implementation]
[linfeng@cn.fujitsu.com: mm/Kconfig: move auto selects from MEMORY_HOTPLUG to MEMORY_HOTREMOVE as needed]
[rientjes@google.com: fix defined but not used warning]
Signed-off-by: Wen Congyang <wency@cn.fujitsu.com>
Signed-off-by: Yasuaki Ishimatsu <isimatu.yasuaki@jp.fujitsu.com>
Signed-off-by: Tang Chen <tangchen@cn.fujitsu.com>
Reviewed-by: Wu Jianguo <wujianguo@huawei.com>
Cc: KOSAKI Motohiro <kosaki.motohiro@jp.fujitsu.com>
Cc: Jiang Liu <jiang.liu@huawei.com>
Cc: Jianguo Wu <wujianguo@huawei.com>
Cc: Kamezawa Hiroyuki <kamezawa.hiroyu@jp.fujitsu.com>
Cc: Lai Jiangshan <laijs@cn.fujitsu.com>
Cc: Ingo Molnar <mingo@elte.hu>
Cc: Thomas Gleixner <tglx@linutronix.de>
Cc: "H. Peter Anvin" <hpa@zytor.com>
Signed-off-by: Michal Hocko <mhocko@suse.cz>
Signed-off-by: Lin Feng <linfeng@cn.fujitsu.com>
Signed-off-by: David Rientjes <rientjes@google.com>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
2013-02-23 17:50:12 -08:00
Wen Congyang
24d335ca36 memory-hotplug: introduce new arch_remove_memory() for removing page table
For removing memory, we need to remove page tables.  But it depends on
architecture.  So the patch introduce arch_remove_memory() for removing
page table.  Now it only calls __remove_pages().

Note: __remove_pages() for some archtecuture is not implemented
      (I don't know how to implement it for s390).

Signed-off-by: Wen Congyang <wency@cn.fujitsu.com>
Signed-off-by: Tang Chen <tangchen@cn.fujitsu.com>
Acked-by: KAMEZAWA Hiroyuki <kamezawa.hiroyu@jp.fujitsu.com>
Cc: KOSAKI Motohiro <kosaki.motohiro@jp.fujitsu.com>
Cc: Jiang Liu <jiang.liu@huawei.com>
Cc: Jianguo Wu <wujianguo@huawei.com>
Cc: Kamezawa Hiroyuki <kamezawa.hiroyu@jp.fujitsu.com>
Cc: Lai Jiangshan <laijs@cn.fujitsu.com>
Cc: Wu Jianguo <wujianguo@huawei.com>
Cc: Yasuaki Ishimatsu <isimatu.yasuaki@jp.fujitsu.com>
Cc: Ingo Molnar <mingo@elte.hu>
Cc: Thomas Gleixner <tglx@linutronix.de>
Cc: "H. Peter Anvin" <hpa@zytor.com>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
2013-02-23 17:50:12 -08:00
Yasuaki Ishimatsu
6677e3eaf4 memory-hotplug: check whether all memory blocks are offlined or not when removing memory
We remove the memory like this:

 1. lock memory hotplug
 2. offline a memory block
 3. unlock memory hotplug
 4. repeat 1-3 to offline all memory blocks
 5. lock memory hotplug
 6. remove memory(TODO)
 7. unlock memory hotplug

All memory blocks must be offlined before removing memory.  But we don't
hold the lock in the whole operation.  So we should check whether all
memory blocks are offlined before step6.  Otherwise, kernel maybe
panicked.

Offlining a memory block and removing a memory device can be two
different operations.  Users can just offline some memory blocks without
removing the memory device.  For this purpose, the kernel has held
lock_memory_hotplug() in __offline_pages().  To reuse the code for
memory hot-remove, we repeat step 1-3 to offline all the memory blocks,
repeatedly lock and unlock memory hotplug, but not hold the memory
hotplug lock in the whole operation.

Signed-off-by: Wen Congyang <wency@cn.fujitsu.com>
Signed-off-by: Yasuaki Ishimatsu <isimatu.yasuaki@jp.fujitsu.com>
Signed-off-by: Tang Chen <tangchen@cn.fujitsu.com>
Acked-by: KAMEZAWA Hiroyuki <kamezawa.hiroyu@jp.fujitsu.com>
Cc: KOSAKI Motohiro <kosaki.motohiro@jp.fujitsu.com>
Cc: Jiang Liu <jiang.liu@huawei.com>
Cc: Jianguo Wu <wujianguo@huawei.com>
Cc: Kamezawa Hiroyuki <kamezawa.hiroyu@jp.fujitsu.com>
Cc: Lai Jiangshan <laijs@cn.fujitsu.com>
Cc: Wu Jianguo <wujianguo@huawei.com>
Cc: Ingo Molnar <mingo@elte.hu>
Cc: Thomas Gleixner <tglx@linutronix.de>
Cc: "H. Peter Anvin" <hpa@zytor.com>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
2013-02-23 17:50:11 -08:00
Lai Jiangshan
511c2aba8f mm, memory-hotplug: dynamic configure movable memory and portion memory
Add online_movable and online_kernel for logic memory hotplug.  This is
the dynamic version of "movablecore" & "kernelcore".

We have the same reason to introduce it as to introduce "movablecore" &
"kernelcore".  It has the same motive as "movablecore" & "kernelcore", but
it is dynamic/running-time:

o We can configure memory as kernelcore or movablecore after boot.

  Userspace workload is increased, we need more hugepage, we can't use
  "online_movable" to add memory and allow the system use more
  THP(transparent-huge-page), vice-verse when kernel workload is increase.

  Also help for virtualization to dynamic configure host/guest's memory,
  to save/(reduce waste) memory.

  Memory capacity on Demand

o When a new node is physically online after boot, we need to use
  "online_movable" or "online_kernel" to configure/portion it as we
  expected when we logic-online it.

  This configuration also helps for physically-memory-migrate.

o all benefit as the same as existed "movablecore" & "kernelcore".

o Preparing for movable-node, which is very important for power-saving,
  hardware partitioning and high-available-system(hardware fault
  management).

(Note, we don't introduce movable-node here.)

Action behavior:
When a memoryblock/memorysection is onlined by "online_movable", the kernel
will not have directly reference to the page of the memoryblock,
thus we can remove that memory any time when needed.

When it is online by "online_kernel", the kernel can use it.
When it is online by "online", the zone type doesn't changed.

Current constraints:
Only the memoryblock which is adjacent to the ZONE_MOVABLE
can be online from ZONE_NORMAL to ZONE_MOVABLE.

[akpm@linux-foundation.org: use min_t, cleanups]
Signed-off-by: Lai Jiangshan <laijs@cn.fujitsu.com>
Signed-off-by: Wen Congyang <wency@cn.fujitsu.com>
Cc: Yasuaki Ishimatsu <isimatu.yasuaki@jp.fujitsu.com>
Cc: Lai Jiangshan <laijs@cn.fujitsu.com>
Cc: Jiang Liu <jiang.liu@huawei.com>
Cc: KOSAKI Motohiro <kosaki.motohiro@jp.fujitsu.com>
Cc: Minchan Kim <minchan.kim@gmail.com>
Cc: Mel Gorman <mgorman@suse.de>
Cc: David Rientjes <rientjes@google.com>
Cc: Yinghai Lu <yinghai@kernel.org>
Cc: Rusty Russell <rusty@rustcorp.com.au>
Cc: Greg KH <greg@kroah.com>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
2012-12-11 17:22:28 -08:00
Wen Congyang
e90bdb7f52 memory-hotplug: update memory block's state and notify userspace
remove_memory() will be called when hot removing a memory device.  But
even if offlining memory, we cannot notice it.  So the patch updates the
memory block's state and sends notification to userspace.

Additionally, the memory device may contain more than one memory block.
If the memory block has been offlined, __offline_pages() will fail.  So we
should try to offline one memory block at a time.

Thus remove_memory() also check each memory block's state.  So there is no
need to check the memory block's state before calling remove_memory().

Signed-off-by: Wen Congyang <wency@cn.fujitsu.com>
Signed-off-by: Yasuaki Ishimatsu <isimatu.yasuaki@jp.fujitsu.com>
Cc: David Rientjes <rientjes@google.com>
Cc: Jiang Liu <liuj97@gmail.com>
Cc: Len Brown <len.brown@intel.com>
Cc: Christoph Lameter <cl@linux.com>
Cc: Minchan Kim <minchan.kim@gmail.com>
Cc: KOSAKI Motohiro <kosaki.motohiro@jp.fujitsu.com>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
2012-10-09 16:23:02 +09:00
Wen Congyang
a16cee10c7 memory-hotplug: preparation to notify memory block's state at memory hot remove
remove_memory() is called in two cases:
1. echo offline >/sys/devices/system/memory/memoryXX/state
2. hot remove a memory device

In the 1st case, the memory block's state is changed and the notification
that memory block's state changed is sent to userland after calling
remove_memory().  So user can notice memory block is changed.

But in the 2nd case, the memory block's state is not changed and the
notification is not also sent to userspcae even if calling
remove_memory().  So user cannot notice memory block is changed.

For adding the notification at memory hot remove, the patch just prepare
as follows:
1st case uses offline_pages() for offlining memory.
2nd case uses remove_memory() for offlining memory and changing memory block's
    state and notifing the information.

The patch does not implement notification to remove_memory().

Signed-off-by: Wen Congyang <wency@cn.fujitsu.com>
Signed-off-by: Yasuaki Ishimatsu <isimatu.yasuaki@jp.fujitsu.com>
Cc: David Rientjes <rientjes@google.com>
Cc: Jiang Liu <liuj97@gmail.com>
Cc: Len Brown <len.brown@intel.com>
Cc: Christoph Lameter <cl@linux.com>
Cc: Minchan Kim <minchan.kim@gmail.com>
Cc: KOSAKI Motohiro <kosaki.motohiro@jp.fujitsu.com>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
2012-10-09 16:23:02 +09:00
Paul Gortmaker
187f1882b5 BUG: headers with BUG/BUG_ON etc. need linux/bug.h
If a header file is making use of BUG, BUG_ON, BUILD_BUG_ON, or any
other BUG variant in a static inline (i.e. not in a #define) then
that header really should be including <linux/bug.h> and not just
expecting it to be implicitly present.

We can make this change risk-free, since if the files using these
headers didn't have exposure to linux/bug.h already, they would have
been causing compile failures/warnings.

Signed-off-by: Paul Gortmaker <paul.gortmaker@windriver.com>
2012-03-04 17:54:34 -05:00
Daniel Kiper
9d0ad8ca43 mm: extend memory hotplug API to allow memory hotplug in virtual machines
This patch contains online_page_callback and apropriate functions for
registering/unregistering online page callbacks.  It allows to do some
machine specific tasks during online page stage which is required to
implement memory hotplug in virtual machines.  Currently this patch is
required by latest memory hotplug support for Xen balloon driver patch
which will be posted soon.

Additionally, originial online_page() function was splited into
following functions doing "atomic" operations:

  - __online_page_set_limits() - set new limits for memory management code,
  - __online_page_increment_counters() - increment totalram_pages and totalhigh_pages,
  - __online_page_free() - free page to allocator.

It was done to:
  - not duplicate existing code,
  - ease hotplug code devolpment by usage of well defined interface,
  - avoid stupid bugs which are unavoidable when the same code
    (by design) is developed in many places.

[akpm@linux-foundation.org: use explicit indirect-call syntax]
Signed-off-by: Daniel Kiper <dkiper@net-space.pl>
Reviewed-by: Konrad Rzeszutek Wilk <konrad.wilk@oracle.com>
Cc: Ian Campbell <ian.campbell@citrix.com>
Cc: Jeremy Fitzhardinge <jeremy@goop.org>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
2011-07-25 20:57:08 -07:00
Pekka Enberg
597fb188cb Merge branch 'slub/hotplug' into slab/urgent 2011-01-15 13:28:17 +02:00