Message ID | 20240704073544.670249-1-sughosh.ganu@linaro.org |
---|---|
Headers | show |
Series | Make U-Boot memory reservations coherent | expand |
On Thu, Jul 04, 2024 at 01:04:56PM +0530, Sughosh Ganu wrote: > The aim of this patch series is to fix the current state of > incoherence between modules when it comes to memory usage. The primary > issue that this series is trying to fix is that the EFI memory module > which is responsible for allocating and freeing memory, does not have > any visibility of the memory that is being used by the LMB > module. This is further complicated by the fact that the LMB > allocations are caller specific -- the LMB memory map is not global > nor persistent. This means that the memory "allocated" by the LMB > module might be relevant only for a given function. Hence one of the > requirements for making the memory usage visible across modules is to > make LMB allocations persistent and global, and then have means to > communicate the use of memory across modules. > > The first set of patches in this series work on making the LMB memory > map persistent and global. This is being done keeping in mind the > usage of LMB memory by platforms where the same memory region can be > used to load multiple different images. What is not allowed is to > overwrite memory that has been allocated by the other module, > currently the EFI memory module. This is being achieved by introducing > a new flag, LMB_NOOVERWRITE, which represents memory which cannot be > re-requested once allocated. > > A review comment on the earlier version was to do away with the static > arrays for the LMB lists of free and used memory. This version > uses the alloced list data structure for the LMB lists. > > The second set of patches are making changes to the EFI memory module > to make use of the LMB functions to allocate and free memory. A > *_flags() version of LMB API's has been introduced for the same. The > earlier version was using notification mechanism from both LMB and EFI > modules to maintain memory coherence. This version makes use of the > LMB API functions for the memory allocations. This is based on review > comments of EFI maintainers. On am64x_evm_a53, the last test in test/py/tests/test_net_boot.py fails due to: ... TFTP from server 192.168.116.10; our IP address is 192.168.116.23 Filename 'pxelinux.cfg/default-arm-k3'. Load address: 0x80100000 Loading: ################################################## 64 Bytes 8.8 KiB/s done Bytes transferred = 64 (40 hex) 1 pxe ready ethernet 0 port@1.bootdev.0 extlinux/extlinux.conf ** Booting bootflow 'port@1.bootdev.0' with pxe Retrieving file: pxelinux.cfg/default-arm am65_cpsw_nuss_port ethernet@8000000port@1: K3 CPSW: rflow_id_base: 16 link up on port 1, speed 1000, full duplex Using ethernet@8000000port@1 device TFTP from server 192.168.116.10; our IP address is 192.168.116.23 Filename 'pxelinux.cfg/default-arm'. TFTP error: trying to overwrite reserved memory... Couldn't retrieve pxelinux.cfg/default-arm And note that the pxelinux.cfg files are created as defined by the example within the test. This test is also still fine on Pi 4.
On Thu, Jul 04, 2024 at 01:04:56PM +0530, Sughosh Ganu wrote: > The aim of this patch series is to fix the current state of > incoherence between modules when it comes to memory usage. The primary > issue that this series is trying to fix is that the EFI memory module > which is responsible for allocating and freeing memory, does not have > any visibility of the memory that is being used by the LMB > module. This is further complicated by the fact that the LMB > allocations are caller specific -- the LMB memory map is not global > nor persistent. This means that the memory "allocated" by the LMB > module might be relevant only for a given function. Hence one of the > requirements for making the memory usage visible across modules is to > make LMB allocations persistent and global, and then have means to > communicate the use of memory across modules. > > The first set of patches in this series work on making the LMB memory > map persistent and global. This is being done keeping in mind the > usage of LMB memory by platforms where the same memory region can be > used to load multiple different images. What is not allowed is to > overwrite memory that has been allocated by the other module, > currently the EFI memory module. This is being achieved by introducing > a new flag, LMB_NOOVERWRITE, which represents memory which cannot be > re-requested once allocated. > > A review comment on the earlier version was to do away with the static > arrays for the LMB lists of free and used memory. This version > uses the alloced list data structure for the LMB lists. > > The second set of patches are making changes to the EFI memory module > to make use of the LMB functions to allocate and free memory. A > *_flags() version of LMB API's has been introduced for the same. The > earlier version was using notification mechanism from both LMB and EFI > modules to maintain memory coherence. This version makes use of the > LMB API functions for the memory allocations. This is based on review > comments of EFI maintainers. Please re-work so that the series is bisectable. For example xilinx_zynqmp_r5 fails that check currently. And I found that looking in to why it grows by ~1500 bytes overall. This likely is due to CONFIG_EFI_LOADER=n and so while the case where EFI_LOADER is enabled tends to be a size win (reduction) or wash we need to look at the CONFIG_EFI_LOADER=n case more. The alist code will be a little growth and that's fine enough. But realloc and do_bdinfo are the two big growths at the top, in this case.
Hi Sugosh, On Mon, 8 Jul 2024 at 15:35, Tom Rini <trini@konsulko.com> wrote: > > On Thu, Jul 04, 2024 at 01:04:56PM +0530, Sughosh Ganu wrote: I don't believe coherent is the right word here. Perhaps 'more persistent' is better? > > > The aim of this patch series is to fix the current state of > > incoherence between modules when it comes to memory usage. The primary > > issue that this series is trying to fix is that the EFI memory module > > which is responsible for allocating and freeing memory, does not have > > any visibility of the memory that is being used by the LMB > > module. This is further complicated by the fact that the LMB > > allocations are caller specific -- the LMB memory map is not global > > nor persistent. This means that the memory "allocated" by the LMB > > module might be relevant only for a given function. Hence one of the > > requirements for making the memory usage visible across modules is to > > make LMB allocations persistent and global, and then have means to > > communicate the use of memory across modules. > > > > The first set of patches in this series work on making the LMB memory > > map persistent and global. This is being done keeping in mind the > > usage of LMB memory by platforms where the same memory region can be > > used to load multiple different images. What is not allowed is to > > overwrite memory that has been allocated by the other module, > > currently the EFI memory module. This is being achieved by introducing > > a new flag, LMB_NOOVERWRITE, which represents memory which cannot be > > re-requested once allocated. > > > > A review comment on the earlier version was to do away with the static > > arrays for the LMB lists of free and used memory. This version > > uses the alloced list data structure for the LMB lists. > > > > The second set of patches are making changes to the EFI memory module > > to make use of the LMB functions to allocate and free memory. A > > *_flags() version of LMB API's has been introduced for the same. The > > earlier version was using notification mechanism from both LMB and EFI > > modules to maintain memory coherence. This version makes use of the > > LMB API functions for the memory allocations. This is based on review > > comments of EFI maintainers. > > Please re-work so that the series is bisectable. For example > xilinx_zynqmp_r5 fails that check currently. And I found that looking in > to why it grows by ~1500 bytes overall. This likely is due to > CONFIG_EFI_LOADER=n and so while the case where EFI_LOADER is enabled > tends to be a size win (reduction) or wash we need to look at the > CONFIG_EFI_LOADER=n case more. The alist code will be a little growth > and that's fine enough. But realloc and do_bdinfo are the two big > growths at the top, in this case. > > -- > Tom Regards, Simon
On Mon, 8 Jul 2024 at 19:32, Tom Rini <trini@konsulko.com> wrote: > > On Thu, Jul 04, 2024 at 01:04:56PM +0530, Sughosh Ganu wrote: > > > The aim of this patch series is to fix the current state of > > incoherence between modules when it comes to memory usage. The primary > > issue that this series is trying to fix is that the EFI memory module > > which is responsible for allocating and freeing memory, does not have > > any visibility of the memory that is being used by the LMB > > module. This is further complicated by the fact that the LMB > > allocations are caller specific -- the LMB memory map is not global > > nor persistent. This means that the memory "allocated" by the LMB > > module might be relevant only for a given function. Hence one of the > > requirements for making the memory usage visible across modules is to > > make LMB allocations persistent and global, and then have means to > > communicate the use of memory across modules. > > > > The first set of patches in this series work on making the LMB memory > > map persistent and global. This is being done keeping in mind the > > usage of LMB memory by platforms where the same memory region can be > > used to load multiple different images. What is not allowed is to > > overwrite memory that has been allocated by the other module, > > currently the EFI memory module. This is being achieved by introducing > > a new flag, LMB_NOOVERWRITE, which represents memory which cannot be > > re-requested once allocated. > > > > A review comment on the earlier version was to do away with the static > > arrays for the LMB lists of free and used memory. This version > > uses the alloced list data structure for the LMB lists. > > > > The second set of patches are making changes to the EFI memory module > > to make use of the LMB functions to allocate and free memory. A > > *_flags() version of LMB API's has been introduced for the same. The > > earlier version was using notification mechanism from both LMB and EFI > > modules to maintain memory coherence. This version makes use of the > > LMB API functions for the memory allocations. This is based on review > > comments of EFI maintainers. > > On am64x_evm_a53, the last test in test/py/tests/test_net_boot.py fails > due to: > ... > TFTP from server 192.168.116.10; our IP address is 192.168.116.23 > Filename 'pxelinux.cfg/default-arm-k3'. > Load address: 0x80100000 > Loading: ################################################## 64 Bytes > 8.8 KiB/s > done > Bytes transferred = 64 (40 hex) > 1 pxe ready ethernet 0 port@1.bootdev.0 extlinux/extlinux.conf > ** Booting bootflow 'port@1.bootdev.0' with pxe > Retrieving file: pxelinux.cfg/default-arm > am65_cpsw_nuss_port ethernet@8000000port@1: K3 CPSW: rflow_id_base: 16 > link up on port 1, speed 1000, full duplex > Using ethernet@8000000port@1 device > TFTP from server 192.168.116.10; our IP address is 192.168.116.23 > Filename 'pxelinux.cfg/default-arm'. > > TFTP error: trying to overwrite reserved memory... > Couldn't retrieve pxelinux.cfg/default-arm So this seems to be failing because the address used to load the pxe config file seems to be overlapping with an already reserved region of memory. Can you please check if modifying the address works? > > And note that the pxelinux.cfg files are created as defined by the > example within the test. This test is also still fine on Pi 4. If this is working fine on the Pi 4, this is mostly to do with needing to tweak the load address. -sughosh > > -- > Tom
On Mon, Jul 22, 2024 at 11:58:18AM +0530, Sughosh Ganu wrote: > On Mon, 8 Jul 2024 at 19:32, Tom Rini <trini@konsulko.com> wrote: > > > > On Thu, Jul 04, 2024 at 01:04:56PM +0530, Sughosh Ganu wrote: > > > > > The aim of this patch series is to fix the current state of > > > incoherence between modules when it comes to memory usage. The primary > > > issue that this series is trying to fix is that the EFI memory module > > > which is responsible for allocating and freeing memory, does not have > > > any visibility of the memory that is being used by the LMB > > > module. This is further complicated by the fact that the LMB > > > allocations are caller specific -- the LMB memory map is not global > > > nor persistent. This means that the memory "allocated" by the LMB > > > module might be relevant only for a given function. Hence one of the > > > requirements for making the memory usage visible across modules is to > > > make LMB allocations persistent and global, and then have means to > > > communicate the use of memory across modules. > > > > > > The first set of patches in this series work on making the LMB memory > > > map persistent and global. This is being done keeping in mind the > > > usage of LMB memory by platforms where the same memory region can be > > > used to load multiple different images. What is not allowed is to > > > overwrite memory that has been allocated by the other module, > > > currently the EFI memory module. This is being achieved by introducing > > > a new flag, LMB_NOOVERWRITE, which represents memory which cannot be > > > re-requested once allocated. > > > > > > A review comment on the earlier version was to do away with the static > > > arrays for the LMB lists of free and used memory. This version > > > uses the alloced list data structure for the LMB lists. > > > > > > The second set of patches are making changes to the EFI memory module > > > to make use of the LMB functions to allocate and free memory. A > > > *_flags() version of LMB API's has been introduced for the same. The > > > earlier version was using notification mechanism from both LMB and EFI > > > modules to maintain memory coherence. This version makes use of the > > > LMB API functions for the memory allocations. This is based on review > > > comments of EFI maintainers. > > > > On am64x_evm_a53, the last test in test/py/tests/test_net_boot.py fails > > due to: > > ... > > TFTP from server 192.168.116.10; our IP address is 192.168.116.23 > > Filename 'pxelinux.cfg/default-arm-k3'. > > Load address: 0x80100000 > > Loading: ################################################## 64 Bytes > > 8.8 KiB/s > > done > > Bytes transferred = 64 (40 hex) > > 1 pxe ready ethernet 0 port@1.bootdev.0 extlinux/extlinux.conf > > ** Booting bootflow 'port@1.bootdev.0' with pxe > > Retrieving file: pxelinux.cfg/default-arm > > am65_cpsw_nuss_port ethernet@8000000port@1: K3 CPSW: rflow_id_base: 16 > > link up on port 1, speed 1000, full duplex > > Using ethernet@8000000port@1 device > > TFTP from server 192.168.116.10; our IP address is 192.168.116.23 > > Filename 'pxelinux.cfg/default-arm'. > > > > TFTP error: trying to overwrite reserved memory... > > Couldn't retrieve pxelinux.cfg/default-arm > > So this seems to be failing because the address used to load the pxe > config file seems to be overlapping with an already reserved region of > memory. Can you please check if modifying the address works? I'm not sure what address you're thinking of modifying but, this isn't overwriting U-Boot itself so it's a case that needs to work.
On Mon, 22 Jul 2024 at 23:03, Tom Rini <trini@konsulko.com> wrote: > > On Mon, Jul 22, 2024 at 11:58:18AM +0530, Sughosh Ganu wrote: > > On Mon, 8 Jul 2024 at 19:32, Tom Rini <trini@konsulko.com> wrote: > > > > > > On Thu, Jul 04, 2024 at 01:04:56PM +0530, Sughosh Ganu wrote: > > > > > > > The aim of this patch series is to fix the current state of > > > > incoherence between modules when it comes to memory usage. The primary > > > > issue that this series is trying to fix is that the EFI memory module > > > > which is responsible for allocating and freeing memory, does not have > > > > any visibility of the memory that is being used by the LMB > > > > module. This is further complicated by the fact that the LMB > > > > allocations are caller specific -- the LMB memory map is not global > > > > nor persistent. This means that the memory "allocated" by the LMB > > > > module might be relevant only for a given function. Hence one of the > > > > requirements for making the memory usage visible across modules is to > > > > make LMB allocations persistent and global, and then have means to > > > > communicate the use of memory across modules. > > > > > > > > The first set of patches in this series work on making the LMB memory > > > > map persistent and global. This is being done keeping in mind the > > > > usage of LMB memory by platforms where the same memory region can be > > > > used to load multiple different images. What is not allowed is to > > > > overwrite memory that has been allocated by the other module, > > > > currently the EFI memory module. This is being achieved by introducing > > > > a new flag, LMB_NOOVERWRITE, which represents memory which cannot be > > > > re-requested once allocated. > > > > > > > > A review comment on the earlier version was to do away with the static > > > > arrays for the LMB lists of free and used memory. This version > > > > uses the alloced list data structure for the LMB lists. > > > > > > > > The second set of patches are making changes to the EFI memory module > > > > to make use of the LMB functions to allocate and free memory. A > > > > *_flags() version of LMB API's has been introduced for the same. The > > > > earlier version was using notification mechanism from both LMB and EFI > > > > modules to maintain memory coherence. This version makes use of the > > > > LMB API functions for the memory allocations. This is based on review > > > > comments of EFI maintainers. > > > > > > On am64x_evm_a53, the last test in test/py/tests/test_net_boot.py fails > > > due to: > > > ... > > > TFTP from server 192.168.116.10; our IP address is 192.168.116.23 > > > Filename 'pxelinux.cfg/default-arm-k3'. > > > Load address: 0x80100000 > > > Loading: ################################################## 64 Bytes > > > 8.8 KiB/s > > > done > > > Bytes transferred = 64 (40 hex) > > > 1 pxe ready ethernet 0 port@1.bootdev.0 extlinux/extlinux.conf > > > ** Booting bootflow 'port@1.bootdev.0' with pxe > > > Retrieving file: pxelinux.cfg/default-arm > > > am65_cpsw_nuss_port ethernet@8000000port@1: K3 CPSW: rflow_id_base: 16 > > > link up on port 1, speed 1000, full duplex > > > Using ethernet@8000000port@1 device > > > TFTP from server 192.168.116.10; our IP address is 192.168.116.23 > > > Filename 'pxelinux.cfg/default-arm'. > > > > > > TFTP error: trying to overwrite reserved memory... > > > Couldn't retrieve pxelinux.cfg/default-arm > > > > So this seems to be failing because the address used to load the pxe > > config file seems to be overlapping with an already reserved region of > > memory. Can you please check if modifying the address works? > > I'm not sure what address you're thinking of modifying but, this isn't > overwriting U-Boot itself so it's a case that needs to work. Can you please print the lmb memory map through bdinfo and share it with me. That will give some info on what is causing the issue. Thing is, with this patchset, if there is another reservation with a different flag(like LMB_NOMAP, LMB_NOOVERWRITE), this would cause the load to fail. -sughosh > > -- > Tom
Hi Sughosh, On Mon, 22 Jul 2024 at 18:37, Sughosh Ganu <sughosh.ganu@linaro.org> wrote: > > On Mon, 22 Jul 2024 at 23:03, Tom Rini <trini@konsulko.com> wrote: > > > > On Mon, Jul 22, 2024 at 11:58:18AM +0530, Sughosh Ganu wrote: > > > On Mon, 8 Jul 2024 at 19:32, Tom Rini <trini@konsulko.com> wrote: > > > > > > > > On Thu, Jul 04, 2024 at 01:04:56PM +0530, Sughosh Ganu wrote: > > > > > > > > > The aim of this patch series is to fix the current state of > > > > > incoherence between modules when it comes to memory usage. The primary > > > > > issue that this series is trying to fix is that the EFI memory module > > > > > which is responsible for allocating and freeing memory, does not have > > > > > any visibility of the memory that is being used by the LMB > > > > > module. This is further complicated by the fact that the LMB > > > > > allocations are caller specific -- the LMB memory map is not global > > > > > nor persistent. This means that the memory "allocated" by the LMB > > > > > module might be relevant only for a given function. Hence one of the > > > > > requirements for making the memory usage visible across modules is to > > > > > make LMB allocations persistent and global, and then have means to > > > > > communicate the use of memory across modules. > > > > > > > > > > The first set of patches in this series work on making the LMB memory > > > > > map persistent and global. This is being done keeping in mind the > > > > > usage of LMB memory by platforms where the same memory region can be > > > > > used to load multiple different images. What is not allowed is to > > > > > overwrite memory that has been allocated by the other module, > > > > > currently the EFI memory module. This is being achieved by introducing > > > > > a new flag, LMB_NOOVERWRITE, which represents memory which cannot be > > > > > re-requested once allocated. > > > > > > > > > > A review comment on the earlier version was to do away with the static > > > > > arrays for the LMB lists of free and used memory. This version > > > > > uses the alloced list data structure for the LMB lists. > > > > > > > > > > The second set of patches are making changes to the EFI memory module > > > > > to make use of the LMB functions to allocate and free memory. A > > > > > *_flags() version of LMB API's has been introduced for the same. The > > > > > earlier version was using notification mechanism from both LMB and EFI > > > > > modules to maintain memory coherence. This version makes use of the > > > > > LMB API functions for the memory allocations. This is based on review > > > > > comments of EFI maintainers. > > > > > > > > On am64x_evm_a53, the last test in test/py/tests/test_net_boot.py fails > > > > due to: > > > > ... > > > > TFTP from server 192.168.116.10; our IP address is 192.168.116.23 > > > > Filename 'pxelinux.cfg/default-arm-k3'. > > > > Load address: 0x80100000 > > > > Loading: ################################################## 64 Bytes > > > > 8.8 KiB/s > > > > done > > > > Bytes transferred = 64 (40 hex) > > > > 1 pxe ready ethernet 0 port@1.bootdev.0 extlinux/extlinux.conf > > > > ** Booting bootflow 'port@1.bootdev.0' with pxe > > > > Retrieving file: pxelinux.cfg/default-arm > > > > am65_cpsw_nuss_port ethernet@8000000port@1: K3 CPSW: rflow_id_base: 16 > > > > link up on port 1, speed 1000, full duplex > > > > Using ethernet@8000000port@1 device > > > > TFTP from server 192.168.116.10; our IP address is 192.168.116.23 > > > > Filename 'pxelinux.cfg/default-arm'. > > > > > > > > TFTP error: trying to overwrite reserved memory... > > > > Couldn't retrieve pxelinux.cfg/default-arm > > > > > > So this seems to be failing because the address used to load the pxe > > > config file seems to be overlapping with an already reserved region of > > > memory. Can you please check if modifying the address works? > > > > I'm not sure what address you're thinking of modifying but, this isn't > > overwriting U-Boot itself so it's a case that needs to work. > > Can you please print the lmb memory map through bdinfo and share it > with me. That will give some info on what is causing the issue. Thing > is, with this patchset, if there is another reservation with a > different flag(like LMB_NOMAP, LMB_NOOVERWRITE), this would cause the > load to fail. I too am interested in what it might be. Scripts in general can load things where they want, so we need to be careful. Also, as mentioned I would really like to clean up the 'EFI' 'allocations' so that they only happen when booting an EFI payload. I am not too sure what the current state is, but I will see if I can take a look. Regards, SImon
On Mon, Jul 22, 2024 at 11:07:45PM +0530, Sughosh Ganu wrote: > On Mon, 22 Jul 2024 at 23:03, Tom Rini <trini@konsulko.com> wrote: > > > > On Mon, Jul 22, 2024 at 11:58:18AM +0530, Sughosh Ganu wrote: > > > On Mon, 8 Jul 2024 at 19:32, Tom Rini <trini@konsulko.com> wrote: > > > > > > > > On Thu, Jul 04, 2024 at 01:04:56PM +0530, Sughosh Ganu wrote: > > > > > > > > > The aim of this patch series is to fix the current state of > > > > > incoherence between modules when it comes to memory usage. The primary > > > > > issue that this series is trying to fix is that the EFI memory module > > > > > which is responsible for allocating and freeing memory, does not have > > > > > any visibility of the memory that is being used by the LMB > > > > > module. This is further complicated by the fact that the LMB > > > > > allocations are caller specific -- the LMB memory map is not global > > > > > nor persistent. This means that the memory "allocated" by the LMB > > > > > module might be relevant only for a given function. Hence one of the > > > > > requirements for making the memory usage visible across modules is to > > > > > make LMB allocations persistent and global, and then have means to > > > > > communicate the use of memory across modules. > > > > > > > > > > The first set of patches in this series work on making the LMB memory > > > > > map persistent and global. This is being done keeping in mind the > > > > > usage of LMB memory by platforms where the same memory region can be > > > > > used to load multiple different images. What is not allowed is to > > > > > overwrite memory that has been allocated by the other module, > > > > > currently the EFI memory module. This is being achieved by introducing > > > > > a new flag, LMB_NOOVERWRITE, which represents memory which cannot be > > > > > re-requested once allocated. > > > > > > > > > > A review comment on the earlier version was to do away with the static > > > > > arrays for the LMB lists of free and used memory. This version > > > > > uses the alloced list data structure for the LMB lists. > > > > > > > > > > The second set of patches are making changes to the EFI memory module > > > > > to make use of the LMB functions to allocate and free memory. A > > > > > *_flags() version of LMB API's has been introduced for the same. The > > > > > earlier version was using notification mechanism from both LMB and EFI > > > > > modules to maintain memory coherence. This version makes use of the > > > > > LMB API functions for the memory allocations. This is based on review > > > > > comments of EFI maintainers. > > > > > > > > On am64x_evm_a53, the last test in test/py/tests/test_net_boot.py fails > > > > due to: > > > > ... > > > > TFTP from server 192.168.116.10; our IP address is 192.168.116.23 > > > > Filename 'pxelinux.cfg/default-arm-k3'. > > > > Load address: 0x80100000 > > > > Loading: ################################################## 64 Bytes > > > > 8.8 KiB/s > > > > done > > > > Bytes transferred = 64 (40 hex) > > > > 1 pxe ready ethernet 0 port@1.bootdev.0 extlinux/extlinux.conf > > > > ** Booting bootflow 'port@1.bootdev.0' with pxe > > > > Retrieving file: pxelinux.cfg/default-arm > > > > am65_cpsw_nuss_port ethernet@8000000port@1: K3 CPSW: rflow_id_base: 16 > > > > link up on port 1, speed 1000, full duplex > > > > Using ethernet@8000000port@1 device > > > > TFTP from server 192.168.116.10; our IP address is 192.168.116.23 > > > > Filename 'pxelinux.cfg/default-arm'. > > > > > > > > TFTP error: trying to overwrite reserved memory... > > > > Couldn't retrieve pxelinux.cfg/default-arm > > > > > > So this seems to be failing because the address used to load the pxe > > > config file seems to be overlapping with an already reserved region of > > > memory. Can you please check if modifying the address works? > > > > I'm not sure what address you're thinking of modifying but, this isn't > > overwriting U-Boot itself so it's a case that needs to work. > > Can you please print the lmb memory map through bdinfo and share it > with me. That will give some info on what is causing the issue. Thing > is, with this patchset, if there is another reservation with a > different flag(like LMB_NOMAP, LMB_NOOVERWRITE), this would cause the > load to fail. Well hunh. I thought I had reproduced the issue before posting, but I just pushed the same tree (I'm fairly certain) over to my lab and the tests are passing now. So, lets just see what happens with the next iteration of the series, sorry for the noise.
On Tue, 23 Jul 2024 at 20:18, Tom Rini <trini@konsulko.com> wrote: > > On Mon, Jul 22, 2024 at 11:07:45PM +0530, Sughosh Ganu wrote: > > On Mon, 22 Jul 2024 at 23:03, Tom Rini <trini@konsulko.com> wrote: > > > > > > On Mon, Jul 22, 2024 at 11:58:18AM +0530, Sughosh Ganu wrote: > > > > On Mon, 8 Jul 2024 at 19:32, Tom Rini <trini@konsulko.com> wrote: > > > > > > > > > > On Thu, Jul 04, 2024 at 01:04:56PM +0530, Sughosh Ganu wrote: > > > > > > > > > > > The aim of this patch series is to fix the current state of > > > > > > incoherence between modules when it comes to memory usage. The primary > > > > > > issue that this series is trying to fix is that the EFI memory module > > > > > > which is responsible for allocating and freeing memory, does not have > > > > > > any visibility of the memory that is being used by the LMB > > > > > > module. This is further complicated by the fact that the LMB > > > > > > allocations are caller specific -- the LMB memory map is not global > > > > > > nor persistent. This means that the memory "allocated" by the LMB > > > > > > module might be relevant only for a given function. Hence one of the > > > > > > requirements for making the memory usage visible across modules is to > > > > > > make LMB allocations persistent and global, and then have means to > > > > > > communicate the use of memory across modules. > > > > > > > > > > > > The first set of patches in this series work on making the LMB memory > > > > > > map persistent and global. This is being done keeping in mind the > > > > > > usage of LMB memory by platforms where the same memory region can be > > > > > > used to load multiple different images. What is not allowed is to > > > > > > overwrite memory that has been allocated by the other module, > > > > > > currently the EFI memory module. This is being achieved by introducing > > > > > > a new flag, LMB_NOOVERWRITE, which represents memory which cannot be > > > > > > re-requested once allocated. > > > > > > > > > > > > A review comment on the earlier version was to do away with the static > > > > > > arrays for the LMB lists of free and used memory. This version > > > > > > uses the alloced list data structure for the LMB lists. > > > > > > > > > > > > The second set of patches are making changes to the EFI memory module > > > > > > to make use of the LMB functions to allocate and free memory. A > > > > > > *_flags() version of LMB API's has been introduced for the same. The > > > > > > earlier version was using notification mechanism from both LMB and EFI > > > > > > modules to maintain memory coherence. This version makes use of the > > > > > > LMB API functions for the memory allocations. This is based on review > > > > > > comments of EFI maintainers. > > > > > > > > > > On am64x_evm_a53, the last test in test/py/tests/test_net_boot.py fails > > > > > due to: > > > > > ... > > > > > TFTP from server 192.168.116.10; our IP address is 192.168.116.23 > > > > > Filename 'pxelinux.cfg/default-arm-k3'. > > > > > Load address: 0x80100000 > > > > > Loading: ################################################## 64 Bytes > > > > > 8.8 KiB/s > > > > > done > > > > > Bytes transferred = 64 (40 hex) > > > > > 1 pxe ready ethernet 0 port@1.bootdev.0 extlinux/extlinux.conf > > > > > ** Booting bootflow 'port@1.bootdev.0' with pxe > > > > > Retrieving file: pxelinux.cfg/default-arm > > > > > am65_cpsw_nuss_port ethernet@8000000port@1: K3 CPSW: rflow_id_base: 16 > > > > > link up on port 1, speed 1000, full duplex > > > > > Using ethernet@8000000port@1 device > > > > > TFTP from server 192.168.116.10; our IP address is 192.168.116.23 > > > > > Filename 'pxelinux.cfg/default-arm'. > > > > > > > > > > TFTP error: trying to overwrite reserved memory... > > > > > Couldn't retrieve pxelinux.cfg/default-arm > > > > > > > > So this seems to be failing because the address used to load the pxe > > > > config file seems to be overlapping with an already reserved region of > > > > memory. Can you please check if modifying the address works? > > > > > > I'm not sure what address you're thinking of modifying but, this isn't > > > overwriting U-Boot itself so it's a case that needs to work. > > > > Can you please print the lmb memory map through bdinfo and share it > > with me. That will give some info on what is causing the issue. Thing > > is, with this patchset, if there is another reservation with a > > different flag(like LMB_NOMAP, LMB_NOOVERWRITE), this would cause the > > load to fail. > > Well hunh. I thought I had reproduced the issue before posting, but I > just pushed the same tree (I'm fairly certain) over to my lab and the > tests are passing now. So, lets just see what happens with the next > iteration of the series, sorry for the noise. Okay, I will put out the LMB only, non-rfc series once the CI has gone through fine. Btw, I hope you have seen my comment on irc about having the SPL_LMB config symbol as a bool, instead of def_bool y. Thanks. -sughosh > > -- > Tom
On Tue, Jul 23, 2024 at 08:21:11PM +0530, Sughosh Ganu wrote: > On Tue, 23 Jul 2024 at 20:18, Tom Rini <trini@konsulko.com> wrote: > > > > On Mon, Jul 22, 2024 at 11:07:45PM +0530, Sughosh Ganu wrote: > > > On Mon, 22 Jul 2024 at 23:03, Tom Rini <trini@konsulko.com> wrote: > > > > > > > > On Mon, Jul 22, 2024 at 11:58:18AM +0530, Sughosh Ganu wrote: > > > > > On Mon, 8 Jul 2024 at 19:32, Tom Rini <trini@konsulko.com> wrote: > > > > > > > > > > > > On Thu, Jul 04, 2024 at 01:04:56PM +0530, Sughosh Ganu wrote: > > > > > > > > > > > > > The aim of this patch series is to fix the current state of > > > > > > > incoherence between modules when it comes to memory usage. The primary > > > > > > > issue that this series is trying to fix is that the EFI memory module > > > > > > > which is responsible for allocating and freeing memory, does not have > > > > > > > any visibility of the memory that is being used by the LMB > > > > > > > module. This is further complicated by the fact that the LMB > > > > > > > allocations are caller specific -- the LMB memory map is not global > > > > > > > nor persistent. This means that the memory "allocated" by the LMB > > > > > > > module might be relevant only for a given function. Hence one of the > > > > > > > requirements for making the memory usage visible across modules is to > > > > > > > make LMB allocations persistent and global, and then have means to > > > > > > > communicate the use of memory across modules. > > > > > > > > > > > > > > The first set of patches in this series work on making the LMB memory > > > > > > > map persistent and global. This is being done keeping in mind the > > > > > > > usage of LMB memory by platforms where the same memory region can be > > > > > > > used to load multiple different images. What is not allowed is to > > > > > > > overwrite memory that has been allocated by the other module, > > > > > > > currently the EFI memory module. This is being achieved by introducing > > > > > > > a new flag, LMB_NOOVERWRITE, which represents memory which cannot be > > > > > > > re-requested once allocated. > > > > > > > > > > > > > > A review comment on the earlier version was to do away with the static > > > > > > > arrays for the LMB lists of free and used memory. This version > > > > > > > uses the alloced list data structure for the LMB lists. > > > > > > > > > > > > > > The second set of patches are making changes to the EFI memory module > > > > > > > to make use of the LMB functions to allocate and free memory. A > > > > > > > *_flags() version of LMB API's has been introduced for the same. The > > > > > > > earlier version was using notification mechanism from both LMB and EFI > > > > > > > modules to maintain memory coherence. This version makes use of the > > > > > > > LMB API functions for the memory allocations. This is based on review > > > > > > > comments of EFI maintainers. > > > > > > > > > > > > On am64x_evm_a53, the last test in test/py/tests/test_net_boot.py fails > > > > > > due to: > > > > > > ... > > > > > > TFTP from server 192.168.116.10; our IP address is 192.168.116.23 > > > > > > Filename 'pxelinux.cfg/default-arm-k3'. > > > > > > Load address: 0x80100000 > > > > > > Loading: ################################################## 64 Bytes > > > > > > 8.8 KiB/s > > > > > > done > > > > > > Bytes transferred = 64 (40 hex) > > > > > > 1 pxe ready ethernet 0 port@1.bootdev.0 extlinux/extlinux.conf > > > > > > ** Booting bootflow 'port@1.bootdev.0' with pxe > > > > > > Retrieving file: pxelinux.cfg/default-arm > > > > > > am65_cpsw_nuss_port ethernet@8000000port@1: K3 CPSW: rflow_id_base: 16 > > > > > > link up on port 1, speed 1000, full duplex > > > > > > Using ethernet@8000000port@1 device > > > > > > TFTP from server 192.168.116.10; our IP address is 192.168.116.23 > > > > > > Filename 'pxelinux.cfg/default-arm'. > > > > > > > > > > > > TFTP error: trying to overwrite reserved memory... > > > > > > Couldn't retrieve pxelinux.cfg/default-arm > > > > > > > > > > So this seems to be failing because the address used to load the pxe > > > > > config file seems to be overlapping with an already reserved region of > > > > > memory. Can you please check if modifying the address works? > > > > > > > > I'm not sure what address you're thinking of modifying but, this isn't > > > > overwriting U-Boot itself so it's a case that needs to work. > > > > > > Can you please print the lmb memory map through bdinfo and share it > > > with me. That will give some info on what is causing the issue. Thing > > > is, with this patchset, if there is another reservation with a > > > different flag(like LMB_NOMAP, LMB_NOOVERWRITE), this would cause the > > > load to fail. > > > > Well hunh. I thought I had reproduced the issue before posting, but I > > just pushed the same tree (I'm fairly certain) over to my lab and the > > tests are passing now. So, lets just see what happens with the next > > iteration of the series, sorry for the noise. > > Okay, I will put out the LMB only, non-rfc series once the CI has gone > through fine. Btw, I hope you have seen my comment on irc about having > the SPL_LMB config symbol as a bool, instead of def_bool y. Thanks. I'll investigate that further once I can poke at the code, thanks.