Hi, On 8/26/25 10:18 AM, Jason Gunthorpe wrote: > Add some general description and pull in the kdoc comments from the source > file to index most of the useful functions. > > Tested-by: Alejandro Jimenez <alejandro.j.jimenez@xxxxxxxxxx> > Signed-off-by: Jason Gunthorpe <jgg@xxxxxxxxxx> > --- > Documentation/driver-api/generic_pt.rst | 140 ++++++++++++++++++++++++ > Documentation/driver-api/index.rst | 1 + > 2 files changed, 141 insertions(+) > create mode 100644 Documentation/driver-api/generic_pt.rst > > diff --git a/Documentation/driver-api/generic_pt.rst b/Documentation/driver-api/generic_pt.rst > new file mode 100644 > index 00000000000000..45b05dafece814 > --- /dev/null > +++ b/Documentation/driver-api/generic_pt.rst > @@ -0,0 +1,140 @@ > +.. SPDX-License-Identifier: GPL-2.0 > + > +======================== > +Generic Radix Page Table > +======================== > + > +.. kernel-doc:: include/linux/generic_pt/common.h > + :doc: Generic Radix Page Table > + > +.. kernel-doc:: drivers/iommu/generic_pt/pt_defs.h > + :doc: Generic Page Table Language > + > +----- > +Usage > +----- > + > +Generic PT is structured as a multi-compilation system. Since each format > +provides an API using a common set of names there can be only one format active > +within a compilation unit. This design avoids function pointers around the low > +level API. > + > +Instead the function pointers can end up at the higher level API (ie map/unmap, (i.e., > +etc) and the per-format code can be directly inlined into the per-format etc.) > +compilation unit. For something like iommu each format will be compiled into a > +per-format iommu operations kernel module. > + I would s/iommu/IOMMU/g when it's used in a sentence as an acronym (i.e., not functions names, struct names, struct fields, file names, etc.). > +For this to work the .c file for each compilation unit will include both the > +format headers and the generic code for the implementation. For instance in an > +implementation compilation unit the headers would normally be included as > +follows:: > + > + #include <linux/generic_pt/common.h> > + #include "fmt/defs_amdv1.h" > + #include "pt_defs.h" > + #include "fmt/amdv1.h" > + #include "pt_common.h" > + #include "pt_iter.h" > + #include "iommut_pt.h" /* The iommu implementation */ > + > +iommu_pt.h includes definitions that will generate the operations functions for > +map/unmap/etc using the definitions provided by AMDv1. The resulting module etc. > +will have exported symbols named like pt_iommu_amdv1_init(). > + > +Refer to drivers/iommu/generic-pt/fmt/iommu_template.h for an example of how the > +iommu implementation uses multi-compilation to generate per-format ops structs > +pointers. > + > +The format code is written so that the common names arise from #defines to > +distinct format specific names. This is intended to aid debuggability by > +avoiding symbol clashes across all the different formats. > + > +Exported symbols and other global names are mangled using a per-format string > +via the NS() helper macro. > + > +The format uses struct pt_common as the top level struct for the table, top-level > +and each format will have its own struct pt_xxx which embeds it to store > +format-specific information. > + > +The implementation will further wrapper this in its own top level struct, such this??? top-level oh, maybe wrap this top-level > +as struct pt_iommu_amdv1. > + > +---------------------------------------------- > +Format functions at the struct pt_common level > +---------------------------------------------- > + > +.. kernel-doc:: include/linux/generic_pt/common.h > + :identifiers: > +.. kernel-doc:: drivers/iommu/generic_pt/pt_common.h > + > +----------------- > +Iteration Helpers > +----------------- > + > +.. kernel-doc:: drivers/iommu/generic_pt/pt_iter.h > + > +---------------- > +Writing a Format > +---------------- > + > +It is best to start from a simple format that is similar to the target. x86_64 > +is usually a good reference for something simple, and AMDv1 is something fairly > +complete. > + > +The required inline functions need to be implemented in the format header. > +These should all follow the standard pattern of:: > + > + static inline pt_oaddr_t amdv1pt_entry_oa(const struct pt_state *pts) > + { > + [..] > + } > + #define pt_entry_oa amdv1pt_entry_oa > + > +Where a uniquely named per-format inline function provides the implementation where > +and a define maps it to the generic name. This is intended to make debug symbols > +work better. inline functions should always be used as the prototypes in > +pt_common.h will cause the compiler to validate the function signature to > +prevent errors. > + > +Review pt_fmt_defaults.h to understand some of the optional inlines. > + > +Once the format compiles then it should be run through the generic page table > +kunit test in kunit_generic_pt.h using kunit. For example:: > + > + $ tools/testing/kunit/kunit.py run --build_dir build_kunit_x86_64 --arch x86_64 --kunitconfig ./drivers/iommu/generic_pt/.kunitconfig amdv1_fmt_test.* > + [...] > + [11:15:08] Testing complete. Ran 9 tests: passed: 9 > + [11:15:09] Elapsed time: 3.137s total, 0.001s configuring, 2.368s building, 0.311s running > + > +The generic tests are intended to prove out the format functions and give > +clearer failures to speed up finding the problems. Once those pass then the > +entire kunit suite should be run. > + > +--------------------------- > +IOMMU Invalidation Features > +--------------------------- > + > +Invalidation is how the page table algorithms synchronize with a HW cache of the > +pagetable memory, typically called the TLB (or IOTLB for IOMMU cases). page table to match the rest of this document. > + > +The TLB can store present PTEs, non-present PTEs and table pointers, depending > +on its design. Every HW has its own approach on how to describe what has changed > +to get changed items removed from the TLB. to have changed items removed > + > +PT_FEAT_FLUSH_RANGE > +------------------- > + > +PT_FEAT_FLUSH_RANGE is the easiest scheme to understand. It tries to generate a > +single range invalidation for each operation, over invalidating if there are over-invalidating > +gaps of VA that don't need invalidation. This trades off impacted VA for number > +of invalidation operations. It does not keep track of what is being invalidated, invalidated; > +however if pages have to be freed then page table pointers have to be cleaned however, > +from the walk cache. The range can start/end at any page boundary. > + > +PT_FEAT_FLUSH_RANGE_NO_GAPS > +--------------------------- > + > +PT_FEAT_FLUSH_RANGE_NO_GAPS is similar to PT_FEAT_FLUSH_RANGE however it tries PT_FEAT_FLUSH_RANGE; however, it tries > +to minimize the amount of impacted VA by issuing extra flush operations. This is > +useful if the cost of processing VA is very high, for instance because a > +hypervisor is processing the page table with a shadowing algorithm. -- ~Randy