YiPeng Chai
127cdd726f
drm/amd/ras: ras supports i2c eeprom for mp1 v13_0_12
...
Add ras support for the i2c eeprom on mp1 v13_0_12.
Signed-off-by: YiPeng Chai <YiPeng.Chai@amd.com>
Acked-by: Alex Deucher <alexander.deucher@amd.com>
Reviewed-by: Tao Zhou <tao.zhou1@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
2025-11-06 09:57:07 -05:00
YiPeng Chai
d95ca7f515
drm/amdgpu: suspend ras module before gpu reset
...
During gpu reset, all GPU-related resources are
inaccessible. To avoid affecting ras functionality,
suspend the ras module before gpu reset and resume
it after the reset is complete.
V2:
Rename functions to avoid misunderstanding.
V3:
Move flush_delayed_work to amdgpu_ras_process_pause,
Move schedule_delayed_work to amdgpu_ras_process_unpause.
V4:
Rename functions.
V5:
Move the function to amdgpu_ras.c.
Signed-off-by: YiPeng Chai <YiPeng.Chai@amd.com>
Reviewed-by: Tao Zhou <tao.zhou1@amd.com>
Reviewed-by: Hawking Zhang <Hawking.Zhang@amd.com>
Acked-by: Lijo Lazar <lijo.lazar@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
2025-11-04 11:53:59 -05:00
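The pause/unpause ordering described in the V3 note can be illustrated with a small user-space model. This is only a sketch under stated assumptions: every `sketch_*` name is hypothetical, the model is single-threaded, and the "flush" and "schedule" steps merely stand in for the kernel's `flush_delayed_work()` and `schedule_delayed_work()`. It is not the code from this commit.

```c
#include <assert.h>
#include <stdbool.h>

/* Hypothetical single-threaded model of a ras event processor. */
struct sketch_ras_proc {
	bool paused;                 /* set while a gpu reset is in progress */
	bool work_queued;            /* models a pending delayed work item */
	unsigned int pending_events; /* events reported but not yet handled */
	unsigned int handled_events; /* events the worker has processed */
};

/* Models the body of the delayed work handler. */
static void sketch_run_work(struct sketch_ras_proc *p)
{
	p->handled_events += p->pending_events;
	p->pending_events = 0;
	p->work_queued = false;
}

/* Record an event; only queue work when processing is not paused. */
static void sketch_report_event(struct sketch_ras_proc *p)
{
	p->pending_events++;
	if (!p->paused)
		p->work_queued = true;
}

/* Pause: block new work, then drain what is already queued (the "flush"). */
static void sketch_pause(struct sketch_ras_proc *p)
{
	p->paused = true;
	if (p->work_queued)
		sketch_run_work(p);
	assert(!p->work_queued);
}

/* Unpause: allow work again and requeue anything that arrived meanwhile. */
static void sketch_unpause(struct sketch_ras_proc *p)
{
	p->paused = false;
	if (p->pending_events)
		p->work_queued = true;
}
```

In this model, an event reported between `sketch_pause()` and `sketch_unpause()` is held back and only queued once the reset window closes, so the worker never runs while the device is assumed to be inaccessible.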
YiPeng Chai
3f16007d86
drm/amd/ras: Add ras support for umc v12_5_0
...
Add ras support for umc v12_5_0.
Signed-off-by: YiPeng Chai <YiPeng.Chai@amd.com>
Reviewed-by: Tao Zhou <tao.zhou1@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
2025-11-04 11:53:22 -05:00
YiPeng Chai
d7f105a402
drm/amd/ras: Add ras support for nbio v7_9_1
...
Add ras support for nbio v7_9_1.
Signed-off-by: YiPeng Chai <YiPeng.Chai@amd.com>
Reviewed-by: Tao Zhou <tao.zhou1@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
2025-11-04 11:53:22 -05:00
YiPeng Chai
36265d2bcc
drm/amd/ras: Increase ras switch control range
...
Increase ras switch control range.
Signed-off-by: YiPeng Chai <YiPeng.Chai@amd.com>
Reviewed-by: Hawking Zhang <Hawking.Zhang@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
2025-11-04 11:53:22 -05:00
YiPeng Chai
fd0e35bdd3
drm/amd/ras: Fix the error of undefined reference to `__udivdi3'
...
Fix the error:
drivers/gpu/drm/amd/amdgpu/../ras/ras_mgr/amdgpu_ras_mgr.c:132:undefined reference to `__udivdi3'
Fixes: fa0b203cd9 ("drm/amd/ras: Add amdgpu ras management function.")
Reported-by: kernel test robot <lkp@intel.com>
Closes: https://lore.kernel.org/oe-kbuild-all/202510272144.6SUHUoWx-lkp@intel.com/
Signed-off-by: YiPeng Chai <YiPeng.Chai@amd.com>
Reviewed-by: Tao Zhou <tao.zhou1@amd.com>
Reviewed-by: Yang Wang <kevinyang.wang@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
2025-11-04 11:51:30 -05:00
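For context on the `__udivdi3` error above: on 32-bit targets the compiler lowers a plain 64-bit `/` into a call to the libgcc helper `__udivdi3`, which the kernel does not link against, so kernel code normally goes through helpers such as `div_u64()` from `<linux/math64.h>`. The commit's actual change is not shown here. The block below is only a user-space sketch of what such a helper computes; `sketch_div_u64` is a hypothetical name, and the shift-and-subtract loop is an illustration, not the kernel's implementation.

```c
#include <assert.h>
#include <stdint.h>

/*
 * Hypothetical stand-in for a 64-by-32-bit unsigned division helper.
 * It uses only shifts, compares and subtraction, so the compiler has
 * no 64-bit '/' to turn into a library call.
 */
static uint64_t sketch_div_u64(uint64_t dividend, uint32_t divisor)
{
	uint64_t quotient = 0;
	uint64_t remainder = 0;
	int bit;

	assert(divisor != 0); /* division by zero is not handled */

	/* Classic long division, one bit of the dividend per step. */
	for (bit = 63; bit >= 0; bit--) {
		/* remainder < divisor <= 2^32, so the shift cannot overflow */
		remainder = (remainder << 1) | ((dividend >> bit) & 1u);
		if (remainder >= divisor) {
			remainder -= divisor;
			quotient |= (uint64_t)1 << bit;
		}
	}
	return quotient;
}
```

As a usage example, converting a nanosecond count to milliseconds would call `sketch_div_u64(ns, 1000000)` instead of writing `ns / 1000000`; in real kernel code the equivalent call would be `div_u64(ns, 1000000)`.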
YiPeng Chai
25c1e7414b
drm/amd/ras: Update function and remove redundant code
...
Update function and remove redundant code:
1. Update function to prepare for internal use.
2. Remove unused function code previously prepared
for ioctl.
V2:
Update commit message content.
Signed-off-by: YiPeng Chai <YiPeng.Chai@amd.com>
Reviewed-by: Tao Zhou <tao.zhou1@amd.com>
Reviewed-by: Hawking Zhang <Hawking.Zhang@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
2025-10-20 18:27:37 -04:00
YiPeng Chai
4c74635afd
drm/amd/ras: Update ras command context structure name
...
Based on how this structure is actually used,
"context" is the more appropriate name; a
structure name containing "ioctl" is easily
misunderstood.
V2:
Update commit message content.
Signed-off-by: YiPeng Chai <YiPeng.Chai@amd.com>
Reviewed-by: Tao Zhou <tao.zhou1@amd.com>
Reviewed-by: Hawking Zhang <Hawking.Zhang@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
2025-10-20 18:27:33 -04:00
YiPeng Chai
6588766d08
drm/amdgpu: Enable ras module
...
Enable ras module, disabled by default.
Signed-off-by: YiPeng Chai <YiPeng.Chai@amd.com>
Reviewed-by: Tao Zhou <tao.zhou1@amd.com>
Reviewed-by: Hawking Zhang <Hawking.Zhang@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
2025-10-20 18:26:05 -04:00
YiPeng Chai
7169e706c8
drm/amdgpu: Add ras module ip block to amdgpu discovery
...
Add ras module ip block to amdgpu discovery.
Signed-off-by: YiPeng Chai <YiPeng.Chai@amd.com>
Reviewed-by: Tao Zhou <tao.zhou1@amd.com>
Reviewed-by: Hawking Zhang <Hawking.Zhang@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
2025-10-20 18:25:54 -04:00
YiPeng Chai
408bd841ad
drm/amdgpu: Improve ras fatal error handling function
...
In the multi-gpu case, a single fatal error generates
several fatal error interrupts. With this improvement,
the ras module can reuse the function to handle only
the first interrupt.
V3:
Initialize event_id using RAS_EVENT_INVALID_ID.
Signed-off-by: YiPeng Chai <YiPeng.Chai@amd.com>
Reviewed-by: Tao Zhou <tao.zhou1@amd.com>
Reviewed-by: Hawking Zhang <Hawking.Zhang@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
2025-10-20 18:18:35 -04:00
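One common way to make sure that only the first of several duplicate interrupts is handled is an atomic compare-and-swap on a shared event id that starts at an invalid sentinel. The block below is a user-space sketch of that idea using C11 atomics. It assumes nothing about how the driver does it: all `sketch_*` names and the value of `SKETCH_EVENT_INVALID_ID` are hypothetical and only echo the `RAS_EVENT_INVALID_ID` initialization mentioned in the V3 note.

```c
#include <assert.h>
#include <stdatomic.h>
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical sentinel; the real RAS_EVENT_INVALID_ID value is not shown. */
#define SKETCH_EVENT_INVALID_ID UINT64_MAX

/* Shared by every gpu that may report the same fatal error. */
struct sketch_fatal_state {
	atomic_uint_fast64_t event_id;
};

static void sketch_fatal_state_init(struct sketch_fatal_state *s)
{
	atomic_init(&s->event_id, SKETCH_EVENT_INVALID_ID);
}

/*
 * Returns true only for the caller that moves event_id away from the
 * sentinel. Later callers see a valid id already stored and back off.
 */
static bool sketch_claim_fatal_event(struct sketch_fatal_state *s,
				     uint64_t new_id)
{
	uint_fast64_t expected = SKETCH_EVENT_INVALID_ID;

	assert(new_id != SKETCH_EVENT_INVALID_ID);
	return atomic_compare_exchange_strong(&s->event_id, &expected, new_id);
}
```

Each interrupt handler would call `sketch_claim_fatal_event()` and run the fatal error path only when it returns true; resetting the id to the sentinel after recovery would re-arm the check.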
YiPeng Chai
3d72d2e5f4
drm/amdgpu: Intercept ras interrupts to ras module
...
Intercept ras interrupts to ras module.
V2:
Change function names in ras module.
Signed-off-by: YiPeng Chai <YiPeng.Chai@amd.com>
Reviewed-by: Tao Zhou <tao.zhou1@amd.com>
Reviewed-by: Hawking Zhang <Hawking.Zhang@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
2025-10-20 18:18:26 -04:00
YiPeng Chai
ace232eff5
drm/amdgpu: Add ras module files into amdgpu
...
Add ras module files into amdgpu.
Signed-off-by: YiPeng Chai <YiPeng.Chai@amd.com>
Reviewed-by: Tao Zhou <tao.zhou1@amd.com>
Reviewed-by: Hawking Zhang <Hawking.Zhang@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
2025-10-13 14:14:36 -04:00
YiPeng Chai
7e1252105e
drm/amd/ras: Add files to amdgpu ras manager makefile
...
Add files to amdgpu ras manager makefile.
Signed-off-by: YiPeng Chai <YiPeng.Chai@amd.com>
Reviewed-by: Tao Zhou <tao.zhou1@amd.com>
Reviewed-by: Hawking Zhang <Hawking.Zhang@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
2025-10-13 14:14:31 -04:00
YiPeng Chai
fa0b203cd9
drm/amd/ras: Add amdgpu ras management function.
...
Add amdgpu system configuration parameters and
functions needed by rascore.
Signed-off-by: YiPeng Chai <YiPeng.Chai@amd.com>
Reviewed-by: Tao Zhou <tao.zhou1@amd.com>
Reviewed-by: Hawking Zhang <Hawking.Zhang@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
2025-10-13 14:14:31 -04:00
YiPeng Chai
e221ac6f42
drm/amd/ras: Amdgpu preprocesses ras interrupts
...
Amdgpu preprocesses ras interrupts.
Signed-off-by: YiPeng Chai <YiPeng.Chai@amd.com>
Reviewed-by: Tao Zhou <tao.zhou1@amd.com>
Reviewed-by: Hawking Zhang <Hawking.Zhang@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
2025-10-13 14:14:30 -04:00
YiPeng Chai
ffdab7f4e5
drm/amd/ras: Add amdgpu ras system functions
...
Add amdgpu ras system functions.
Signed-off-by: YiPeng Chai <YiPeng.Chai@amd.com>
Reviewed-by: Tao Zhou <tao.zhou1@amd.com>
Reviewed-by: Hawking Zhang <Hawking.Zhang@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
2025-10-13 14:14:30 -04:00
YiPeng Chai
b658fadf1c
drm/amd/ras: Amdgpu handle ras ioctl command
...
Amdgpu handles ras ioctl commands.
V2:
Remove non-standard device information.
Signed-off-by: YiPeng Chai <YiPeng.Chai@amd.com>
Reviewed-by: Tao Zhou <tao.zhou1@amd.com>
Reviewed-by: Hawking Zhang <Hawking.Zhang@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
2025-10-13 14:14:30 -04:00
YiPeng Chai
764e868928
drm/amd/ras: Add amdgpu eeprom i2c configuration function
...
Add amdgpu eeprom i2c configuration function.
Signed-off-by: YiPeng Chai <YiPeng.Chai@amd.com>
Reviewed-by: Tao Zhou <tao.zhou1@amd.com>
Reviewed-by: Hawking Zhang <Hawking.Zhang@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
2025-10-13 14:14:30 -04:00
YiPeng Chai
585fe8f3b3
drm/amd/ras: Add amdgpu mp1 v13_0 configuration function
...
Add amdgpu mp1 v13_0 configuration function.
Signed-off-by: YiPeng Chai <YiPeng.Chai@amd.com>
Reviewed-by: Tao Zhou <tao.zhou1@amd.com>
Reviewed-by: Hawking Zhang <Hawking.Zhang@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
2025-10-13 14:14:30 -04:00
YiPeng Chai
960cc53ca8
drm/amd/ras: Add amdgpu nbio v7_9 configuration function
...
Add amdgpu nbio v7_9 configuration function.
Signed-off-by: YiPeng Chai <YiPeng.Chai@amd.com>
Reviewed-by: Tao Zhou <tao.zhou1@amd.com>
Reviewed-by: Hawking Zhang <Hawking.Zhang@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
2025-10-13 14:14:30 -04:00