path: root/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c
Age | Commit message | Author | Files | Lines
2025-01-06 | drm/amdgpu: Fix error handling in amdgpu_ras_add_bad_pages | Srinivasan Shanmugam | 1 | -5/+16
2024-12-18 | drm/amdgpu: Enable psp v14_0_3 RAS support for non-SRIOV configurations. | Candice Li | 1 | -1/+1
2024-12-10 | drm/amdgpu: Support nbif v6_3_1 fatal error handling | Candice Li | 1 | -0/+12
2024-12-10 | drm/amdgpu: Add psp v14_0_3 ras support | Candice Li | 1 | -0/+1
2024-12-10 | drm/amdgpu: Enable RAS for psp v13_0_12 | Hawking Zhang | 1 | -0/+5
2024-12-10 | drm/amdgpu: correct the calculation of RAS bad page | Tao Zhou | 1 | -8/+2
2024-12-10 | drm/amdgpu: split ras_eeprom_init into init and check functions | Tao Zhou | 1 | -4/+11
2024-12-10 | drm/amdgpu: remove is_mca_add for ras_add_bad_pages | Tao Zhou | 1 | -11/+5
2024-12-10 | drm/amdgpu: parse legacy RAS bad page mixed with new data in various NPS modes | Tao Zhou | 1 | -15/+81
2024-12-10 | drm/amdgpu: support to find RAS bad pages via old TA | Tao Zhou | 1 | -3/+25
2024-12-10 | drm/amdgpu: store only one RAS bad page record for all pages in one row | Tao Zhou | 1 | -8/+27
2024-12-10 | drm/amdgpu: Prefer RAS recovery for scheduler hang | Lijo Lazar | 1 | -2/+53
2024-12-10 | drm/amdgpu: do RAS MCA2PA conversion in device init phase | Tao Zhou | 1 | -12/+82
2024-12-10 | drm/amdgpu: add flag to indicate the type of RAS eeprom record | Tao Zhou | 1 | -7/+26
2024-11-20 | drm/amdgpu: Use reset recovery state checks | Lijo Lazar | 1 | -5/+5
2024-11-11 | drm/amdgpu: Implement virt req_ras_err_count | Victor Skvortsov | 1 | -7/+65
2024-11-11 | drm/amdgpu: VF Query RAS Caps from Host if supported | Victor Skvortsov | 1 | -0/+5
2024-11-04 | drm/amdgpu: Skip IP coredump for RAS errors | Lijo Lazar | 1 | -0/+1
2024-09-27 | drm/amdgpu: Refactor XGMI reset on init handling | Lijo Lazar | 1 | -6/+0
2024-09-27 | drm/amdgpu: Add helper to initialize badpage info | Lijo Lazar | 1 | -18/+38
2024-09-27 | drm/amdgpu: Use init level for pending_reset flag | Lijo Lazar | 1 | -1/+1
2024-09-27 | amd/amdgpu: Reduce unnecessary repetitive GPU resets | YiPeng Chai | 1 | -1/+20
2024-09-18 | drm/amdgpu: fix typo in the comment | Yan Zhen | 1 | -1/+1
2024-09-17 | drm/amdgpu: disable GPU RAS bad page feature for specific ASIC | Tao Zhou | 1 | -0/+5
2024-08-06 | drm/amdgpu: remove RAS unused paramter 'err_addr' | Yang Wang | 1 | -9/+9
2024-08-06 | drm/amdgpu: create function to check RAS RMA status | Tao Zhou | 1 | -6/+16
2024-08-06 | drm/amdgpu: Add more types for boot time error reporting | Hawking Zhang | 1 | -0/+10
2024-07-24 | drm/amdgpu: Remove unused code | YiPeng Chai | 1 | -23/+0
2024-07-10 | drm/amdgpu: timely save bad pages to eeprom after gpu ras reset is completed | YiPeng Chai | 1 | -1/+5
2024-07-10 | drm/amdgpu: flush all cached ras bad pages to eeprom | YiPeng Chai | 1 | -6/+29
2024-07-08 | drm/amdgpu: add ras event state device attribute support | Yang Wang | 1 | -4/+52
2024-07-08 | drm/amdgpu: add ras POSION_CONSUMPTION event id support | Yang Wang | 1 | -3/+13
2024-07-08 | drm/amdgpu: add ras POSION_CREATION event id support | Yang Wang | 1 | -3/+14
2024-07-08 | drm/amdgpu: refine amdgpu ras event id core code | Yang Wang | 1 | -18/+84
2024-07-08 | drm/amdgpu: sysfs node disable query error count during gpu reset | YiPeng Chai | 1 | -0/+3
2024-07-01 | drm/amdgpu: Fix hbm stack id in boot error report | Hawking Zhang | 1 | -1/+1
2024-06-28 | drm/amdgpu: add gpu reset check and exception handling | YiPeng Chai | 1 | -0/+53
2024-06-28 | drm/amdgpu: refine poison consumption interrupt handler | YiPeng Chai | 1 | -18/+37
2024-06-28 | drm/amdgpu: refine poison creation interrupt handler | YiPeng Chai | 1 | -22/+17
2024-06-28 | drm/amdgpu: add variable to record the deferred error number read by driver | YiPeng Chai | 1 | -18/+44
2024-06-14 | drm/amdgpu: set RAS fed status for more cases | Tao Zhou | 1 | -0/+1
2024-06-14 | drm/amdgpu: create amdgpu_ras_in_recovery to simplify code | Tao Zhou | 1 | -12/+19
2024-06-14 | drm/amdgpu: trigger mode1 reset for RAS RMA status | Tao Zhou | 1 | -6/+22
2024-06-14 | drm/amdgpu: move aca/mca init functions into ras_init() stage | Yang Wang | 1 | -23/+50
2024-06-14 | drm/amdgpu: add reset source in various cases | Eric Huang | 1 | -0/+1
2024-06-05 | drm/amdgpu: add RAS is_rma flag | Tao Zhou | 1 | -5/+4
2024-06-05 | drm/amdgpu: Update programming for boot error reporting | Hawking Zhang | 1 | -54/+45
2024-06-05 | drm/amdgpu: Estimate RAS reservation when report capacity v2 | Hawking Zhang | 1 | -0/+20
2024-05-29 | drm/amdgpu: fix typo in amdgpu_ras_aca_sysfs_read() function | Yang Wang | 1 | -1/+1
2024-05-23 | drm/amdgpu: skip to create ras xxx_err_count node when ACA is enabled | Yang Wang | 1 | -0/+6