GSP Shared Memory: Difference between revisions
PSC + small edits |
m Use conventional names |
||
(5 intermediate revisions by the same user not shown) | |||
Line 34: | Line 34: | ||
GSP fills the interrupt list, then triggers the event set with [[GSPGPU:RegisterInterruptRelayQueue|RegisterInterruptRelayQueue]] for the specified process(es). | GSP fills the interrupt list, then triggers the event set with [[GSPGPU:RegisterInterruptRelayQueue|RegisterInterruptRelayQueue]] for the specified process(es). | ||
PDC interrupts are sent to all processes; other interrupts are only sent to the process with | PDC interrupts are sent to all processes; other interrupts are only sent to the process with rendering rights. | ||
When issuing a [[GSP_Shared_Memory#Trigger_Memory_Fill|Memory Fill]] command with both buffers set GSP will only dispatch PSC0. | |||
= Framebuffer Info = | = Framebuffer Info = | ||
Line 96: | Line 98: | ||
See [[Configuration Memory]]. | See [[Configuration Memory]]. | ||
= Command Queue = | = GX Command Queue = | ||
This command queue is located at sharedMemBase + 0x800 + (clientID * 0x200). It consists of an header followed by at most 15 command entries. | |||
The queue header has the following structure: | |||
{| class="wikitable" border="1" | {| class="wikitable" border="1" | ||
Line 116: | Line 116: | ||
|- | |- | ||
| 2 | | 2 | ||
| Status ( | | Status (0x1 = halted, 0x80 = error) | ||
|- | |- | ||
| 3 | | 3 | ||
Line 125: | Line 125: | ||
|} | |} | ||
After adding a command, [[GSPGPU:TriggerCmdReqQueue|TriggerCmdReqQueue]] must be used to start command processing (official code does so when the total commands field is 1). | |||
== Commands == | |||
A command entry is made of 8 words. The first word is the command header, subsequent words represent command specific parameters. | |||
The command header has the following structure: | |||
{| class="wikitable" border="1" | {| class="wikitable" border="1" | ||
Line 147: | Line 151: | ||
|} | |} | ||
Addresses specified in command parameters are virtual addresses. Depending on the command, there might be constraints on the accepted parameters. In general, some commands require parameters to be aligned, and addresses are expected to be on [[Memory_Management#Memory_Mapping|linear]], [[Memory_layout#0x1F000000_.28New_3DS_only.29|QTM]] or VRAM memory. | |||
Addresses specified in parameters are virtual addresses. | |||
=== | === RequestDMA === | ||
{| class="wikitable" border="1" | {| class="wikitable" border="1" | ||
Line 177: | Line 179: | ||
|} | |} | ||
This command issues a [[Corelink_DMA_Engines|DMA request]] as the | This command issues a [[Corelink_DMA_Engines|DMA request]] as the process with [[GSPGPU:AcquireRight|rendering rights]]. When the destination address is within VRAM, GSP places itself as the destination process: this makes it possible to transfer data in VRAM without needing it listed in the destination process [[NCCH/Extended_Header#ARM11_Kernel_Capabilities|exheader mappings]]. Otherwise, both source and destination of the DMA request are the process with rendering rights. | ||
The source buffer must be mapped as readable in the source process, while the destination address must be mapped as writable in the destination process, otherwise GSP calls [[SVC|svcBreak]]. When flushing is enabled and the source address is above VRAM, svcFlushProcessDataCache is used to flush the source buffer. | |||
Any process must have acquired rendering rights, otherwise the command does nothing. | |||
=== | === ProcessCommandList === | ||
{| class="wikitable" border="1" | {| class="wikitable" border="1" | ||
Line 209: | Line 213: | ||
This command sets the [[GPU/External_Registers#Command_List|Command List registers]], and optionally updates gas additive blend results after command processing has ended. | This command sets the [[GPU/External_Registers#Command_List|Command List registers]], and optionally updates gas additive blend results after command processing has ended. | ||
No error checking is performed on the parameters. Address and size should be both aligned to 8 bytes, and the address should be in linear, QTM or VRAM memory, otherwise PA 0 is used. When flushing is enabled, svcFlushProcessDataCache is used to flush the buffer. | No error checking is performed on the parameters. Address and size should be both aligned to 8 bytes, and the address should be in linear, QTM or VRAM memory, otherwise PA 0 is used. When flushing is enabled, [[SVC|svcFlushProcessDataCache]] is used to flush the buffer on the process that has acquired rendering rights. | ||
=== | Any process must have acquired rendering rights, otherwise the command does nothing. | ||
=== MemoryFill === | |||
{| class="wikitable" border="1" | {| class="wikitable" border="1" | ||
Line 243: | Line 249: | ||
|} | |} | ||
This command sets the | This command sets the [[GPU/External_Registers#Memory_Fill|Memory Fill registers]]. | ||
Addresses should be aligned to 8 bytes and must be in linear, QTM or VRAM memory, otherwise error 0xE0E02BF5 (GSP_INVALID_ADDRESS) is returned. The start address for a buffer must be | Addresses should be aligned to 8 bytes and must be in linear, QTM or VRAM memory, otherwise error 0xE0E02BF5 (GSP_INVALID_ADDRESS) is returned. The start address for a buffer must be below its end address, else the same error is returned. If the start address for a buffer is 0, that buffer is skipped; otherwise, its relative PSC unit is used for the fill operation. | ||
=== | === DisplayTransfer === | ||
{| class="wikitable" border="1" | {| class="wikitable" border="1" | ||
Line 258: | Line 264: | ||
|- | |- | ||
| 1 | | 1 | ||
| | | Source address | ||
|- | |- | ||
| 2 | | 2 | ||
| | | Destination address | ||
|- | |- | ||
| 3 | | 3 | ||
| | | Source dimensions | ||
|- | |- | ||
| 4 | | 4 | ||
| Output | | Output dimensions | ||
|- | |- | ||
| 5 | | 5 | ||
| Flags | |||
|- | |- | ||
| 7-6 | | 7-6 | ||
Line 276: | Line 282: | ||
|} | |} | ||
This command | This command sets the [[GPU/External_Registers#Transfer_Engine|Display Transfer registers]]. | ||
No error checking is performed on the parameters. Addresses should be aligned to 8 bytes and should be in linear, QTM or VRAM memory, otherwise PA 0 is used. | |||
=== | === TextureCopy === | ||
{| class="wikitable" border="1" | {| class="wikitable" border="1" | ||
Line 295: | Line 297: | ||
|- | |- | ||
| 1 | | 1 | ||
| | | Source address | ||
|- | |- | ||
| 2 | | 2 | ||
| | | Destination address | ||
|- | |- | ||
| 3 | | 3 | ||
| | | Size | ||
|- | |- | ||
| 4 | | 4 | ||
| | | Line width <nowiki>|</nowiki> (gap << 16) | ||
|- | |- | ||
| 5 | | 5 | ||
| Same as | | Same as above, for the destination | ||
|- | |- | ||
| 6 | | 6 | ||
| Flags | | Flags | ||
|- | |- | ||
| 7 | | 7 | ||
Line 316: | Line 318: | ||
|} | |} | ||
This command | This command sets the [[GPU/External_Registers#TextureCopy|Texture Copy registers]]. Note that GSP doesn't enforce bit3 of the flags to be set. | ||
=== | No error checking is performed on the parameters. Addresses and size should be aligned to 8 bytes, and the addresses should be in linear, QTM or VRAM memory, otherwise PA 0 is used. | ||
=== FlushCacheRegions === | |||
{| class="wikitable" border="1" | {| class="wikitable" border="1" | ||
Line 329: | Line 333: | ||
|- | |- | ||
| 1 | | 1 | ||
| | | Buffer 0 address | ||
|- | |- | ||
| 2 | | 2 | ||
| | | Buffer 0 size | ||
|- | |- | ||
| 3 | | 3 | ||
| | | Buffer 1 address | ||
|- | |- | ||
| 4 | | 4 | ||
| | | Buffer 1 size | ||
|- | |- | ||
| 5 | | 5 | ||
| | | Buffer 2 address | ||
|- | |- | ||
| 6 | | 6 | ||
| | | Buffer 2 size | ||
|- | |- | ||
| 7 | | 7 | ||
Line 350: | Line 354: | ||
|} | |} | ||
This command calls svcFlushProcessDataCache for each buffer on the process that has acquired rendering rights. | |||
If any call fails, its error is returned; If any buffer has size 0, the buffer is skipped. In both cases, subsequent buffers are not processed. | |||
Any process must have acquired rendering rights, otherwise the error 0xD8202A06 (GSP_NO_RIGHT) is returned. | |||
== Bugs == | |||
* When issuing a DMA request, GSP attempts to acquire an internal semaphore that rules CDMA access; this semaphore is never released on failure paths. While this is generally not an issue, as GSP breaks on DMA failures, it becomes a problem if the DMA request is done with cache flushing: in that case, GSP will error silently, causing a deadlock in DMA code. | |||
* When handling GX commands apart from RequestDMA and ProcessCommandList, GSP sets the relative busy flags in internal state before executing the commands. This means that, if the relevant interrupts are never triggered (eg. on invalid parameters), the busy flags never get reset, preventing execution of future commands of the same kind. |