Ryujinx

mirror of https://git.naxdy.org/Mirror/Ryujinx.git synced 2025-02-18 15:23:36 +00:00

Author	SHA1	Message	Date
gdkchan	3c3bcd82fe	Add a sampler pool cache and improve texture pool cache (#3487 ) * Add a sampler pool cache and improve texture pool cache * Increase disposal timestamp delta more to be on the safe side * Nits * Use abstract class for PoolCache, remove factory callback	2022-07-27 21:07:48 -03:00
gdkchan	1a888ae087	Add support for conditional (with CC) shader Exit instructions (#3470 ) * Add support for conditional (with CC) shader Exit instructions * Shader cache version bump * Make CSM conditions default to false for EXIT.CC	2022-07-24 15:33:30 -03:00
gdkchan	7f8a3541eb	Fix decoding of block after shader BRA.CC instructions without predicate (#3472 ) * Fix decoding of block after BRA.CC instructions without predicate * Shader cache version bump	2022-07-23 11:53:14 -03:00
gdkchan	b34de74f81	Avoid adding shader buffer descriptors for constant buffers that are not used (#3478 ) * Avoid adding shader buffer descriptors for constant buffers that are not used * Shader cache version	2022-07-23 11:15:58 -03:00
riperiperi	5811d121df	Avoid scaling 2d textures that could be used as 3d (#3464 )	2022-07-15 09:24:13 -03:00
Logan Stromberg	6eb85e846f	Reduce some unnecessary allocations in DMA handler (#2886 ) * experimental changes to try and reduce allocations in kernel threading and DMA handler * Simplify the changes in this branch to just 1. Don't make unnecessary copies of data just for texture-texture transfers and 2. Add a fast path for 1bpp linear byte copies * forgot to check src + dst linearity in 1bpp DMA fast path. Fixes the UE4 regression. * removing dev log I left in * Generalizing the DMA linear fast path to cases other than 1bpp copies * revert kernel changes * revert whitespace * remove unneeded references * PR feedback Co-authored-by: Logan Stromberg <lostromb@microsoft.com> Co-authored-by: gdk <gab.dark.100@gmail.com>	2022-07-14 15:45:56 -03:00
gdkchan	4523a73f75	Propagate Shader phi nodes with the same source value from all blocks (#3457 ) * Propagate Shader phi nodes with the same source value from all incoming blocks * Shader cache version bump	2022-07-12 00:36:58 +02:00
gdkchan	b46b63e06a	Add support for alpha to coverage dithering (#3069 ) * Add support for alpha to coverage dithering * Shader cache version bump * Fix wrong alpha register * Ensure support buffer is cleared * New shader specialization based approach	2022-07-05 19:58:36 -03:00
gdkchan	5afd521c5a	Bindless elimination for constant sampler handle (#3424 ) * Bindless elimination for constant sampler handle * Shader cache version bump * Update TextureHandle.ReadPackedId for new bindless elimination	2022-07-02 15:03:35 -03:00
gdkchan	625f5fb88a	Account for pool change on texture bindings cache (#3420 ) * Account for pool change on texture bindings cache * Reduce the number of checks needed	2022-06-25 16:52:38 +02:00
gdkchan	e747f5cd83	Ensure texture ID is valid before getting texture descriptor (#3406 )	2022-06-24 02:41:57 +02:00
riperiperi	68f9091870	Account for res scale changes when updating bindings (#3403 ) Fixes a regression introduced by the texture bindings PR. Also renames TextureStatePerStage, as it's no longer per stage.	2022-06-17 17:41:38 -03:00
riperiperi	99ffc061d3	Optimize Texture Binding and Shader Specialization Checks (#3399 ) * Changes 1 * Changes 2 * Better ModifiedSequence handling This should handle PreciseEvents properly, and simplifies a few things. * Minor changes, remove debug log * Handle stage.Info being null Hopefully fixes Catherine crash * Fix shader specialization fast texture lookup * Fix some things. * Address Feedback Part 1 * Make method static.	2022-06-17 13:09:14 -03:00
gdkchan	851f56b08a	Support Array/3D depth-stencil render target, and single layer clears (#3400 ) * Support Array/3D depth-stencil render target, and single layer clears * Alignment	2022-06-14 13:30:39 -03:00
gdkchan	9a9349f0f4	Fix instanced indexed inline draw index count (#3389 )	2022-06-10 23:44:49 -03:00
gdkchan	46cc7b55f0	Fix instanced indexed inline draws (#3383 )	2022-06-05 21:24:28 -03:00
gdkchan	a3e7bb8eb4	Copy dependency for multisample and non-multisample textures (#3382 ) * Use copy dependency for textures that differs in multisample but are otherwise compatible * Remove allowMs flag as it's no longer required for correctness, it's just an optimization now * Dispose intermmediate pool	2022-06-05 14:06:47 -03:00
Billy Laws	d03124a992	Fix 3D semaphore counter type 0 handling (#3380 ) Counter type 0 actually releases the semaphore payload rather than a constant zero as was previously thought. This is required by Skyrim.	2022-06-02 19:51:36 -03:00
Emmanuel Hansen	deb99d2cae	Avalonia UI - Part 1 (#3270 ) * avalonia part 1 * remove vulkan ui backend * move ui common files to ui common project * get name for oading screen from device * rebase. * review 1 * review 1.1 * review * cleanup * addressed review * use cancellation token * review * review * rebased * cancel library loading when closing window * remove star image, use fonticon instead * delete render control frame buffer when game ends. change position of fav star * addressed @Thog review * ensure the right ui is downloaded in updates * fix crash when showing not supported dialog during controller request * add prefix to artifact names * Auto-format Avalonia project * Fix input * Fix build, simplify app disposal * remove nv stutter thread * addressed review * add missing change * maintain window size if new size is zero length * add game, handheld, docked to local * reverse scale main window * Update de_DE.json * Update de_DE.json * Update de_DE.json * Update italian json * Update it_IT.json * let render timer poll with no wait * remove unused code * more unused code * enabled tiered compilation and trimming * check if window event is not closed before signaling * fix atmospher case * locale fix * locale fix * remove explicit tiered compilation declarations * Remove ) it_IT.json * Remove ) de_DE.json * Update it_IT.json * Update pt_BR locale with latest strings * Remove ')' * add more strings to locale * update locale * remove extra slash * remove extra slash * set firmware version to 0 if key's not found * fix * revert timer changes * lock on object instead * Update it_IT.json * remove unused method * add load screen text to locale * drop swap event * Update de_DE.json * Update de_DE.json * do null check when stopping emulator * Update de_DE.json * Create tr_TR.json * Add tr_TR * Add tr_TR + Turkish * Update it_IT.json * Update Ryujinx.Ava/Input/AvaloniaMappingHelper.cs Co-authored-by: Ac_K <Acoustik666@gmail.com> * Apply suggestions from code review Co-authored-by: Ac_K <Acoustik666@gmail.com> * Apply suggestions from code review Co-authored-by: Ac_K <Acoustik666@gmail.com> * addressed review * Update Ryujinx.Ava/Ui/Backend/OpenGl/OpenGlRenderTarget.cs Co-authored-by: gdkchan <gab.dark.100@gmail.com> * use avalonia's inbuilt renderer on linux * removed whitespace * workaround for queue render crash with vsync off * drop custom backend * format files * fix not closing issue * remove warnings * rebase * update avalonia library * Reposition the Text and Button on About Page * Assign build version * Remove appveyor text Co-authored-by: gdk <gab.dark.100@gmail.com> Co-authored-by: Niwu34 <67392333+Niwu34@users.noreply.github.com> Co-authored-by: Antonio Brugnolo <36473846+AntoSkate@users.noreply.github.com> Co-authored-by: aegiff <99728970+aegiff@users.noreply.github.com> Co-authored-by: Ac_K <Acoustik666@gmail.com> Co-authored-by: MostlyWhat <78652091+MostlyWhat@users.noreply.github.com>	2022-05-15 13:30:15 +02:00
riperiperi	9ba73ffbe5	Prefetch capabilities before spawning translation threads. (#3338 ) * Prefetch capabilities before spawning translation threads. The Backend Multithreading only expects one thread to submit commands at a time. When compiling shaders, the translator may request the host GPU capabilities from the backend. It's possible for a bunch of translators to do this at the same time. There's a caching mechanism in place so that the capabilities are only fetched once. By triggering this before spawning the thread, the async translation threads no longer try to queue onto the backend queue all at the same time. The Capabilities do need to be checked from the GPU thread, due to OpenGL needing a context to check them, so it's not possible to call the underlying backend directly. * Initialize the capabilities when setting the GPU thread + missing call in headless * Remove private variables	2022-05-14 11:58:33 -03:00
riperiperi	43b4b34376	Implement Viewport Transform Disable (#3328 ) * Initial implementation (no specialization) * Use specialization * Fix render scale, increase code gen version * Revert accidental change * Address Feedback	2022-05-12 10:47:13 -03:00
gdkchan	9eb5b7a10d	Restrict cases where vertex buffer size from index buffer type is used (#3304 )	2022-05-01 11:12:34 -03:00
riperiperi	d64594ec74	Fix various issues with texture sync (#3302 ) * Fix various issues with texture sync A variable called _actionRegistered is used to keep track of whether a tracking action has been registered for a given texture group handle. This variable is set when the action is registered, and should be unset when it is consumed. This is used to skip registering the tracking action if it's already registered, saving some time for render targets that are modified very often. There were two issues with this. The worst issue was that the tracking action handler exits early if the handle's modified flag is false... which means that it never reset _actionRegistered, as that was done within the Sync() method called later. The second issue was that this variable was set true after the sync action was registered, so it was technically possible for the action to run immediately, set the flag to false, then set it to true. Both situations would lead to the action never being registered again, as the texture group handle would be sure the action is already registered. This breaks the texture for the remaining runtime, or until it is disposed. It was also possible for a texture to register sync once, then on future frames the last modified sync number did not update. This may have caused some more minor issues. Seems to fix the Xenoblade flashing bug. Obviously this needs a lot of testing, since it was random chance. I typically had the most luck getting it to happen by switching time of day on the event theatre screen for a while, then entering the equipment screen by pressing X on an event. May also fix weird things like random chance air swimming in BOTW, maybe a few texture streaming bugs. * Exchange rather than CompareExchange	2022-04-29 18:34:11 -03:00
gdkchan	43ebd7a9bb	New shader cache implementation (#3194 ) * New shader cache implementation * Remove some debug code * Take transform feedback varying count into account * Create shader cache directory if it does not exist + fragment output map related fixes * Remove debug code * Only check texture descriptors if the constant buffer is bound * Also check CPU VA on GetSpanMapped * Remove more unused code and move cache related code * XML docs + remove more unused methods * Better codegen for TransformFeedbackDescriptor.AsSpan * Support migration from old cache format, remove more unused code Shader cache rebuild now also rewrites the shared toc and data files * Fix migration error with BRX shaders * Add a limit to the async translation queue Avoid async translation threads not being able to keep up and the queue growing very large * Re-create specialization state on recompile This might be required if a new version of the shader translator requires more or less state, or if there is a bug related to the GPU state access * Make shader cache more error resilient * Add some missing XML docs and move GpuAccessor docs to the interface/use inheritdoc * Address early PR feedback * Fix rebase * Remove IRenderer.CompileShader and IShader interface, replace with new ShaderSource struct passed to CreateProgram directly * Handle some missing exceptions * Make shader cache purge delete both old and new shader caches * Register textures on new specialization state * Translate and compile shaders in forward order (eliminates diffs due to different binding numbers) * Limit in-flight shader compilation to the maximum number of compilation threads * Replace ParallelDiskCacheLoader state changed event with a callback function * Better handling for invalid constant buffer 1 data length * Do not create the old cache directory structure if the old cache does not exist * Constant buffer use should be per-stage. This change will invalidate existing new caches (file format version was incremented) * Replace rectangle texture with just coordinate normalization * Skip incompatible shaders that are missing texture information, instead of crashing This is required if we, for example, support new texture instruction to the shader translator, and then they allow access to textures that were not accessed before. In this scenario, the old cache entry is no longer usable * Fix coordinates normalization on cubemap textures * Check if title ID is null before combining shader cache path * More robust constant buffer address validation on spec state * More robust constant buffer address validation on spec state (2) * Regenerate shader cache with one stream, rather than one per shader. * Only create shader cache directory during initialization * Logging improvements * Proper shader program disposal * PR feedback, and add a comment on serialized structs * XML docs for RegisterTexture Co-authored-by: riperiperi <rhy3756547@hotmail.com>	2022-04-10 10:49:44 -03:00
gdkchan	e44a43c7e1	Implement VMAD shader instruction and improve InvocationInfo and ISBERD handling (#3251 ) * Implement VMAD shader instruction and improve InvocationInfo and ISBERD handling * Shader cache version bump * Fix typo	2022-04-08 12:42:39 +02:00
gdkchan	3139a85a2b	Allow copy texture views to have mismatching multisample state (#3152 )	2022-04-08 11:26:48 +02:00
merry	a4e8bea866	Lop3Expression: Optimize expressions (#3184 ) * lut3 * bugfixes * TruthTable * false/true -> 0/-1 * add or to expressions * fix inversions * increment cache version	2022-04-08 11:17:38 +02:00
gdkchan	952f6f8a65	Calculate vertex buffer size from index buffer type (#3253 ) * Calculate vertex buffer size from index buffer type * We also need to update the size if first vertex changes	2022-04-08 11:02:06 +02:00
gdkchan	d4b960d348	Implement primitive restart draw arrays properly on OpenGL (#3256 )	2022-04-04 18:43:24 -03:00
gdkchan	b2a225558d	Do not force scissor on clear if scissor is disabled (#3258 )	2022-04-04 18:30:43 -03:00
gdkchan	1402d8391d	Support NVDEC H264 interlaced video decoding and VIC deinterlacing (#3225 ) * Support NVDEC H264 interlaced video decoding and VIC deinterlacing * Remove unused code	2022-03-23 17:09:32 -03:00
gdkchan	79408b68c3	De-tile GOB when DMA copying from block linear to pitch kind memory regions (#3207 ) * De-tile GOB when DMA copying from block linear to pitch kind memory regions * XML docs + nits * Remove using * No flush for regular buffer copies * Add back ulong casts, fix regression due to oversight	2022-03-20 13:55:07 -03:00
gdkchan	e5ad1dfa48	Implement S8D24 texture format and tweak depth range detection (#2458 )	2022-03-15 03:42:08 +01:00
gdkchan	79becc4b78	Dynamically increase buffer size when resizing (#2861 ) * Grow buffers by 1.5x of its size when resizing * Further restrict the cases where the dynamic expansion is done	2022-03-15 03:33:53 +01:00
gdkchan	0bcbe32367	Only initialize shader outputs that are actually used on the next stage (#3054 ) * Only initialize shader outputs that are actually used on the next stage * Shader cache version bump	2022-03-06 20:42:13 +01:00
gdkchan	0a24aa6af2	Allow textures to have their data partially mapped (#2629 ) * Allow textures to have their data partially mapped * Explicitly check for invalid memory ranges on the MultiRangeList * Update GetWritableRegion to also support unmapped ranges	2022-02-22 13:34:16 -03:00
riperiperi	c9c65af59e	Perform unscaled 2d engine copy on CPU if source texture isn't in cache. (#3112 ) * Initial implementation of fast 2d copy TODO: Partial copy for mismatching region/size. * WIP * Cleanup * Update Ryujinx.Graphics.Gpu/Engine/Twod/TwodClass.cs Co-authored-by: gdkchan <gab.dark.100@gmail.com> Co-authored-by: gdkchan <gab.dark.100@gmail.com>	2022-02-22 11:21:29 -03:00
Berkan Diler	644b497df1	Collapse AsSpan().Slice(..) calls into AsSpan(..) (#3145 ) * Collapse AsSpan().Slice(..) calls into AsSpan(..) Less code and a bit faster * Collapse an Array.Clear(array, 0, array.Length) call to Array.Clear(array)	2022-02-22 10:32:10 -03:00
gdkchan	72e543e946	Prefer texture over textureSize for sampler type (#3132 ) * Prefer texture over textureSize for sampler type * Shader cache version bump	2022-02-18 02:44:46 +01:00
gdkchan	3bd357045f	Do not allow render targets not explicitly written by the fragment shader to be modified (#3063 ) * Do not allow render targets not explicitly written by the fragment shader to be modified * Shader cache version bump * Remove blank lines * Avoid redundant color mask updates * HostShaderCacheEntry can be null * Avoid more redundant glColorMask calls * nit: Mask -> Masks * Fix currentComponentMask * More efficient way to update _currentComponentMasks	2022-02-16 23:15:39 +01:00
gdkchan	7bfb5f79b8	When copying linear textures, DMA should ignore region X/Y (#3121 )	2022-02-16 11:13:45 +01:00
Berkan Diler	8f35345729	Use Enum and Delegate.CreateDelegate generic overloads (#3111 ) * Use Enum generic overloads * Remove EnumExtensions.cs * Use Delegate.CreateDelegate generic overloads	2022-02-13 10:50:07 -03:00
gdkchan	f861f0bca2	Fix missing geometry shader passthrough inputs (#3106 ) * Fix missing geometry shader passthrough inputs * Shader cache version bump	2022-02-11 19:52:20 +01:00
Mary	6dffe0fad4	misc: Make PID unsigned long instead of long (#3043 )	2022-02-09 17:18:07 -03:00
gdkchan	b944941733	Fix bug that could cause depth buffer to be missing after clear (#3067 )	2022-01-31 00:11:43 -03:00
riperiperi	c52158b733	Add timestamp to 16-byte/4-word semaphore releases. (#3049 ) * Add timestamp to 16-byte semaphore releases. BOTW was reading a ulong 8 bytes after a semaphore return. Turns out this is the timestamp it was trying to do performance calculation with, so I've made it write when necessary. This mode was also added to the DMA semaphore I added recently, as it is required by a few games. (i think quake?) The timestamp code has been moved to GPU context. Check other games with an unusually low framerate cap or dynamic resolution to see if they have improved. * Cast dma semaphore payload to ulong to fill the space * Write timestamp first Might be just worrying too much, but we don't want the applcation reading timestamp if it sees the payload before timestamp is written.	2022-01-27 22:50:32 +01:00
riperiperi	fd6d3ec88f	Fix res scale parameters not being updated in vertex shader (#3046 ) This fixes an issue where the render scale array would not be updated when technically the scales on the flat array were the same, but the start index for the vertex scales was different.	2022-01-27 14:17:13 -03:00
gdkchan	42c75dbb8f	Add support for BC1/2/3 decompression (for 3D textures) (#2987 ) * Add support for BC1/2/3 decompression (for 3D textures) * Optimize and clean up * Unsafe not needed here * Fix alpha value interpolation when a0 <= a1	2022-01-22 19:23:00 +01:00
gdkchan	7e967d796c	Stop using glTransformFeedbackVaryings and use explicit layout on the shader (#3012 ) * Stop using glTransformFeedbackVarying and use explicit layout on the shader * This is no longer needed * Shader cache version bump * Fix gl_PerVertex output for tessellation control shaders	2022-01-21 12:35:21 -03:00
gdkchan	0e59573f2b	Add capability for BGRA formats (#3011 )	2022-01-20 08:37:21 -03:00

1 2 3 4 5 ...

393 commits