Ryujinx

mirror of https://git.naxdy.org/Mirror/Ryujinx.git synced 2024-12-27 11:03:04 +00:00

Author	SHA1	Message	Date
gdkchan	c0f2491eae	Vulkan separate descriptor set fixes (#6895 ) * Ensure descriptor sets are only re-used when all command buffers using it have completed * Fix some SPIR-V capabilities * Set update after bind flag if we exceed limits * Simpler fix for Intel * Format whitespace * Make struct readonly * Add barriers for extra set arrays too	2024-06-02 22:40:28 -03:00
sunshineinabox	d7c6474729	GPU: Remove unused dynamic state and pipeline settings (#6796 ) * Dynamic state for Depth Bounds should not be passed to PipelineDynamicStateCreateInfo as the command to set them is never called. Do not pass pointer to viewport and scissor as those dynamic states should be supported on all devices. Same as above for DepthBias values. * Code Review Suggestion * Pipeline derivation is not implemented and is not suggested. * Depth Bounds are not used.	2024-06-02 22:32:10 -03:00
jhorv	1ecc8fbc3b	New pooled memory types (#6821 ) * feat: add new types MemoryOwner and SpanOwner * use SpanOwner instead of new array allocation * change for loop condition to `fences.Length` instead of `count` to elide Span boundary checks on `fences`	2024-06-02 22:24:14 -03:00
gdkchan	53d096e392	Allow texture arrays to use separate descriptor sets on Vulkan (#6870 ) * Report base and extra sets from the backend * Pass texture set index everywhere * Key textures using set and binding (rather than just binding) * Start using extra sets for array textures * Shader cache version bump * Separate new commands, some PR feedback * Introduce new manual descriptor set reservation method that prevents it from being used by something else while owned by an array * Move bind extra sets logic to new method * Should only use separate array is MaximumExtraSets is not zero * Format whitespace	2024-05-26 13:30:19 -03:00
Piplup	c98b7fc702	Workaround bug on logic op with float framebuffer (#6858 ) * intel workaround built on top of the amd workaround * forgot to update the note * Logic Change Enabled workaround for all vendors that aren't nvidia * Applied Suggestions	2024-05-23 22:57:26 -03:00
gdkchan	e65effcb05	Workaround AMD bug on logic op with float framebuffer (#6852 ) * Workaround AMD bug on logic op with float framebuffer * Format whitespace * Update comment	2024-05-23 01:05:32 -03:00
riperiperi	eb1ce41b00	GPU: Migrate buffers on GPU project, pre-emptively flush device local mappings (#6794 ) * GPU: Migrate buffers on GPU project, pre-emptively flush device local mappings Essentially retreading #4540, but it's on the GPU project now instead of the backend. This allows us to have a lot more control + knowledge of where the buffer backing has been changed and allows us to pre-emptively flush pages to host memory for quicker readback. It will allow us to do other stuff in the future, but we'll get there when we get there. Performance greatly improved in Hyrule Warriors: Age of Calamity. Performance notably improved in TOTK (average). Performance for BOTW restored to how it was before #4911, perhaps a bit better. - Rewrites a bunch of buffer migration stuff. Might want to tighten up how dispose stuff works. - Fixed an issue where the copy for texture pre-flush would happen _after_ the syncpoint. TODO: remove a page from pre-flush if it isn't flushed after a certain number of copies. * Add copy deactivation * Fix dependent virtual buffers * Remove logging * Fix format issues (maybe) * Vulkan: Remove backing swap * Add explicit memory access types for most buffers * Fix typo * Add device local force expiry, change buffer inheritance behaviour * General cleanup, OGL fix * BufferPreFlush comments * BufferBackingState comments * Add an extra precaution to BufferMigration This is very unlikely, but it's important to cover loose ends like this. * Address some feedback * Docs	2024-05-19 16:53:37 -03:00
gdkchan	3a3b51893e	Add support for bindless textures from storage buffer on Vulkan (#6721 ) * Halve primitive ID when converting quads to triangles * Shader cache version bump * Add support for bindless textures from storage buffer on Vulkan	2024-05-14 16:47:16 +02:00
gdkchan	c6f8bfed90	Add support for bindless textures from shader input (vertex buffer) on Vulkan (#6577 ) * Add support for bindless textures from shader input (vertex buffer) * Shader cache version bump * Format whitespace * Remove cache entries on pool removal, disable for OpenGL * PR feedback	2024-04-22 15:05:55 -03:00
Marco Carvalho	99f46e22e2	Do not compare Span<T> to 'null' or 'default' (#6683 )	2024-04-19 09:21:21 -03:00
jhorv	268c9aecf8	Texture loading: reduce memory allocations (#6623 ) * rebase * add methods Ryyjinx.Common EmbeddedResources and SteamUtils * GAL changes - change SetData() methods and ThreadedTexture commands to use IMemoryOwner<byte> instead of SpanOrArray<byte> * Ryujinx.Graphics.Texture: change texture conversion methods to return IMemoryOwner<byte> and allocate from ByteMemoryPool * Ryujinx.Graphics.OpenGL: update ITexture and Texture-like types with SetData() methods to take IMemoryOwner<byte> instead of SpanOrArray<byte> * Ryujinx.Graphics.Vulkan: update ITexture and Texture-like types with SetData() methods to take IMemoryOwner<byte> instead of SpanOrArray<byte> * Ryujinx.Graphics.Gpu: update ITexture and Texture-like types with SetData() methods to take IMemoryOwner<byte> instead of SpanOrArray<byte> * Remove now-unused SpanOrArray<T> * post-rebase cleanup * PixelConverter: remove unsafe modifier on safe methods, and remove one unnecessary cast * use ByteMemoryPool.Rent() in GetWritableRegion() impls * fix formatting, rename `ReadRentedMemory()` to `ReadFileToRentedMemory()`` * Texture.ConvertToHostCompatibleFormat(): dispose of `result` in Astc decode branch	2024-04-14 17:06:14 -03:00
gdkchan	e916662b0f	Account for swapchain image count change after re-creation (#6652 )	2024-04-11 17:24:19 -03:00
gdkchan	3e6e0e4afa	Add support for large sampler arrays on Vulkan (#6489 ) * Add support for large sampler arrays on Vulkan * Shader cache version bump * Format whitespace * Move DescriptorSetManager to PipelineLayoutCacheEntry to allow different pool sizes per layout * Handle array textures with different types on the same buffer * Somewhat better caching system * Avoid useless buffer data modification checks * Move redundant bindings update checking to the backend * Fix an issue where texture arrays would get the same bindings across stages on Vulkan * Backport some fixes from part 2 * Fix typo * PR feedback * Format whitespace * Add some missing XML docs	2024-04-07 18:25:55 -03:00
gdkchan	3be616207d	Vulkan: Fix swapchain image view leak (#6509 )	2024-04-06 13:38:52 -03:00
gdkchan	791bf22109	Vulkan: Skip draws when patches topology is used without a tessellation shader (#6508 )	2024-04-06 13:25:51 -03:00
MutantAura	7124d679fd	UI: Friendly driver name reporting. (#6530 ) * Implement friendly VkDriverID names for UI. * Capitalise NVIDIA * Prefer vendor name on macOS * Typo fix Co-authored-by: gdkchan <gab.dark.100@gmail.com> --------- Co-authored-by: gdkchan <gab.dark.100@gmail.com>	2024-03-27 14:55:34 -03:00
gdkchan	72bdc24db8	Disable push descriptors for Intel ARC GPUs on Windows (#6551 ) * Move some init logic out of PrintGpuInformation, then delete it * Disable push descriptors for Intel ARC on Windows * Re-add PrintGpuInformation just to show it in the log	2024-03-26 22:27:48 -03:00
Matt Heins	c94a73ec60	Updates the default value for BufferedQuery (#6351 ) AMD GPUs (possibly just RDNA 3) could hang with the previous value until the MaxQueryRetries was hit. Fix #6056 Co-authored-by: riperiperi <rhy3756547@hotmail.com>	2024-03-21 21:44:11 -03:00
standstaff	e19e7622a3	chore: remove repetitive words (#6500 ) Signed-off-by: standstaff <zhengxingru@yeah.net>	2024-03-16 19:49:54 +01:00
gdkchan	732db7581f	Consider Polygon as unsupported is triangle fans are unsupported on Vulkan (#6490 )	2024-03-14 19:46:57 -03:00
riperiperi	ba91f5d401	Vulkan: Properly reset barrier batch when splitting due to mismatching flags (#6345 ) Forgot to set the end variable here. Should stop it from crashing when this path is taken.	2024-02-22 10:43:22 +01:00
riperiperi	79f6c18a9b	Vulkan: Disable push descriptors on older NVIDIA GPUs (#6340 ) Disables push descriptors on older NVIDIA GPUs (10xx and below), since it is clearly broken beyond comprehension. The existing workaround wasn't good enough and a more thorough one will probably cost more performance than the feature gains. The workaround has been removed. Fixes #6331.	2024-02-21 23:52:13 -03:00
riperiperi	4f63782bac	Vulkan: Fix barrier batching past limit (#6339 ) If more than 16 barriers were queued at one time, the _queuedBarrierCount would no longer match the number of remaining barriers, because when breaking out of the loop consuming them it deleted all barriers, not just the 16 that were consumed. Should fix freezes that started occurring with #6240. Fixes issue #6338.	2024-02-21 23:41:08 -03:00
riperiperi	31ed061bea	Vulkan: Improve texture barrier usage, timing and batching (#6240 ) * WIP barrier batch * Add store op to image usage barrier * Dispose the barrier batch * Fix encoding? * Handle read and write on the load op barrier. Load op consumes read accesses but does not add one, as the only other operation that can read is another load. * Simplify null check * Insert barriers on program change in case stale bindings are reintroduced * Not sure how I messed this one up * Improve location of bindings barrier update This is also important for emergency deferred clear * Update src/Ryujinx.Graphics.Vulkan/BarrierBatch.cs Co-authored-by: Mary Guillemard <thog@protonmail.com> --------- Co-authored-by: Mary Guillemard <thog@protonmail.com>	2024-02-17 00:21:37 -03:00
riperiperi	4218311e6a	Vulkan: Use push descriptors for uniform bindings when possible (#6154 ) * Fix Push Descriptors * Use push descriptor templates * Use reserved bindings * Formatting * Disable when using MVK ("my heart will go on" starts playing as thousands of mac users shed a tear in unison) * Introduce limit on push descriptor binding number The bitmask used for updating push descriptors is ulong, so only 64 bindings can be tracked for now. * Address feedback * Fix logic for binding rejection Should only offset limit when reserved bindings are less than the requested one. * Workaround pascal and older nv bug * Add GPU number detection for nvidia * Only do workaround if it's valid to do so.	2024-02-16 21:41:30 -03:00
gdkchan	e37735ed26	Implement X8Z24 texture format (#6315 )	2024-02-15 19:06:26 -03:00
gdkchan	74fe814329	Remove Vulkan SubgroupSizeControl enablement code (#6317 )	2024-02-15 16:04:30 -03:00
gdkchan	6a8ac389e5	Fix mip offset/size for full 3D texture upload on Vulkan (#6294 )	2024-02-11 00:41:17 +01:00
gdkchan	609de33b0b	Implement BGR10A2 render target format (#6273 )	2024-02-08 19:52:38 +01:00
riperiperi	c94f0fbb83	Vulkan: Add Render Pass / Framebuffer Cache (#6182 ) * Vulkan: Add Render Pass / Framebuffer Cache Cache is owned by each texture view. - Window's way of getting framebuffer cache for swapchain images is really messy - it creates a TextureView out of just a vk image view, with invalid info and no storage. * Clear up limited use of alternate TextureView constructor * Formatting and messages * More formatting and messages I apologize for `_colorsCanonical[index]?.Storage?.InsertReadToWriteBarrier`, the compiler made me do it * Self review, change GetFramebuffer to GetPassAndFramebuffer * Avoid allocations on Remove for HashTableSlim * Member can be readonly * Generate texture create info for swapchain images * Improve hashcode * Remove format, samples, size and isDepthStencil when possible Tested in a number of games, seems fine. * Removed load op barriers These can be introduced later. * Reintroduce UpdateModifications Technically meant to be replaced by load op stuff.	2024-01-31 23:49:50 +01:00
gdkchan	b8d992e5a7	Allow skipping draws with broken pipeline variants on Vulkan (#5807 ) * Allow skipping draws with broken pipeline variants on Vulkan * Move IsLinked check to CreatePipeline * Restore throw on error behaviour for background compile * Can't remove SetAlphaTest pragmas yet * Double new line	2024-01-26 13:58:57 -03:00
Elijah	d7ec4308b4	Use driver name instead of vendor name in the status bar for Vulkan. (#6146 ) * Replace vendor id lookup with driver name * Create separate field for driver name, handle OpenGL * Document changes in VulkanPhysicalDevice.cs * Always display driver over vendor * Replace Vulkan 1.2 requirement with VK_KHR_driver_properties * Remove empty line * Remove redundant unsafe block * Apply suggestions from code review --------- Co-authored-by: Ac_K <Acoustik666@gmail.com>	2024-01-26 01:07:20 +01:00
riperiperi	795539bc82	Vulkan: Use staging buffer for temporary constants (#6168 ) * Vulkan: Use staging buffer for temporary constants Helper shaders and post processing effects typically need some parameters to tell them what to do, which we pass via constant buffers that are created and destroyed each time. This can vary in cost between different Vulkan drivers. It shows up on profiles on mesa and MoltenVK, so it's worth avoiding. Some games only do it once (BlitColor for present), others multiple times. It's also done for post processing filters and FSR upscaling, which creates two buffers. For mirrors, I added the ability to reserve a range on the staging buffer for use as any type of binding. This PR allows these constant buffers to be instead temporarily allocated on the staging buffer, skipping allocation and buffer management costs entirely. Two temporary allocations do remain: - DrawTexture, because it doesn't have access to the command buffer scope - Index buffer indirect conversion, because one of them is a storage buffer and thus is a little more complicated. There's a small cost in that the uniform buffer takes up more space due to alignment requirements. At worst that's 256 bytes (on a GTX 1070) but more modern GPUs should have a better time. Worth testing across different games and post effects to make sure they still work. * Use temporary buffer for ConvertIndexBufferIndirect * Simplify alignment passing for now * Fix shader params length for CopyIncompatibleFormats * Set data for helpershaders without overlap checks The data is in the staging buffer, so its usage range is guarded using that.	2024-01-25 19:29:53 +01:00
riperiperi	6575952432	Vulkan: Enumerate Query Pool properly (#6167 ) Turns out that ElementAt for Queue<T> runs the default implementation as it doesn't implement IList, which enumerates elements of the queue up to the given index. This code was creating `count` enumerators and iterating way more queue items than it needed to at higher counts. The solution is just to use one enumerator and break out of the loop when we get the count that we need. 3.5% of backend time was being spent _just_ enumerating at the usual spot in SMO.	2024-01-24 19:33:52 -03:00
riperiperi	331c07807f	Vulkan: Use templates for descriptor updates (#6014 ) * WIP: Descriptor template update * Make configurable * Wording * Simplify template creation * Whitespace * UTF-8 whatever * Leave only templated path, better template updater	2024-01-20 11:07:33 -03:00
riperiperi	bebd8eb822	Vulkan: Cache delegate for EndRenderPass (#6132 ) This prevents a small allocation each time this method is called. This is a top 3 SOH allocation during gameplay in most games, and eliminating it is pretty free.	2024-01-16 13:22:20 +01:00
gdkchan	1df6c07f78	Implement support for multi-range buffers using Vulkan sparse mappings (#5427 ) * Pass MultiRange to BufferManager * Implement support for multi-range buffers using Vulkan sparse mappings * Use multi-range for remaining buffers, delete old methods * Assume that more buffers are contiguous * Dispose multi-range buffers after they are removed from the list * Properly init BufferBounds for constant and storage buffers * Do not try reading zero bytes data from an unmapped address on the shader cache + PR feedback * Fix misaligned sparse buffer offsets * Null check can be simplified * PR feedback	2023-12-04 20:30:19 +01:00
TSRBerry	2989c163a8	editorconfig: Set default encoding to UTF-8 (#5793 ) * editorconfig: Add default charset * Change file encoding from UTF-8-BOM to UTF-8	2023-12-04 14:17:13 +01:00
Zoltan Csizmadia	29e192f241	Migrate to .NET 8 (#5887 ) * Change TargetFramework to net8.0 * Disable info messages * Fix warings * Disable additional analyzer messages * Fix typo * Add whitespace * Fix ref vs in warnings * Use explicit [In] on array parameters * No need to guard Remove with Contains * Use 'ArgumentOutOfRangeException.ThrowIf...' instead of explicitly throwing a new exception instance * Bump .NET SDK version * Enable JsonSerializerIsReflectionEnabledByDefault * Use 8.0.100 GA release * Bump System package versions --------- Co-authored-by: Zoltan Csizmadia <Zoltan.Csizmadia@vericast.com>	2023-11-15 17:41:31 +01:00
gdkchan	841dd56f4c	Implement copy dependency for depth and color textures (#4365 ) * Implement copy dependency for depth and color textures * Revert changes added because R32 <-> D32 copies were illegal * Restore depth alias matches	2023-10-31 19:00:39 -03:00
riperiperi	76b53e018a	GPU: Add fallback when textureGatherOffsets is not supported (#5792 ) * GPU: Add fallback when textureGatherOffsets is not supported. This PR adds a fallback for GPUs or APIs that don't support an equivalent to the method `textureGatherOffsets`, where each of the 4 gathered texels has an individual offset. This is done by reusing the existing code to handle non-const offsets for texture instructions, though it has also been corrected as there were a few implementation issues. MoltenVK reports support for this capability, and it didn't error when we initially released the MacOS build, but that has since changed. MVK still reports support, but spirv-cross has been fixed in a way that it _attempts_ to use this capability, but the metal compiler errors since it doesn't exist. Some other fixes: - textureGatherOffsets emulation has been changed significantly. It now uses 4 texture sample instructions (not gather), calculates a base texel (i=0 j=0) and adds the offsets onto it before converting into a tex coord. The final result is offset into a texel center, so it shouldn't be subject to interpolation, though this isn't perfect and could have some error with floating point formats with linear sampling. It is subject to texture wrap mode as it should be, which is why texelFetch was not used. - Maybe gather should be used here with component `w` (i=0, j=0), though this multiplies number of texels fetched by 4... The way it was doing this before _was_ wrong_, but doing it right would avoid issues with texel center precision. - textureGatherOffset (singular) now performs textureGather with the offset applied to the coords, rather than the slower fallback where each texel is fetched individually. * Increment shader cache version, remove unused arg * Use base texture size for gather coord offset. Implicit LOD for gather is not supported. * Use 4 texture gathers for offsets emulation Avoids issues with interpolation at cost of performance (not sure how bad this is) * Address Feedback	2023-10-20 15:05:09 +02:00
sunshineinabox	e768a54f17	Replace ReaderWriterLock with ReaderWriterLockSlim (#5785 ) * Replace ReaderWriterLock with ReaderWriterLockSlim * Resolve Feedback + Correct typo * Revert some unncessary logic	2023-10-12 18:11:15 +02:00
gdkchan	4744bde0e5	Reduce the amount of descriptor pool allocations on Vulkan (#5673 ) * Reduce the amount of descriptor pool allocations on Vulkan * Formatting * Slice can be simplified * Make GetDescriptorPoolSizes static * Adjust CanFit calculation so that TryAllocateDescriptorSets never fails * Remove unused field	2023-09-26 02:00:02 +02:00
gdkchan	4a835bb2b9	Make Vulkan memory allocator actually thread safe (#5575 ) * Make Vulkan memory allocator actually thread safe * Make free thread safe too * PR feedback	2023-09-26 01:50:06 +02:00
Isaac Marovitz	d9f9bbfaa6	Vulkan: Fix barriers on macOS (#5700 ) * Use old method on macOS * gdk suggestions * Update src/Ryujinx.Graphics.Vulkan/TextureStorage.cs Co-authored-by: gdkchan <gab.dark.100@gmail.com> * Update src/Ryujinx.Graphics.Vulkan/TextureStorage.cs Co-authored-by: gdkchan <gab.dark.100@gmail.com> --------- Co-authored-by: gdkchan <gab.dark.100@gmail.com>	2023-09-23 19:32:36 -03:00
gdkchan	7ccff037e8	Fix some Vulkan validation errors (mostly related to barriers) (#5603 ) * Replace image barriers inside render pass with more generic memory barrier * Remove forceStorage since it was creating images with storage bit for formats that are not StorageImage compatible * Add missing flags on subpass dependency * Don't call vkCmdSetScissor with a scissor count of 0 * One semaphore per swapchain image * Remove compute stage from read to write barriers * Try to improve Pipeline.Barrier nonsense * Set PipelineStateFlags based on supported stages	2023-09-14 19:58:11 +02:00
gdkchan	ddb6493896	Delete ResourceAccess (#5626 ) * Delete ResourceAccess * Set write flag for vertex/geometry as compute output buffers	2023-09-05 22:59:21 +02:00
riperiperi	93cd327873	Vulkan: Device Local and higher invocation count for buffer conversions (#5623 ) Just some simple changes to the buffer conversion shaders. (stride conversion, D32S8 to D24S8) The first change is using a device local buffer for converted vertex buffers, since they're only read/written on the GPU. These paths don't trigger on NVIDIA, but if you force them to use it demonstrates the full extent writing to host owned memory from compute absolutely destroys them. AMD GPUs are less heavily affected by this issue, but since the game in question was writing 230MB from compute, I imagine it should have some effect. The second change is allowing the buffer conversion shaders to scale their work group count. While dividing the work between 32 invocations works OK for M1 macs, it's not so great for anything with more cores like AMD GPUs, which should be able to do a lot more parallel copies. Now, it scales by roughly 100 elements per invocation. Some stride change cases could be improved further by either limiting vertex buffer size somehow (reading the index buffer could help, but is always risky) or only updating regions that changed, rather than invalidating the whole thing.	2023-09-02 17:58:15 -03:00
gdkchan	f09bba82b9	Geometry shader emulation for macOS (#5551 ) * Implement vertex and geometry shader conversion to compute * Call InitializeReservedCounts for compute too * PR feedback * Set clip distance mask for geometry and tessellation shaders too * Transform feedback emulation only for vertex	2023-08-29 21:10:34 -03:00
gdkchan	153b8bfc7c	Implement support for masked stencil clears on Vulkan (#5589 ) * Implement support for masked stencil clears on Vulkan * PR feedback	2023-08-18 05:25:54 +00:00

1 2

98 commits