rpcs3

mirror of https://github.com/RPCS3/rpcs3.git synced 2025-07-13 02:08:49 +12:00

Author	SHA1	Message	Date
kd-11	e1d1d16227	gl: Alias register binding points a bit - While aliasing is easy to break, it allows outdated hw to run	2021-10-10 16:15:28 +03:00
kd-11	20bd723e7c	rsx: Add floor workaround for GPUs with rounding issues - Mainly affects nvidia where x/w * w can sometimes return a value smaller than x. In such conditions, floor(x) will return x-1 if x is an integer which is horribly wrong and exaggerates minor precision drift to great proportions.	2021-06-09 10:55:55 +03:00
kd-11	39815801aa	rsx: Implement proper decoding for some obscure fragment instructions - PK4UBG and UP4UBG were dropped from the NV_fragment_program spec in 2002. Not much information about them remains, but they seem pretty straightforward.	2021-06-05 21:02:14 +03:00
kd-11	c5a06dab0a	rsx: Refactor program texture state handling to be persistent across shader swaps	2021-05-15 23:51:12 +03:00
Megamouse	a50be7a912	GL: resharper findings (too lazy for const functions)	2021-04-30 08:23:16 +02:00
kd-11	eb086b0e3f	rsx: Add support for shadow1D and shadowCube	2021-01-21 10:24:49 +03:00
Megamouse	838cbe1840	tiny include fix	2021-01-11 13:04:46 +03:00
kd-11	a96b4412d3	rsx: Do not rely on program env state; instead, always use program ucode analysis results when doing codegen - Some things can be present in program env but not in ucode state, e.g. a texture can be active and bound in a redirected manner but not actually be used in ucode. In such a case, only the ucode analysis or decompilation can decide whether to inject decoding routines.	2020-12-25 02:39:08 +03:00
Nekotekina	fb29933d3d	Add usz alias for std::size_t	2020-12-18 12:23:53 +03:00
RipleyTom	af8c661a64	Remove BOM markers	2020-12-06 15:30:12 +03:00
kd-11	7553429130	gl: Thread shader source compilation dispatch - glCompileShader is in itself much slower than anticipated	2020-11-21 20:43:15 +03:00
kd-11	f4ec28d932	rsx: Merge instruction expand flag with the other sign expand flags - Avoids double expansion when both the exp_tex flag is set AND the texture also is sampled as signed - Should fix missing eyeballs in Mass Effect 1 with the previous sign expansion fix	2020-06-12 20:19:20 +03:00
kd-11	1677618c75	rsx: Implement stippled rendering	2020-05-30 14:47:10 +03:00
AniLeo	b0d3c4d75e	gl: Refactor shader type usage - Use Common/GLSLTypes.h program_domain instead of duplicated own internal type	2020-05-16 16:16:17 +01:00
AniLeo	3db2f23e02	gl: Refactor shader compilation	2020-05-16 16:16:17 +01:00
kd-11	2985a39d2e	rsx: Rewrite async decompiler	2020-03-09 14:59:25 +03:00
Nekotekina	92e3eaf3ff	Fix signed-unsigned comparisons and mark warning as error (part 2).	2020-02-19 22:54:58 +03:00
Nekotekina	c0f80cfe7a	Use attributes for LIKELY/UNLIKELY - Remove LIKELY/UNLIKELY macro.	2020-02-05 10:42:34 +03:00
Nekotekina	15391f45d0	Modernize RSX logging (rsx_log variable)	2020-02-01 11:52:22 +03:00
Maksim Derbasov	1abdee242a	small improvement (#7288) * small improvement * comments addressed Co-authored-by: kd-11 <15904127+kd-11@users.noreply.github.com>	2020-01-22 12:28:48 +00:00
kd-11	0b2f9f0f17	rsx: Add support for delayed shader discard. - Noticed a glitch on AMD hw and windows drivers where discard seems to affect entire 4x4 cells. - Dead fragments (outside the primitive boundary) could have their discards trigger as they do not have proper access to variables. - This introduces dead fragments along triangle edges, causing a diagonal line pattern across the screen that is very annoying.	2019-10-22 13:44:49 +03:00
kd-11	901942f24a	rsx: Replace pointless f32[4] restriction on texture parameters. - Use a struct instead to improve readability and remove pointless OpBitCast	2019-10-22 13:44:49 +03:00
kd-11	3c44065684	gl: Fix copypasta - MSAA is still unimplemented in OGL	2019-10-20 21:38:40 +03:00
kd-11	7aed9c3f13	gl: Add missing input declarations for 2-sided lighting	2019-09-30 21:52:43 +03:00
kd-11	eed32cf3a4	rsx: Decompiler fixups and improvements - Fix 2D coordinate sampling of W coordinate. W is actually HPOS.w and not 1. Z is however always 0. - Optimize register usage a bit. Disassembling compiled SPV shows that global declaration results in fewer ops than using inout modifiers. Modifiers generate extra mov instructions.	2019-08-26 20:03:31 +03:00
kd-11	3e28e4b1e0	rsx/decompiler: Restructure program register behavior - Fix reading of varying registers in FP. Different registers have different behavior - Always write to varying registers. If a register is not written to, it is initialized to (0, 0, 0, 1) - Reimplements two-sided lighting correctly without hacks - Also bumps shader cache version	2019-08-26 20:03:31 +03:00
kd-11	6be7c58fa4	glsl: Refactoring, cleanup and optimizations - Avoid generating unused code - Reduce GPR usage in emitted code	2019-06-25 20:50:54 +03:00
Lassi Hämäläinen	c963c51a60	Remove unnecessary header includes - Manually removed lot of unneeded #includes to clean code and reduce compilation time - Reordered some of the #includes to be in more logical order	2019-06-25 17:11:10 +03:00
scribam	370dcd9d6e	rsx: Apply Clang-Tidy fix "readability-simplify-subscript-expr"	2019-06-12 15:11:52 +03:00
scribam	44265aa27d	rsx: Apply Clang-Tidy fix "modernize-use-equals-default"	2019-06-12 15:11:52 +03:00
scribam	635695ac78	rsx: Apply Clang-Tidy fix "modernize-use-emplace"	2019-06-12 15:11:52 +03:00
scribam	8f2647555a	rsx: Apply Clang-Tidy fix "readability-redundant-string-init"	2019-06-12 15:11:52 +03:00
scribam	c4667133c4	gl/vk: Add constexpr to varying_registers and sync functions between the two backends	2019-06-12 10:59:31 +01:00
kd-11	60f3059d22	rsx: Compensate for nvidia's low precision attribute interpolation - The hw generates inaccurate values when doing perspective-correct interpolation of vertex output attributes and makes the comparison (a == b) fail even when they are a fixed constant value. - Increase equality tolerance when doing comparisons in fragment shaders for NV cards only to work around this issue. - Teepo fix	2019-04-25 16:23:05 +03:00
kd-11	463b1b220d	rsx: Improve accuracy of shadow compare Ops when non-integer depth formats are used - The fixed-point D24S8 format does special Z clamping during compare which matches PS3 behaviour - D32S8 is a floating point format and comparison with Dref > 1 always fails causing black edges/borders	2019-04-25 16:23:05 +03:00
kd-11	06a85f00d1	rsx: Shader decompiler cleanup and improvements - Improve support for float16_t by minimizing mixed inputs to functions (ambiguous overloads) - Minimize amount of downcasts in code by using opcode flags - Re-enable float16_t support for vulkan	2019-04-25 16:23:05 +03:00
kd-11	a668560c68	rsx: Use native half float types if available - Emulating f16 with f32 is not ideal and requires a lot of value clamping - Using native data type can significantly improve performance and accuracy - With OpenGL, check for the compatible extensions NV_gpu_shader5 and AMD_gpu_shader_half_float - With Vulkan, enable this functionality in the deviceFeatures if applicable. (VK_KHR_shader_float16_int8 extension) - Temporarily disable hw fp16 for vulkan	2019-04-25 16:23:05 +03:00
Nekotekina	bd9131ae1c	Implement fs::get_cache_dir - Win32: equal to config dir for now - Linux: respect XDG_CACHE_HOME if specified - OSX: possibly incomplete	2019-01-13 14:45:36 +03:00
kd-11	4b79ef1ad9	rsx: Implement stencil mirror views - Implements a mirror view of D24S8 data that accesses the stencil components. Finishes the implementation of TEX2D_DEPTH_RGBA as the stencil component was previously missing from the reconstructed data - Add a few missing destructors. Image classes are inherited a lot and I forgot to make the dtors virtual	2018-12-24 09:05:19 +03:00
kd-11	1ad76ad331	rsx: Restructure programs - Also re-enable pipeline optimizations	2018-11-30 23:51:25 +03:00
scribam	d7bb59cd99	c++17: use std::size	2018-09-06 13:15:59 +03:00
eladash	f349695a75	Rsx: rewrite address translation	2018-08-13 16:16:34 +03:00
eladash	3e433ef05c	create the shaderlog dir in Emu.Init()	2018-06-21 22:54:08 +04:00
scribam	04ad49de4d	typos	2018-05-14 21:14:39 +04:00
kd-11	321c360dcb	rsx: Overhaul rendertarget sampling/shuffles - Reimplements render target views used for sampling - Optimizes access using an encoded control token - Adds proper encoding for 24-bit textures (DRGB8 -> ORGB/OBGR) - Adds proper encoding for ABGR textures (ABGR8 -> ARGB8) - Silence some compiler warnings as well - TODO: Real texture views for OGL; current method is a hack	2018-03-25 13:31:06 +03:00
kd-11	9fc1740608	rsx/fp: Fragment program overhaul - Separate TXB from TXL: They are completely different! - Properly perform TMU emulation in the fragment shader. Implements SRGB conversion and alphakill at the moment - Properly perform ROP emulation in the fragment shader. Implements FRAMEBUFFER_SRGB. While support on the chip looks to be incomplete (and weird), it does work - Document some more bits in SHADER_CONTROL register	2018-03-25 13:31:06 +03:00
kd-11	4804efc17d	rsx: Clear up confusion on depth writes. According to the NV_fragment_program spec, it's not feasible to have 16-bit depth writes. NOTE: NV_fragment_program precedes NV_fragment_program2, which is very close to what RSX consumes. It is hardware from that era, after all.	2018-03-13 18:55:03 +03:00
Jake	3a9428a88e	rsx: Fix alpha test on VK/GL - Move alpha test outside of fp_main. This ensures the alpha test is still done even if the shader asm has an early return	2018-03-09 18:06:20 +03:00
kd-11	33bcdd476c	glsl/fp/vp: Avoid shader clutter - Do not add unused subroutines in shaders unless necessary -- makes shaders easier to read and disassembled spir-v has less clutter - glsl: Replace switch block with lookup table	2018-01-30 21:16:43 +03:00
kd-11	320fb5f94c	rsx/fp: Fix alphakill	2017-12-22 20:08:14 +03:00
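
The rounding problem described in commit 20bd723e7c (where `x/w * w` can come out slightly smaller than `x`, so that `floor` of an intended integer yields `x-1`) can be illustrated with a minimal C++ sketch. This is not the code from that commit: the helper name `tolerant_floor` and the tolerance `eps` are hypothetical choices for illustration, and the actual workaround in the shader code may look quite different.

```cpp
#include <cmath>

// Hypothetical helper, NOT taken from the RPCS3 source.
// If x lies within `eps` of an integer, treat it as that integer;
// otherwise fall back to the ordinary floor. The default tolerance
// is an assumption chosen only for this illustration.
inline float tolerant_floor(float x, float eps = 1e-4f)
{
    const float nearest = std::round(x);   // closest integer to x
    if (std::fabs(x - nearest) < eps)
        return nearest;                    // absorb minor precision drift
    return std::floor(x);                  // genuinely fractional input
}
```

For a value one step below `3.0f`, such as `std::nextafter(3.0f, 0.0f)`, plain `std::floor` returns `2.0f` while this helper returns `3.0f`. Clearly fractional inputs such as `2.5f` still floor to `2.0f`.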

126 commits