[FFmpeg-devel] hevcdsp_idct_neon.S: Avoid unnecessary mov.

Message ID 20230726191527.1623-1-Reimar.Doeffinger@gmx.de
State Accepted
Commit dcff15692dff4c55827d640f1d5d07eb255a5a6a

Checks

Context                           Check     Description
yinshiyou/make_loongarch64        success   Make finished
yinshiyou/make_fate_loongarch64   success   Make fate finished
andriy/make_x86                   success   Make finished
andriy/make_fate_x86              success   Make fate finished

Commit Message

Reimar Döffinger July 26, 2023, 7:15 p.m. UTC
From: Reimar Döffinger <Reimar.Doeffinger@gmx.de>

ret can be given an argument instead.
This is also consistent with how other assembler code
in FFmpeg does it.
---
 libavcodec/aarch64/hevcdsp_idct_neon.S | 6 ++----
 1 file changed, 2 insertions(+), 4 deletions(-)
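
For readers unfamiliar with the instruction, here is a minimal sketch of the equivalence the commit message relies on. On AArch64, a plain `ret` branches to the address in x30, while `ret xN` branches to the address in the named register. The labels `example_before`, `example_after` and `helper` are hypothetical and are not part of FFmpeg; the sketch only assumes, as in the patched functions, that the return address was stashed in x15 because a nested `bl` overwrites x30.

```asm
        .text

// Hypothetical routine, old style: copy the saved return address
// back into the link register, then return through it.
example_before:
        mov             x15, x30        // stash return address; bl clobbers x30
        bl              helper
        mov             x30, x15        // restore link register
        ret                             // plain ret branches to x30

// Hypothetical routine, new style: return through x15 directly.
example_after:
        mov             x15, x30        // stash return address; bl clobbers x30
        bl              helper
        ret             x15             // branches to the address held in x15

// Hypothetical callee, present only so that the bl has a target.
helper:
        ret
```

The second form saves one instruction per function. It leaves a stale value in x30 on return, which should be harmless because callers cannot rely on x30 being preserved across a call.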

Comments

Martin Storsjö July 26, 2023, 7:43 p.m. UTC | #1
On Wed, 26 Jul 2023, Reimar.Doeffinger@gmx.de wrote:

> From: Reimar Döffinger <Reimar.Doeffinger@gmx.de>
>
> ret can be given an argument instead.
> This is also consistent with how other assembler code
> in FFmpeg does it.
> ---
> libavcodec/aarch64/hevcdsp_idct_neon.S | 6 ++----
> 1 file changed, 2 insertions(+), 4 deletions(-)
>
> diff --git a/libavcodec/aarch64/hevcdsp_idct_neon.S b/libavcodec/aarch64/hevcdsp_idct_neon.S
> index b7f23386a4..f7142c939c 100644
> --- a/libavcodec/aarch64/hevcdsp_idct_neon.S
> +++ b/libavcodec/aarch64/hevcdsp_idct_neon.S
> @@ -617,8 +617,7 @@ function ff_hevc_idct_16x16_\bitdepth\()_neon, export=1
>
>         add              sp,  sp,  #640
> 
> -        mov             x30, x15
> -        ret
> +        ret             x15
> endfunc
> .endm
> 
> @@ -814,8 +813,7 @@ function ff_hevc_idct_32x32_\bitdepth\()_neon, export=1
> .endr
>
>         add             sp,  sp,  #2432
> -        mov             x30, x15
> -        ret
> +        ret             x15
> endfunc
> .endm

LGTM, assuming checkasm still passes.

// Martin

Reimar Döffinger July 27, 2023, 5:24 p.m. UTC | #2
> On 26 Jul 2023, at 21:43, Martin Storsjö <martin@martin.st> wrote:
> 
> On Wed, 26 Jul 2023, Reimar.Doeffinger@gmx.de wrote:
> 
>> From: Reimar Döffinger <Reimar.Doeffinger@gmx.de>
>> 
>> ret can be given an argument instead.
>> This is also consistent with how other assembler code
>> in FFmpeg does it.
>> ---
>> libavcodec/aarch64/hevcdsp_idct_neon.S | 6 ++----
>> 1 file changed, 2 insertions(+), 4 deletions(-)
>> 
>> diff --git a/libavcodec/aarch64/hevcdsp_idct_neon.S b/libavcodec/aarch64/hevcdsp_idct_neon.S
>> index b7f23386a4..f7142c939c 100644
>> --- a/libavcodec/aarch64/hevcdsp_idct_neon.S
>> +++ b/libavcodec/aarch64/hevcdsp_idct_neon.S
>> @@ -617,8 +617,7 @@ function ff_hevc_idct_16x16_\bitdepth\()_neon, export=1
>> 
>>        add              sp,  sp,  #640
>> -        mov             x30, x15
>> -        ret
>> +        ret             x15
>> endfunc
>> .endm
>> @@ -814,8 +813,7 @@ function ff_hevc_idct_32x32_\bitdepth\()_neon, export=1
>> .endr
>> 
>>        add             sp,  sp,  #2432
>> -        mov             x30, x15
>> -        ret
>> +        ret             x15
>> endfunc
>> .endm
> 
> LGTM, assuming checkasm still passes.

It does. Will push soon (on the assumption I still can...) if no objections.

Best regards,
Reimar

Reimar Döffinger July 29, 2023, 2:31 p.m. UTC | #3
> On 26 Jul 2023, at 21:15, reimar.doeffinger@gmx.de wrote:
> 
> From: Reimar Döffinger <Reimar.Doeffinger@gmx.de>
> 
> ret can be given an argument instead.
> This is also consistent with how other assembler code
> in FFmpeg does it.

Now pushed.

Patch

diff --git a/libavcodec/aarch64/hevcdsp_idct_neon.S b/libavcodec/aarch64/hevcdsp_idct_neon.S
index b7f23386a4..f7142c939c 100644
--- a/libavcodec/aarch64/hevcdsp_idct_neon.S
+++ b/libavcodec/aarch64/hevcdsp_idct_neon.S
@@ -617,8 +617,7 @@  function ff_hevc_idct_16x16_\bitdepth\()_neon, export=1
 
         add              sp,  sp,  #640
 
-        mov             x30, x15
-        ret
+        ret             x15
 endfunc
 .endm
 
@@ -814,8 +813,7 @@  function ff_hevc_idct_32x32_\bitdepth\()_neon, export=1
 .endr
 
         add             sp,  sp,  #2432
-        mov             x30, x15
-        ret
+        ret             x15
 endfunc
 .endm