[FFmpeg-devel,v3,0/2] lavc/aarch64/fdct: add neon-optimized fdct for aarch64

Message ID 20240417180138.21864-1-ramiro.polla@gmail.com

Ramiro Polla April 17, 2024, 6:01 p.m. UTC
This patch set adds fdct to checkasm and neon-optimized fdct for aarch64.

Ramiro Polla (2):
  checkasm: add test for fdct
  lavc/aarch64/fdct: add neon-optimized fdct for aarch64

 libavcodec/aarch64/Makefile               |   2 +
 libavcodec/aarch64/fdct.h                 |  26 ++
 libavcodec/aarch64/fdctdsp_init_aarch64.c |  39 +++
 libavcodec/aarch64/fdctdsp_neon.S         | 368 ++++++++++++++++++++++
 libavcodec/avcodec.h                      |   1 +
 libavcodec/fdctdsp.c                      |   4 +-
 libavcodec/fdctdsp.h                      |   2 +
 libavcodec/options_table.h                |   1 +
 libavcodec/tests/aarch64/dct.c            |   2 +
 tests/checkasm/Makefile                   |   1 +
 tests/checkasm/checkasm.c                 |   3 +
 tests/checkasm/checkasm.h                 |   1 +
 tests/checkasm/fdctdsp.c                  |  68 ++++
 tests/fate/checkasm.mak                   |   1 +
 14 files changed, 518 insertions(+), 1 deletion(-)
 create mode 100644 libavcodec/aarch64/fdct.h
 create mode 100644 libavcodec/aarch64/fdctdsp_init_aarch64.c
 create mode 100644 libavcodec/aarch64/fdctdsp_neon.S
 create mode 100644 tests/checkasm/fdctdsp.c

Comments

Martin Storsjö April 17, 2024, 8:49 p.m. UTC | #1
On Wed, 17 Apr 2024, Ramiro Polla wrote:

> This patch set adds fdct to checkasm and neon-optimized fdct for aarch64.

LGTM, thanks!

// Martin
Ramiro Polla May 13, 2024, 12:55 p.m. UTC | #2
On Wed, Apr 17, 2024 at 10:49 PM Martin Storsjö <martin@martin.st> wrote:
> On Wed, 17 Apr 2024, Ramiro Polla wrote:
> > This patch set adds fdct to checkasm and neon-optimized fdct for aarch64.
>
> LGTM, thanks!

Pushed.