diff mbox series

[FFmpeg-devel,v4,1/2] libavcodec: Added DFPWM1a codec

Message ID 63a183b7-98e4-314a-9444-a6197edf18e5@gmail.com
State New
Headers show
Series [FFmpeg-devel,v4,1/2] libavcodec: Added DFPWM1a codec | expand

Checks

Context Check Description
yinshiyou/make_loongarch64 success Make finished
yinshiyou/make_fate_loongarch64 success Make fate finished

Commit Message

Jack Bruienne Feb. 27, 2022, 8:55 a.m. UTC
From the wiki page (https://wiki.vexatos.com/dfpwm):
> DFPWM (Dynamic Filter Pulse Width Modulation) is an audio codec
> created by Ben “GreaseMonkey” Russell in 2012, originally to be used
> as a voice codec for asiekierka's pixmess, a C remake of 64pixels.
> It is a 1-bit-per-sample codec which uses a dynamic-strength one-pole
> low-pass filter as a predictor. Due to the fact that a raw DPFWM decoding
> creates a high-pitched whine, it is often followed by some post-processing
> filters to make the stream more listenable.

It has recently gained popularity through the ComputerCraft mod for
Minecraft, which added support for audio through this codec, as well as
the Computronics expansion which preceeded the official support. These
both implement the slightly adjusted 1a version of the codec, which is
the version I have chosen for this patch.

This patch adds a new codec (with encoding and decoding) for DFPWM1a.
The codec sources are pretty simple: they use the reference codec with
a basic wrapper to connect it to the FFmpeg AVCodec system.

To clarify, the codec does not have a specific sample rate - it is
provided by the container (or user), which is typically 48000, but has
also been known to be 32768. The codec does not specify channel info
either, and it's pretty much always used with one mono channel.
However, since it appears that libavcodec expects both sample rate and
channel count to be handled by either the codec or container, I have
made the decision to allow multiple channels interleaved, which as far
as I know has never been used, but it works fine here nevertheless. The
accompanying raw format has a channels option to set this. (I expect
most users of this will not use multiple channels, but it remains an
option just in case.)

This patch will be highly useful to ComputerCraft developers who are
working with audio, as it is the standard format for audio, and there
are few user-friendly encoders out there, and even fewer decoders. It
will streamline the process for importing and listening to audio, 
replacing the need to write code or use tools that require very
specific input formats.

You may use the CraftOS-PC program (https://www.craftos-pc.cc) to test
out DFPWM playback. To use it, run the program and type this command:
"attach left speaker" Then run "speaker play <file.dfpwm>" for each file.
The app runs in a sandbox, so files have to be transferred in first;
the easiest way to do this is to simply drag the file on the window.
(Or copy files to the folder at https://www.craftos-pc.cc/docs/saves.)

Sample DFPWM files can be generated with an online tool at
https://music.madefor.cc. This is the current best way to encode DFPWM
files. Simply drag an audio file onto the page, and it will encode it,
giving a download link on the page.

I've made sure to update all of the docs as per Developer§7, and I've
tested it as per section 8. Test files encoded to DFPWM play correctly
in ComputerCraft, and other files that work in CC are correctly decoded.
I have also verified that corrupt files do not crash the decoder - this
should theoretically not be an issue as the result size is constant with
respect to the input size.

Changes since v3:
Added support for multiple interleaved channels, and cleaned up the
code a bunch.

Changes since v2:
I've found that the reference encoder has a few errors, and sounds
worse than the Java-based implementation that is used most often. I got
in contact with someone who knows DFPWM much better than I do, and I
worked with them to make a few adjustments that should improve the
audio quality. I also made sure that the output matches the Java
codec exactly, so it should have the exact same quality as other codecs.

Signed-off-by: Jack Bruienne <jackbruienne@gmail.com>
---
  Changelog                 |   1 +
  MAINTAINERS               |   1 +
  doc/general_contents.texi |   1 +
  libavcodec/Makefile       |   2 +
  libavcodec/allcodecs.c    |   2 +
  libavcodec/codec_desc.c   |   7 +++
  libavcodec/codec_id.h     |   1 +
  libavcodec/dfpwmdec.c     | 127 ++++++++++++++++++++++++++++++++++++++
  libavcodec/dfpwmenc.c     | 121 ++++++++++++++++++++++++++++++++++++
  libavcodec/utils.c        |   2 +
  libavcodec/version.h      |   2 +-
  11 files changed, 266 insertions(+), 1 deletion(-)
  create mode 100644 libavcodec/dfpwmdec.c
  create mode 100644 libavcodec/dfpwmenc.c

Comments

James Almer Feb. 27, 2022, 12:40 p.m. UTC | #1
On 2/27/2022 5:55 AM, Jack Bruienne wrote:
> 
>  From the wiki page (https://wiki.vexatos.com/dfpwm):
>> DFPWM (Dynamic Filter Pulse Width Modulation) is an audio codec
>> created by Ben “GreaseMonkey” Russell in 2012, originally to be used
>> as a voice codec for asiekierka's pixmess, a C remake of 64pixels.
>> It is a 1-bit-per-sample codec which uses a dynamic-strength one-pole
>> low-pass filter as a predictor. Due to the fact that a raw DPFWM decoding
>> creates a high-pitched whine, it is often followed by some 
>> post-processing
>> filters to make the stream more listenable.
> 
> It has recently gained popularity through the ComputerCraft mod for
> Minecraft, which added support for audio through this codec, as well as
> the Computronics expansion which preceeded the official support. These
> both implement the slightly adjusted 1a version of the codec, which is
> the version I have chosen for this patch.
> 
> This patch adds a new codec (with encoding and decoding) for DFPWM1a.
> The codec sources are pretty simple: they use the reference codec with
> a basic wrapper to connect it to the FFmpeg AVCodec system.
> 
> To clarify, the codec does not have a specific sample rate - it is
> provided by the container (or user), which is typically 48000, but has
> also been known to be 32768. The codec does not specify channel info
> either, and it's pretty much always used with one mono channel.
> However, since it appears that libavcodec expects both sample rate and
> channel count to be handled by either the codec or container, I have
> made the decision to allow multiple channels interleaved, which as far
> as I know has never been used, but it works fine here nevertheless. The
> accompanying raw format has a channels option to set this. (I expect
> most users of this will not use multiple channels, but it remains an
> option just in case.)
> 
> This patch will be highly useful to ComputerCraft developers who are
> working with audio, as it is the standard format for audio, and there
> are few user-friendly encoders out there, and even fewer decoders. It
> will streamline the process for importing and listening to audio, 
> replacing the need to write code or use tools that require very
> specific input formats.
> 
> You may use the CraftOS-PC program (https://www.craftos-pc.cc) to test
> out DFPWM playback. To use it, run the program and type this command:
> "attach left speaker" Then run "speaker play <file.dfpwm>" for each file.
> The app runs in a sandbox, so files have to be transferred in first;
> the easiest way to do this is to simply drag the file on the window.
> (Or copy files to the folder at https://www.craftos-pc.cc/docs/saves.)
> 
> Sample DFPWM files can be generated with an online tool at
> https://music.madefor.cc. This is the current best way to encode DFPWM
> files. Simply drag an audio file onto the page, and it will encode it,
> giving a download link on the page.
> 
> I've made sure to update all of the docs as per Developer§7, and I've
> tested it as per section 8. Test files encoded to DFPWM play correctly
> in ComputerCraft, and other files that work in CC are correctly decoded.
> I have also verified that corrupt files do not crash the decoder - this
> should theoretically not be an issue as the result size is constant with
> respect to the input size.
> 
> Changes since v3:
> Added support for multiple interleaved channels, and cleaned up the
> code a bunch.
> 
> Changes since v2:
> I've found that the reference encoder has a few errors, and sounds
> worse than the Java-based implementation that is used most often. I got
> in contact with someone who knows DFPWM much better than I do, and I
> worked with them to make a few adjustments that should improve the
> audio quality. I also made sure that the output matches the Java
> codec exactly, so it should have the exact same quality as other codecs.
> 
> Signed-off-by: Jack Bruienne <jackbruienne@gmail.com>
> ---
>   Changelog                 |   1 +
>   MAINTAINERS               |   1 +
>   doc/general_contents.texi |   1 +
>   libavcodec/Makefile       |   2 +
>   libavcodec/allcodecs.c    |   2 +
>   libavcodec/codec_desc.c   |   7 +++
>   libavcodec/codec_id.h     |   1 +
>   libavcodec/dfpwmdec.c     | 127 ++++++++++++++++++++++++++++++++++++++
>   libavcodec/dfpwmenc.c     | 121 ++++++++++++++++++++++++++++++++++++
>   libavcodec/utils.c        |   2 +
>   libavcodec/version.h      |   2 +-
>   11 files changed, 266 insertions(+), 1 deletion(-)
>   create mode 100644 libavcodec/dfpwmdec.c
>   create mode 100644 libavcodec/dfpwmenc.c

> diff --git a/Changelog b/Changelog
> index 5ad2cef..5170a6a 100644
> --- a/Changelog
> +++ b/Changelog
> @@ -4,6 +4,7 @@ releases are sorted from youngest to oldest.
>  version 5.1:
>  - dialogue enhance audio filter
>  - dropped obsolete XvMC hwaccel
> +- DFPWM audio encoder/decoder
>  
>  
>  version 5.0:
> diff --git a/MAINTAINERS b/MAINTAINERS
> index f33ccbd..57b6f33 100644
> --- a/MAINTAINERS
> +++ b/MAINTAINERS
> @@ -161,6 +161,7 @@ Codecs:
>    cscd.c                                Reimar Doeffinger
>    cuviddec.c                            Timo Rothenpieler
>    dca*                                  foo86
> +  dfpwm*                                Jack Bruienne
>    dirac*                                Rostislav Pehlivanov
>    dnxhd*                                Baptiste Coudurier
>    dolby_e*                              foo86
> diff --git a/doc/general_contents.texi b/doc/general_contents.texi
> index df1692c..14aeaed 100644
> --- a/doc/general_contents.texi
> +++ b/doc/general_contents.texi
> @@ -1194,6 +1194,7 @@ following image formats are supported:
>  @item CRI HCA                @tab     @tab X
>  @item Delphine Software International CIN audio  @tab     @tab  X
>      @tab Codec used in Delphine Software International games.
> +@item DFPWM                  @tab  X  @tab  X
>  @item Digital Speech Standard - Standard Play mode (DSS SP) @tab     @tab  X
>  @item Discworld II BMV Audio @tab     @tab  X
>  @item COOK                   @tab     @tab  X
> diff --git a/libavcodec/Makefile b/libavcodec/Makefile
> index 6076b4a..7474220 100644
> --- a/libavcodec/Makefile
> +++ b/libavcodec/Makefile
> @@ -289,6 +289,8 @@ OBJS-$(CONFIG_DERF_DPCM_DECODER)       += dpcm.o
>  OBJS-$(CONFIG_DIRAC_DECODER)           += diracdec.o dirac.o diracdsp.o diractab.o \
>                                            dirac_arith.o dirac_dwt.o dirac_vlc.o
>  OBJS-$(CONFIG_DFA_DECODER)             += dfa.o
> +OBJS-$(CONFIG_DFPWM_DECODER)           += dfpwmdec.o
> +OBJS-$(CONFIG_DFPWM_ENCODER)           += dfpwmenc.o
>  OBJS-$(CONFIG_DNXHD_DECODER)           += dnxhddec.o dnxhddata.o
>  OBJS-$(CONFIG_DNXHD_ENCODER)           += dnxhdenc.o dnxhddata.o
>  OBJS-$(CONFIG_DOLBY_E_DECODER)         += dolby_e.o dolby_e_parse.o kbdwin.o
> diff --git a/libavcodec/allcodecs.c b/libavcodec/allcodecs.c
> index d1e1019..c3a0c26 100644
> --- a/libavcodec/allcodecs.c
> +++ b/libavcodec/allcodecs.c
> @@ -437,6 +437,8 @@ extern const AVCodec ff_bmv_audio_decoder;
>  extern const AVCodec ff_cook_decoder;
>  extern const AVCodec ff_dca_encoder;
>  extern const AVCodec ff_dca_decoder;
> +extern const AVCodec ff_dfpwm_encoder;
> +extern const AVCodec ff_dfpwm_decoder;
>  extern const AVCodec ff_dolby_e_decoder;
>  extern const AVCodec ff_dsd_lsbf_decoder;
>  extern const AVCodec ff_dsd_msbf_decoder;
> diff --git a/libavcodec/codec_desc.c b/libavcodec/codec_desc.c
> index 725c687..81f3b3c 100644
> --- a/libavcodec/codec_desc.c
> +++ b/libavcodec/codec_desc.c
> @@ -3237,6 +3237,13 @@ static const AVCodecDescriptor codec_descriptors[] = {
>          .long_name = NULL_IF_CONFIG_SMALL("MSN Siren"),
>          .props     = AV_CODEC_PROP_INTRA_ONLY | AV_CODEC_PROP_LOSSY,
>      },
> +    {
> +        .id        = AV_CODEC_ID_DFPWM,
> +        .type      = AVMEDIA_TYPE_AUDIO,
> +        .name      = "dfpwm",
> +        .long_name = NULL_IF_CONFIG_SMALL("DFPWM (Dynamic Filter Pulse Width Modulation)"),
> +        .props     = AV_CODEC_PROP_LOSSY,
> +    },
>  
>      /* subtitle codecs */
>      {
> diff --git a/libavcodec/codec_id.h b/libavcodec/codec_id.h
> index ab265ec..3ffb9bd 100644
> --- a/libavcodec/codec_id.h
> +++ b/libavcodec/codec_id.h
> @@ -516,6 +516,7 @@ enum AVCodecID {
>      AV_CODEC_ID_HCA,
>      AV_CODEC_ID_FASTAUDIO,
>      AV_CODEC_ID_MSNSIREN,
> +    AV_CODEC_ID_DFPWM,
>  
>      /* subtitle codecs */
>      AV_CODEC_ID_FIRST_SUBTITLE = 0x17000,          ///< A dummy ID pointing at the start of subtitle codecs.
> diff --git a/libavcodec/dfpwmdec.c b/libavcodec/dfpwmdec.c
> new file mode 100644
> index 0000000..237c273
> --- /dev/null
> +++ b/libavcodec/dfpwmdec.c
> @@ -0,0 +1,127 @@
> +/*
> + * DFPWM decoder
> + * Copyright (c) 2022 Jack Bruienne
> + * Copyright (c) 2012, 2016 Ben "GreaseMonkey" Russell
> + *
> + * This file is part of FFmpeg.
> + *
> + * FFmpeg is free software; you can redistribute it and/or
> + * modify it under the terms of the GNU Lesser General Public
> + * License as published by the Free Software Foundation; either
> + * version 2.1 of the License, or (at your option) any later version.
> + *
> + * FFmpeg is distributed in the hope that it will be useful,
> + * but WITHOUT ANY WARRANTY; without even the implied warranty of
> + * MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the GNU
> + * Lesser General Public License for more details.
> + *
> + * You should have received a copy of the GNU Lesser General Public
> + * License along with FFmpeg; if not, write to the Free Software
> + * Foundation, Inc., 51 Franklin Street, Fifth Floor, Boston, MA 02110-1301 USA
> + */
> +
> +/**
> + * @file
> + * DFPWM1a decoder
> + */
> +
> +#include "libavutil/internal.h"
> +#include "avcodec.h"
> +#include "codec_id.h"
> +#include "internal.h"
> +
> +typedef struct {
> +    int fq, q, s, lt;
> +} DFPWMState;
> +
> +// DFPWM codec from https://github.com/ChenThread/dfpwm/blob/master/1a/
> +// Licensed in the public domain
> +
> +static void au_decompress(DFPWMState *state, int fs, int len, uint8_t *outbuf, uint8_t *inbuf)
> +{
> +    unsigned d;
> +    for (int i = 0; i < len; i++) {
> +        // get bits
> +        d = *(inbuf++);
> +        for (int j = 0; j < 8; j++) {
> +            int nq, lq, st, ns, ov;
> +            // set target
> +            int t = ((d&1) ? 127 : -128);
> +            d >>= 1;
> +
> +            // adjust charge
> +            nq = state->q + ((state->s * (t-state->q) + 512)>>10);
> +            if(nq == state->q && nq != t)
> +                nq += (t == 127 ? 1 : -1);
> +            lq = state->q;
> +            state->q = nq;
> +
> +            // adjust strength
> +            st = (t != state->lt ? 0 : 1023);
> +            ns = state->s;
> +            if(ns != st)
> +                ns += (st != 0 ? 1 : -1);
> +            if(ns < 8) ns = 8;
> +            state->s = ns;
> +
> +            // FILTER: perform antijerk
> +            ov = (t != state->lt ? (nq+lq+1)>>1 : nq);
> +
> +            // FILTER: perform LPF
> +            state->fq += ((fs*(ov-state->fq) + 0x80)>>8);
> +            ov = state->fq;
> +
> +            // output sample
> +            *(outbuf++) = ov + 128;
> +
> +            state->lt = t;
> +        }
> +    }
> +}
> +
> +static av_cold int dfpwm_dec_init(struct AVCodecContext *ctx)
> +{
> +    DFPWMState *state = ctx->priv_data;
> +
> +    state->fq = 0;
> +    state->q = 0;
> +    state->s = 0;
> +    state->lt = -128;
> +
> +    ctx->sample_fmt = AV_SAMPLE_FMT_U8;
> +    ctx->bits_per_raw_sample = 8;
> +
> +    return 0;
> +}
> +
> +static int dfpwm_dec_frame(struct AVCodecContext *ctx, void *data,
> +    int *got_frame, struct AVPacket *packet)
> +{
> +    DFPWMState *state = ctx->priv_data;
> +    AVFrame *frame = data;
> +    int ret;
> +    
> +    frame->nb_samples = packet->size * 8 / ctx->channels;

You need to abort if ctx->channels <= 0 in dfpwm_dec_init() now that 
this decoder doesn't set it and depends on the container (or the user) 
doing so. Otherwise this can end up in a division by 0.

> +    ret = ff_get_buffer(ctx, frame, 0);
> +    if (ret) {
> +        *got_frame = 0;
> +        return ret;
> +    }
> +
> +    au_decompress(state, 140, packet->size, frame->data[0], packet->data);
> +
> +    *got_frame = 1;
> +    return packet->size;
> +}
> +
> +const AVCodec ff_dfpwm_decoder = {
> +    .name           = "dfpwm",
> +    .long_name      = NULL_IF_CONFIG_SMALL("DFPWM1a audio"),
> +    .type           = AVMEDIA_TYPE_AUDIO,
> +    .id             = AV_CODEC_ID_DFPWM,
> +    .priv_data_size = sizeof(DFPWMState),
> +    .init           = dfpwm_dec_init,
> +    .decode         = dfpwm_dec_frame,
> +    .capabilities   = AV_CODEC_CAP_DR1,
> +    .caps_internal  = FF_CODEC_CAP_INIT_THREADSAFE,
> +};
> diff --git a/libavcodec/dfpwmenc.c b/libavcodec/dfpwmenc.c
> new file mode 100644
> index 0000000..02f2e64
> --- /dev/null
> +++ b/libavcodec/dfpwmenc.c
> @@ -0,0 +1,121 @@
> +/*
> + * DFPWM encoder
> + * Copyright (c) 2022 Jack Bruienne
> + * Copyright (c) 2012, 2016 Ben "GreaseMonkey" Russell
> + *
> + * This file is part of FFmpeg.
> + *
> + * FFmpeg is free software; you can redistribute it and/or
> + * modify it under the terms of the GNU Lesser General Public
> + * License as published by the Free Software Foundation; either
> + * version 2.1 of the License, or (at your option) any later version.
> + *
> + * FFmpeg is distributed in the hope that it will be useful,
> + * but WITHOUT ANY WARRANTY; without even the implied warranty of
> + * MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the GNU
> + * Lesser General Public License for more details.
> + *
> + * You should have received a copy of the GNU Lesser General Public
> + * License along with FFmpeg; if not, write to the Free Software
> + * Foundation, Inc., 51 Franklin Street, Fifth Floor, Boston, MA 02110-1301 USA
> + */
> +
> +/**
> + * @file
> + * DFPWM1a encoder
> + */
> +
> +#include "libavutil/internal.h"
> +#include "avcodec.h"
> +#include "codec_id.h"
> +#include "encode.h"
> +#include "internal.h"
> +
> +typedef struct {
> +    int fq, q, s, lt;
> +} DFPWMState;
> +
> +// DFPWM codec from https://github.com/ChenThread/dfpwm/blob/master/1a/
> +// Licensed in the public domain
> +
> +// note, len denotes how many compressed bytes there are (uncompressed bytes / 8).
> +static void au_compress(DFPWMState *state, int len, uint8_t *outbuf, uint8_t *inbuf)
> +{
> +    unsigned d = 0;
> +    for (int i = 0; i < len; i++) {
> +        for (int j = 0; j < 8; j++) {
> +            int nq, st, ns;
> +            // get sample
> +            int v = *(inbuf++) - 128;
> +            // set bit / target
> +            int t = (v > state->q || (v == state->q && v == 127) ? 127 : -128);
> +            d >>= 1;
> +            if(t > 0)
> +                d |= 0x80;
> +
> +            // adjust charge
> +            nq = state->q + ((state->s * (t-state->q) + 512)>>10);
> +            if(nq == state->q && nq != t)
> +                nq += (t == 127 ? 1 : -1);
> +            state->q = nq;
> +
> +            // adjust strength
> +            st = (t != state->lt ? 0 : 1023);
> +            ns = state->s;
> +            if(ns != st)
> +                ns += (st != 0 ? 1 : -1);
> +            if(ns < 8) ns = 8;
> +            state->s = ns;
> +
> +            state->lt = t;
> +        }
> +
> +        // output bits
> +        *(outbuf++) = d;
> +    }
> +}
> +
> +static av_cold int dfpwm_enc_init(struct AVCodecContext *ctx)
> +{
> +    DFPWMState *state = ctx->priv_data;
> +
> +    state->fq = 0;
> +    state->q = 0;
> +    state->s = 0;
> +    state->lt = -128;
> +
> +    ctx->bits_per_coded_sample = 1;
> +
> +    return 0;
> +}
> +
> +static int dfpwm_enc_frame(struct AVCodecContext *ctx, struct AVPacket *packet,
> +    const struct AVFrame *frame, int *got_packet)
> +{
> +    DFPWMState *state = ctx->priv_data;
> +    int size = frame->nb_samples * frame->channels / 8 + (frame->nb_samples % 8 > 0 ? 1 : 0);
> +    int ret = ff_get_encode_buffer(ctx, packet, size, 0);
> +
> +    if (ret) {
> +        *got_packet = 0;
> +        return ret;
> +    }
> +
> +    au_compress(state, size, packet->data, frame->data[0]);
> +
> +    *got_packet = 1;
> +    return 0;
> +}
> +
> +const AVCodec ff_dfpwm_encoder = {
> +    .name            = "dfpwm",
> +    .long_name       = NULL_IF_CONFIG_SMALL("DFPWM1a audio"),
> +    .type            = AVMEDIA_TYPE_AUDIO,
> +    .id              = AV_CODEC_ID_DFPWM,
> +    .priv_data_size  = sizeof(DFPWMState),
> +    .init            = dfpwm_enc_init,
> +    .encode2         = dfpwm_enc_frame,
> +    .sample_fmts     = (const enum AVSampleFormat[]){AV_SAMPLE_FMT_U8, AV_SAMPLE_FMT_NONE},
> +    .capabilities    = AV_CODEC_CAP_DR1 | AV_CODEC_CAP_VARIABLE_FRAME_SIZE,
> +    .caps_internal   = FF_CODEC_CAP_INIT_THREADSAFE,
> +};
> diff --git a/libavcodec/utils.c b/libavcodec/utils.c
> index 6f9d90a..066da76 100644
> --- a/libavcodec/utils.c
> +++ b/libavcodec/utils.c
> @@ -577,6 +577,8 @@ enum AVCodecID av_get_pcm_codec(enum AVSampleFormat fmt, int be)
>  int av_get_bits_per_sample(enum AVCodecID codec_id)
>  {
>      switch (codec_id) {
> +    case AV_CODEC_ID_DFPWM:
> +        return 1;
>      case AV_CODEC_ID_ADPCM_SBPRO_2:
>          return 2;
>      case AV_CODEC_ID_ADPCM_SBPRO_3:
> diff --git a/libavcodec/version.h b/libavcodec/version.h
> index d900503..84f3979 100644
> --- a/libavcodec/version.h
> +++ b/libavcodec/version.h
> @@ -28,7 +28,7 @@
>  #include "libavutil/version.h"
>  
>  #define LIBAVCODEC_VERSION_MAJOR  59
> -#define LIBAVCODEC_VERSION_MINOR  21
> +#define LIBAVCODEC_VERSION_MINOR  22
>  #define LIBAVCODEC_VERSION_MICRO 100
>  
>  #define LIBAVCODEC_VERSION_INT  AV_VERSION_INT(LIBAVCODEC_VERSION_MAJOR, \
>
Paul B Mahol Feb. 27, 2022, 12:40 p.m. UTC | #2
Please add check for <=0 channels in decoder.
diff mbox series

Patch

diff --git a/Changelog b/Changelog
index 5ad2cef..5170a6a 100644
--- a/Changelog
+++ b/Changelog
@@ -4,6 +4,7 @@  releases are sorted from youngest to oldest.
 version 5.1:
 - dialogue enhance audio filter
 - dropped obsolete XvMC hwaccel
+- DFPWM audio encoder/decoder
 
 
 version 5.0:
diff --git a/MAINTAINERS b/MAINTAINERS
index f33ccbd..57b6f33 100644
--- a/MAINTAINERS
+++ b/MAINTAINERS
@@ -161,6 +161,7 @@  Codecs:
   cscd.c                                Reimar Doeffinger
   cuviddec.c                            Timo Rothenpieler
   dca*                                  foo86
+  dfpwm*                                Jack Bruienne
   dirac*                                Rostislav Pehlivanov
   dnxhd*                                Baptiste Coudurier
   dolby_e*                              foo86
diff --git a/doc/general_contents.texi b/doc/general_contents.texi
index df1692c..14aeaed 100644
--- a/doc/general_contents.texi
+++ b/doc/general_contents.texi
@@ -1194,6 +1194,7 @@  following image formats are supported:
 @item CRI HCA                @tab     @tab X
 @item Delphine Software International CIN audio  @tab     @tab  X
     @tab Codec used in Delphine Software International games.
+@item DFPWM                  @tab  X  @tab  X
 @item Digital Speech Standard - Standard Play mode (DSS SP) @tab     @tab  X
 @item Discworld II BMV Audio @tab     @tab  X
 @item COOK                   @tab     @tab  X
diff --git a/libavcodec/Makefile b/libavcodec/Makefile
index 6076b4a..7474220 100644
--- a/libavcodec/Makefile
+++ b/libavcodec/Makefile
@@ -289,6 +289,8 @@  OBJS-$(CONFIG_DERF_DPCM_DECODER)       += dpcm.o
 OBJS-$(CONFIG_DIRAC_DECODER)           += diracdec.o dirac.o diracdsp.o diractab.o \
                                           dirac_arith.o dirac_dwt.o dirac_vlc.o
 OBJS-$(CONFIG_DFA_DECODER)             += dfa.o
+OBJS-$(CONFIG_DFPWM_DECODER)           += dfpwmdec.o
+OBJS-$(CONFIG_DFPWM_ENCODER)           += dfpwmenc.o
 OBJS-$(CONFIG_DNXHD_DECODER)           += dnxhddec.o dnxhddata.o
 OBJS-$(CONFIG_DNXHD_ENCODER)           += dnxhdenc.o dnxhddata.o
 OBJS-$(CONFIG_DOLBY_E_DECODER)         += dolby_e.o dolby_e_parse.o kbdwin.o
diff --git a/libavcodec/allcodecs.c b/libavcodec/allcodecs.c
index d1e1019..c3a0c26 100644
--- a/libavcodec/allcodecs.c
+++ b/libavcodec/allcodecs.c
@@ -437,6 +437,8 @@  extern const AVCodec ff_bmv_audio_decoder;
 extern const AVCodec ff_cook_decoder;
 extern const AVCodec ff_dca_encoder;
 extern const AVCodec ff_dca_decoder;
+extern const AVCodec ff_dfpwm_encoder;
+extern const AVCodec ff_dfpwm_decoder;
 extern const AVCodec ff_dolby_e_decoder;
 extern const AVCodec ff_dsd_lsbf_decoder;
 extern const AVCodec ff_dsd_msbf_decoder;
diff --git a/libavcodec/codec_desc.c b/libavcodec/codec_desc.c
index 725c687..81f3b3c 100644
--- a/libavcodec/codec_desc.c
+++ b/libavcodec/codec_desc.c
@@ -3237,6 +3237,13 @@  static const AVCodecDescriptor codec_descriptors[] = {
         .long_name = NULL_IF_CONFIG_SMALL("MSN Siren"),
         .props     = AV_CODEC_PROP_INTRA_ONLY | AV_CODEC_PROP_LOSSY,
     },
+    {
+        .id        = AV_CODEC_ID_DFPWM,
+        .type      = AVMEDIA_TYPE_AUDIO,
+        .name      = "dfpwm",
+        .long_name = NULL_IF_CONFIG_SMALL("DFPWM (Dynamic Filter Pulse Width Modulation)"),
+        .props     = AV_CODEC_PROP_LOSSY,
+    },
 
     /* subtitle codecs */
     {
diff --git a/libavcodec/codec_id.h b/libavcodec/codec_id.h
index ab265ec..3ffb9bd 100644
--- a/libavcodec/codec_id.h
+++ b/libavcodec/codec_id.h
@@ -516,6 +516,7 @@  enum AVCodecID {
     AV_CODEC_ID_HCA,
     AV_CODEC_ID_FASTAUDIO,
     AV_CODEC_ID_MSNSIREN,
+    AV_CODEC_ID_DFPWM,
 
     /* subtitle codecs */
     AV_CODEC_ID_FIRST_SUBTITLE = 0x17000,          ///< A dummy ID pointing at the start of subtitle codecs.
diff --git a/libavcodec/dfpwmdec.c b/libavcodec/dfpwmdec.c
new file mode 100644
index 0000000..237c273
--- /dev/null
+++ b/libavcodec/dfpwmdec.c
@@ -0,0 +1,127 @@ 
+/*
+ * DFPWM decoder
+ * Copyright (c) 2022 Jack Bruienne
+ * Copyright (c) 2012, 2016 Ben "GreaseMonkey" Russell
+ *
+ * This file is part of FFmpeg.
+ *
+ * FFmpeg is free software; you can redistribute it and/or
+ * modify it under the terms of the GNU Lesser General Public
+ * License as published by the Free Software Foundation; either
+ * version 2.1 of the License, or (at your option) any later version.
+ *
+ * FFmpeg is distributed in the hope that it will be useful,
+ * but WITHOUT ANY WARRANTY; without even the implied warranty of
+ * MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the GNU
+ * Lesser General Public License for more details.
+ *
+ * You should have received a copy of the GNU Lesser General Public
+ * License along with FFmpeg; if not, write to the Free Software
+ * Foundation, Inc., 51 Franklin Street, Fifth Floor, Boston, MA 02110-1301 USA
+ */
+
+/**
+ * @file
+ * DFPWM1a decoder
+ */
+
+#include "libavutil/internal.h"
+#include "avcodec.h"
+#include "codec_id.h"
+#include "internal.h"
+
+typedef struct {
+    int fq, q, s, lt;
+} DFPWMState;
+
+// DFPWM codec from https://github.com/ChenThread/dfpwm/blob/master/1a/
+// Licensed in the public domain
+
+static void au_decompress(DFPWMState *state, int fs, int len, uint8_t *outbuf, uint8_t *inbuf)
+{
+    unsigned d;
+    for (int i = 0; i < len; i++) {
+        // get bits
+        d = *(inbuf++);
+        for (int j = 0; j < 8; j++) {
+            int nq, lq, st, ns, ov;
+            // set target
+            int t = ((d&1) ? 127 : -128);
+            d >>= 1;
+
+            // adjust charge
+            nq = state->q + ((state->s * (t-state->q) + 512)>>10);
+            if(nq == state->q && nq != t)
+                nq += (t == 127 ? 1 : -1);
+            lq = state->q;
+            state->q = nq;
+
+            // adjust strength
+            st = (t != state->lt ? 0 : 1023);
+            ns = state->s;
+            if(ns != st)
+                ns += (st != 0 ? 1 : -1);
+            if(ns < 8) ns = 8;
+            state->s = ns;
+
+            // FILTER: perform antijerk
+            ov = (t != state->lt ? (nq+lq+1)>>1 : nq);
+
+            // FILTER: perform LPF
+            state->fq += ((fs*(ov-state->fq) + 0x80)>>8);
+            ov = state->fq;
+
+            // output sample
+            *(outbuf++) = ov + 128;
+
+            state->lt = t;
+        }
+    }
+}
+
+static av_cold int dfpwm_dec_init(struct AVCodecContext *ctx)
+{
+    DFPWMState *state = ctx->priv_data;
+
+    state->fq = 0;
+    state->q = 0;
+    state->s = 0;
+    state->lt = -128;
+
+    ctx->sample_fmt = AV_SAMPLE_FMT_U8;
+    ctx->bits_per_raw_sample = 8;
+
+    return 0;
+}
+
+static int dfpwm_dec_frame(struct AVCodecContext *ctx, void *data,
+    int *got_frame, struct AVPacket *packet)
+{
+    DFPWMState *state = ctx->priv_data;
+    AVFrame *frame = data;
+    int ret;
+    
+    frame->nb_samples = packet->size * 8 / ctx->channels;
+    ret = ff_get_buffer(ctx, frame, 0);
+    if (ret) {
+        *got_frame = 0;
+        return ret;
+    }
+
+    au_decompress(state, 140, packet->size, frame->data[0], packet->data);
+
+    *got_frame = 1;
+    return packet->size;
+}
+
+const AVCodec ff_dfpwm_decoder = {
+    .name           = "dfpwm",
+    .long_name      = NULL_IF_CONFIG_SMALL("DFPWM1a audio"),
+    .type           = AVMEDIA_TYPE_AUDIO,
+    .id             = AV_CODEC_ID_DFPWM,
+    .priv_data_size = sizeof(DFPWMState),
+    .init           = dfpwm_dec_init,
+    .decode         = dfpwm_dec_frame,
+    .capabilities   = AV_CODEC_CAP_DR1,
+    .caps_internal  = FF_CODEC_CAP_INIT_THREADSAFE,
+};
diff --git a/libavcodec/dfpwmenc.c b/libavcodec/dfpwmenc.c
new file mode 100644
index 0000000..02f2e64
--- /dev/null
+++ b/libavcodec/dfpwmenc.c
@@ -0,0 +1,121 @@ 
+/*
+ * DFPWM encoder
+ * Copyright (c) 2022 Jack Bruienne
+ * Copyright (c) 2012, 2016 Ben "GreaseMonkey" Russell
+ *
+ * This file is part of FFmpeg.
+ *
+ * FFmpeg is free software; you can redistribute it and/or
+ * modify it under the terms of the GNU Lesser General Public
+ * License as published by the Free Software Foundation; either
+ * version 2.1 of the License, or (at your option) any later version.
+ *
+ * FFmpeg is distributed in the hope that it will be useful,
+ * but WITHOUT ANY WARRANTY; without even the implied warranty of
+ * MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the GNU
+ * Lesser General Public License for more details.
+ *
+ * You should have received a copy of the GNU Lesser General Public
+ * License along with FFmpeg; if not, write to the Free Software
+ * Foundation, Inc., 51 Franklin Street, Fifth Floor, Boston, MA 02110-1301 USA
+ */
+
+/**
+ * @file
+ * DFPWM1a encoder
+ */
+
+#include "libavutil/internal.h"
+#include "avcodec.h"
+#include "codec_id.h"
+#include "encode.h"
+#include "internal.h"
+
+typedef struct {
+    int fq, q, s, lt;
+} DFPWMState;
+
+// DFPWM codec from https://github.com/ChenThread/dfpwm/blob/master/1a/
+// Licensed in the public domain
+
+// note, len denotes how many compressed bytes there are (uncompressed bytes / 8).
+static void au_compress(DFPWMState *state, int len, uint8_t *outbuf, uint8_t *inbuf)
+{
+    unsigned d = 0;
+    for (int i = 0; i < len; i++) {
+        for (int j = 0; j < 8; j++) {
+            int nq, st, ns;
+            // get sample
+            int v = *(inbuf++) - 128;
+            // set bit / target
+            int t = (v > state->q || (v == state->q && v == 127) ? 127 : -128);
+            d >>= 1;
+            if(t > 0)
+                d |= 0x80;
+
+            // adjust charge
+            nq = state->q + ((state->s * (t-state->q) + 512)>>10);
+            if(nq == state->q && nq != t)
+                nq += (t == 127 ? 1 : -1);
+            state->q = nq;
+
+            // adjust strength
+            st = (t != state->lt ? 0 : 1023);
+            ns = state->s;
+            if(ns != st)
+                ns += (st != 0 ? 1 : -1);
+            if(ns < 8) ns = 8;
+            state->s = ns;
+
+            state->lt = t;
+        }
+
+        // output bits
+        *(outbuf++) = d;
+    }
+}
+
+static av_cold int dfpwm_enc_init(struct AVCodecContext *ctx)
+{
+    DFPWMState *state = ctx->priv_data;
+
+    state->fq = 0;
+    state->q = 0;
+    state->s = 0;
+    state->lt = -128;
+
+    ctx->bits_per_coded_sample = 1;
+
+    return 0;
+}
+
+static int dfpwm_enc_frame(struct AVCodecContext *ctx, struct AVPacket *packet,
+    const struct AVFrame *frame, int *got_packet)
+{
+    DFPWMState *state = ctx->priv_data;
+    int size = frame->nb_samples * frame->channels / 8 + (frame->nb_samples % 8 > 0 ? 1 : 0);
+    int ret = ff_get_encode_buffer(ctx, packet, size, 0);
+
+    if (ret) {
+        *got_packet = 0;
+        return ret;
+    }
+
+    au_compress(state, size, packet->data, frame->data[0]);
+
+    *got_packet = 1;
+    return 0;
+}
+
+const AVCodec ff_dfpwm_encoder = {
+    .name            = "dfpwm",
+    .long_name       = NULL_IF_CONFIG_SMALL("DFPWM1a audio"),
+    .type            = AVMEDIA_TYPE_AUDIO,
+    .id              = AV_CODEC_ID_DFPWM,
+    .priv_data_size  = sizeof(DFPWMState),
+    .init            = dfpwm_enc_init,
+    .encode2         = dfpwm_enc_frame,
+    .sample_fmts     = (const enum AVSampleFormat[]){AV_SAMPLE_FMT_U8, AV_SAMPLE_FMT_NONE},
+    .capabilities    = AV_CODEC_CAP_DR1 | AV_CODEC_CAP_VARIABLE_FRAME_SIZE,
+    .caps_internal   = FF_CODEC_CAP_INIT_THREADSAFE,
+};
diff --git a/libavcodec/utils.c b/libavcodec/utils.c
index 6f9d90a..066da76 100644
--- a/libavcodec/utils.c
+++ b/libavcodec/utils.c
@@ -577,6 +577,8 @@  enum AVCodecID av_get_pcm_codec(enum AVSampleFormat fmt, int be)
 int av_get_bits_per_sample(enum AVCodecID codec_id)
 {
     switch (codec_id) {
+    case AV_CODEC_ID_DFPWM:
+        return 1;
     case AV_CODEC_ID_ADPCM_SBPRO_2:
         return 2;
     case AV_CODEC_ID_ADPCM_SBPRO_3:
diff --git a/libavcodec/version.h b/libavcodec/version.h
index d900503..84f3979 100644
--- a/libavcodec/version.h
+++ b/libavcodec/version.h
@@ -28,7 +28,7 @@ 
 #include "libavutil/version.h"
 
 #define LIBAVCODEC_VERSION_MAJOR  59
-#define LIBAVCODEC_VERSION_MINOR  21
+#define LIBAVCODEC_VERSION_MINOR  22
 #define LIBAVCODEC_VERSION_MICRO 100
 
 #define LIBAVCODEC_VERSION_INT  AV_VERSION_INT(LIBAVCODEC_VERSION_MAJOR, \