[FFmpeg-devel,1/2] avutil/md5: fix misaligned reads

Message ID	20170303045420.5036-1-jamrial@gmail.com
State	Accepted
Commit	e2b7ae4b198c1dc001b3b28476608eaf4daf726c
Headers	show Delivered-To: ffmpegpatchwork@gmail.com Received-SPF: pass (google.com: domain of ffmpeg-devel-bounces@ffmpeg.org designates 79.124.17.100 as permitted sender) client-ip=79.124.17.100; From: James Almer <jamrial@gmail.com> To: ffmpeg-devel@ffmpeg.org Date: Fri, 3 Mar 2017 01:54:19 -0300 Message-Id: <20170303045420.5036-1-jamrial@gmail.com> Subject: [FFmpeg-devel] [PATCH 1/2] avutil/md5: fix misaligned reads Precedence: list Reply-To: FFmpeg development discussions and patches <ffmpeg-devel@ffmpeg.org> MIME-Version: 1.0 Content-Type: text/plain; charset="utf-8" Content-Transfer-Encoding: base64 Errors-To: ffmpeg-devel-bounces@ffmpeg.org Sender: "ffmpeg-devel" <ffmpeg-devel-bounces@ffmpeg.org>

Message ID

20170303045420.5036-1-jamrial@gmail.com

State

Accepted

Commit

e2b7ae4b198c1dc001b3b28476608eaf4daf726c

Headers

Received-SPF: pass (google.com: domain of ffmpeg-devel-bounces@ffmpeg.org
	designates 79.124.17.100 as permitted sender)
	client-ip=79.124.17.100; 
From: James Almer <jamrial@gmail.com>
To: ffmpeg-devel@ffmpeg.org
Date: Fri,  3 Mar 2017 01:54:19 -0300
Message-Id: <20170303045420.5036-1-jamrial@gmail.com>
Subject: [FFmpeg-devel] [PATCH 1/2] avutil/md5: fix misaligned reads
Precedence: list
Reply-To: FFmpeg development discussions and patches
	<ffmpeg-devel@ffmpeg.org>
MIME-Version: 1.0
Content-Type: text/plain; charset="utf-8"
Content-Transfer-Encoding: base64
Errors-To: ffmpeg-devel-bounces@ffmpeg.org
Sender: "ffmpeg-devel" <ffmpeg-devel-bounces@ffmpeg.org>

This makes ubsan happy and also considerably increases performance on big endian systems. Signed-off-by: James Almer <jamrial@gmail.com> --- libavutil/md5.c | 15 +++++---------- 1 file changed, 5 insertions(+), 10 deletions(-)

Comments

Michael Niedermayer March 3, 2017, 3:34 p.m. UTC | #1

On Fri, Mar 03, 2017 at 01:54:19AM -0300, James Almer wrote:
> This makes ubsan happy and also considerably increases performance on
> big endian systems.

please include the difference in speed in the commit message if you
tested (it sounds like you did)

should be fine otherwise

thx

[...]

James Almer March 3, 2017, 4:38 p.m. UTC | #2

On 3/3/2017 12:34 PM, Michael Niedermayer wrote:
> On Fri, Mar 03, 2017 at 01:54:19AM -0300, James Almer wrote:
>> This makes ubsan happy and also considerably increases performance on
>> big endian systems.
> 
> please include the difference in speed in the commit message if you
> tested (it sounds like you did)

I didn't, Nicolas George did some time ago back the first time i sent
this patch.
https://ffmpeg.org/pipermail/ffmpeg-devel/2016-February/190027.html

> 
> should be fine otherwise
> 
> thx

Pushed.

> 
> [...]
> 
> 
> 
> _______________________________________________
> ffmpeg-devel mailing list
> ffmpeg-devel@ffmpeg.org
> http://ffmpeg.org/mailman/listinfo/ffmpeg-devel
>

diff --git a/libavutil/md5.c b/libavutil/md5.c
index 8c36aa80c4..d3698dcb1d 100644
--- a/libavutil/md5.c
+++ b/libavutil/md5.c
@@ -86,14 +86,14 @@  static const uint32_t T[64] = { // T[i]= fabs(sin(i+1)<<32)
                                                                         \
         if (i < 32) {                                                   \
             if (i < 16)                                                 \
-                a += (d ^ (b & (c ^ d)))  + X[       i  & 15];          \
+                a += (d ^ (b & (c ^ d)))  + AV_RL32(X+(       i  & 15));\
             else                                                        \
-                a += ((d & b) | (~d & c)) + X[(1 + 5*i) & 15];          \
+                a += ((d & b) | (~d & c)) + AV_RL32(X+((1 + 5*i) & 15));\
         } else {                                                        \
             if (i < 48)                                                 \
-                a += (b ^ c ^ d)          + X[(5 + 3*i) & 15];          \
+                a += (b ^ c ^ d)          + AV_RL32(X+((5 + 3*i) & 15));\
             else                                                        \
-                a += (c ^ (b | ~d))       + X[(    7*i) & 15];          \
+                a += (c ^ (b | ~d))       + AV_RL32(X+((    7*i) & 15));\
         }                                                               \
         a = b + (a << t | a >> (32 - t));                               \
     } while (0)
@@ -112,11 +112,6 @@  static void body(uint32_t ABCD[4], uint32_t *src, int nblocks)
 
         X = src + n * 16;
 
-#if HAVE_BIGENDIAN
-        for (i = 0; i < 16; i++)
-            X[i] = av_bswap32(X[i]);
-#endif
-
 #if CONFIG_SMALL
         for (i = 0; i < 64; i++) {
             CORE(i, a, b, c, d);
@@ -173,7 +168,7 @@  void av_md5_update(AVMD5 *ctx, const uint8_t *src, int len)
     }
 
     end = src + (len & ~63);
-    if (HAVE_BIGENDIAN || (!HAVE_FAST_UNALIGNED && ((intptr_t)src & 3))) {
+    if (!HAVE_FAST_UNALIGNED && ((intptr_t)src & 3)) {
        while (src < end) {
            memcpy(ctx->block, src, 64);
            body(ctx->ABCD, (uint32_t *) ctx->block, 1);

[FFmpeg-devel,1/2] avutil/md5: fix misaligned reads

Commit Message

Comments

Patch