[原创] 记一次xxmain.so 从去花到魔改算法还原-Android安全-看雪-安全社区|安全招聘|kanxue.com

[原创] 记一次xxmain.so 从去花到魔改算法还原

发表于: 2024-9-23 00:14 36254

[原创] 记一次xxmain.so 从去花到魔改算法还原

劫__

2024-9-23 00:14

36254

本文章仅做移动安全学习交流用途严禁作其他用途

目标版本是8.55.6

目标算法是s***leSign 目标位置:0xbe11c

所用工具: IDA Pro, 010 Editor, unidbg, frida

把目标so拖进ida 让ida狠狠的分析这个so

当IDA分析完成后按快捷键G跳转到我们目标函数的位置

可以看到这个地方的都被识别成数据段了按C把它强制转成代码再按P把它定义成一个函数

正当我按下f5以为可以高枕无忧,狠狠分析的时候,显示的内容却让我傻了眼

发现有花指令[垃圾指令]的存在干扰了IDA的线性分析索性撂挑子不干了直接显示一个JUMPOUT

去到0xBE168看看怎么个事

可以看到当程序正常的执行流走到0xBE164处先进行了一个压栈的操作随后加载了一个DWORD存储到R0然后就跳转到了SUB_E9DE0函数,到这里IDA就飘红了

再去看看SUB_E9DE0函数:

简单分析一下可以看出这里貌似是在做某种运算?把传入的数值做某种运算后写入栈中,最终弹出PC寄存器使得程序的执行流去到某个地方

到这里大概可以看出 IDA之所以飘红的原因是因为IDA是线性反汇编这种类似于间接跳转的代码块因为缺少上下文 IDA并不知道这里去到哪里所以显示的JUMP OUT 从而达到对抗静态分析的目的

上面只是从ida来分析解决对抗花指令还得从动态执行来

先搭个unidbg架子

发现可以直接跑起来,不需要补环境还是很不错的

也取到了一份执行过程中的tracecode

使用010 Editor 打开trace文件转到0xbe164处

基于前面从ida中的简单分析可以看出在压栈操作后这段代码是把加载的一个跳转数x<<2后加上因为bl #0x400e9de0处而改变的lr寄存器的值得到一个最终的跳转地址从而改变程序执行流的位置

即 jump_addr = x * 2 + bl_addr + 4; bl_addr => bl指令所在的地址

注意到这个跳转代码块首尾有压栈出栈(恢复寄存器现场)的操作故可在压栈处直接改成直接跳转并不会影响寄存器现场

比如0xbe164处的汇编可以_修改为: B 0xc5490 _

这样就可以直接patch真实跳转地址从而让ida更好的反汇编

有了间接跳转块代码的分析就可以开始写ida python 愉快的去花了

从整个trace文件中搜索 push {r0, r1, lr} 发现共有8097处汇编代码且下方紧跟着的就是 ldr r0, [pc, #4]

那就可以根据这两行汇编作为间接跳转块的特征进行去花

因为ida把大部分汇编代码都识别成数据了一个一个按c去强转不太现实但不强转为汇编用ida python的api 获取当前地址的汇编代码又会发生错误

所以我决定使用capStone 对每一条汇编进行单独解析

在ida中执行脚本后控制台输出了每一个间接跳转块的真实跳转地址:

把这些真实跳转地址保存到一个patch_jump.txt文件

再写一段ida python脚本来patch每一个间接跳转点

保存patch完的so文件libxxmain_fix.so 再次拖入ida分析去到 0xBE164 处

可以看到此处的汇编代码已经变为直接跳转到目标地址了

再次f5

还是有一个jump out 是怎么回事呢跳到0xbedd4看看

原来是ida把此处识别为了arm汇编代码只要在0xbedd2处按d 转换为数据再在0xbedd4处按c 转为汇编即可

后续多个jumpout大多都是这个问题如法炮制即可再在ida左下角选择重新分析

经过多次修复 sub_BE11C 这个函数终于能看到一些jni细节了

把patch完成的so 放入unidbg中再次调用目标方法可以正常跑起来没有报错且结果与patch前一致说明patch并没有改变程序原来的执行流程图就不放了

当分析一个算法的具体过程的时候在入参不变的情况下输出的结果却一直变化这给分析算法过程增加了不少的工作量为了减小因结果变化而增加的工作量对结果进行固定无疑是最好的选择

程序在unidbg中跑起来后，发现即使固定函数的输入结果依然在变化猜测结果的内容应该与某些动态参数有关

随机数？时间？还是其他什么因素导致了结果的变化只需在unidbg中把对应的函数结果固定即可

固定结果后重新用unidbg取一份trace文件traceCode.txt

固定了结果后发现在时间戳的前面，正好是32个字符，猜测是某种哈希算法猜测是MD5哈希校验

从trace文件中小端序搜索结果的前8位(四个字节)

在0x12CD04处是结果第一次生成的地方且下方不远处也有结果2,3,4 跳转到ida看看

找一份md5实现代码:

既然知道了是md5哈希校验,在trace中搜索标准md5中用的魔数表,初始iv,都没有搜索到那大概率是改过iv和魔数表了接下来就是从trace中逆推出明文初始iv 魔数了

在trace文件中搜索循环右移(rors) 正好在结果的上方不远处就有一个rors 且这个循环右移是第128个循环右移从左边的搜索结果分布图来看搜索结果很集中说明程序的其他地方没有运用到循环右移

由上面md5哈希校验代码可知一次md5哈希校验有64轮计算其中一轮计算有一次循环左移而这里搜索出了128个循环右移猜测明文长度超过了64个字节或者进行了多次md5计算也可能是修改了计算次数

根据循环右移搜索的结果来看前64次逻辑左移与后64逻辑左移使用的偏移数与标准md5所使用的一致

在trace文件中找到第一次rors 往上找寻代码

利用上面的分析思路在整个trace文件中找出从M1至M128 如下:

再像上面一样从trace中取出整个码表

依据上面的初始模数与码表写一份magicMD5

前面分析了传入魔改md5的明文接下来研究明文如何生成的前面八个字节很明显是我们固定的时间从第九个字节开始分析

祭出龙哥的HexSearch大法

程序在0x10ec32处断下并且告知 0x4041c000处存放了搜索的数据

对这个地址进行tracewrite看看是哪里写入的这段明文

从日志中看出在0x1121d6处对地址写入了明文再到trace中搜索一下这个地址

发现与我们明文的一部分一致往上查找明文0x61从何处来的

往上寻找0x61生成可以看出这是直接从0x402351e0处加载一个字节得到的0x61 在0x1121d6 处下个断点看看0x402351e0这个地方存放的什么

0x402351e0处存放的似乎是一个码表猜测是经过某种运算得到一个偏移随后从码表中加载这个偏移处的字节作为明文

到IDA中看看这部分生成明文的代码

从ida的分析来看这里有一个进行了91次的循环正好对应了明文的91个字符 sub_2167DC函数返回的结果对应着明文位于码表中的偏移 hook一下看看

发现结果偏移的结果与明文对照码表中的结果一致

接下来看看这个偏移是如何生成的: hook sub_2167DC的第一个参数v2 得到 0xe4a
第二个参数固定为62不必多说是码表的长度

结合tarace汇编与伪c来看 sub_2167DC函数是调用sub_216744并传入参数a1与固定整数62 随后用a1-(得到的结果*62) = 码表中的偏移

即: 码表中的偏移 = x - (sub_216744(x) * 62)

查看sub_216744函数伪c 应该是某种自写的算法不分析了直接丢给gpt 让他转成python代码

接下来就是寻找入参v2的生成了已知hook到第一次入参为0xe4a 在trace中向上寻找生成

直接上代码

核心思路与前32位一致在分析前32位时发现有进行两次md5 结果的后32位也是md5

为节省篇幅长话短说:

其中a,c,d,f,g明文块都是简单异或运算在trace中就能搜到不作赘述

上方分析了md5明文的组成部分其中特别注意到明文块b 长度为32个字节猜测是sha256 在trace文件中搜索sha256校验中的标准魔数并未搜索到猜测是魔改过了

在trace文件中搜索疑似sha256结果的前四个字节看看在哪生成的

发现并未搜索出结果往上分析才发现原来b明文块是经过了异或0xcd运算后的将结果异或0xcd后再次搜索有了结果

跳转到ida看看

在ida中看到了非常明显的sha256计算特征

综上确定了这32字节的算法就是经过sha256校验后再异或0xcd

确定了是sha256算法先寻找sha256的明文

由sha256校验的密码学相关知识可知 sha256算法内核心步骤为:

利用bigSig0的旋转右移进行bigSig定位在trace文件中搜索旋转右移相关

直接搜索ror rors(旋转右移) 发现并没有相关结果猜测可能是变换了另外一种形式实现的旋转右移

在trace文件中搜索lsrs(逻辑右移) 发现有很多结果正则筛选一下 (lsrs*#2)

发现在0x15f7d4处有逻辑左移0x1e 往下看到有逻辑右移2 最终相加这样的运算等价于旋转右移

跳到ida看看

发现此处也有类似sha256计算的代码块经过简单分析可以看出伪代码中 v44就是当前参与计算的明文块

dword_22A0F4[**(_DWORD **)(v39 + 72)] 则是参与当前计算的k表

在trace文件中查看v44的赋值简单分析可以看出明文存放于0x4041c000处在0x15f76a处下个断点看看内存

与trace中分析得到的参与计算的明文一致此处就是sha256参与计算的明文了

至此sha256计算中的初向量 k表明文都已找到

找一份标准sha256代码更改其中的iv和k表尝试还原该魔改sha256算法

发现输出结果与上面分析md5明文的一致分析明文完成

明文的组成也比较简单就是一些简单异或在trace中均可分析出来就不作过多赘述了至此,整个算法的还原完成

样本难度中等综合了花指令魔改算法 ollvm混淆是个练手的好样本

因为混淆的存在不能过分依赖ida的f5进行静态分析在trace中进行算法还原是个不错的选择

小弟初接触此类混淆算法还原若有表述不当之处还望各位大牛批评指正,共同进步 Xiaoochenn_

希望文章对正在学习移动安全的伙伴有所帮助

感谢星球伙伴 @落叶的算法笔记

public class SecurityUtil extends AbstractJni {

    private final AndroidEmulator emulator;

    private final VM vm;

    private final Module module;

    private final DvmObject NativeLibHelper;
 
    SecurityUtil() {

        emulator = AndroidEmulatorBuilder.for32Bit()

        .setProcessName("xxxxx.android.xxxx")   //你懂的

        .addBackendFactory(new Unicorn2Factory(true))

        .build(); // 创建模拟器实例，要模拟32位或者64位，在这里区分
 
        final Memory memory = emulator.getMemory(); // 模拟器的内存操作接口

        memory.setLibraryResolver(new AndroidResolver(23)); // 设置系统类库解析

        vm = emulator.createDalvikVM();

        vm.setVerbose(true); // 设置是否打印Jni调用细节

        vm.setJni(this);

        new AndroidModule(emulator, vm).register(memory);
 
        DalvikModule dm = vm.loadLibrary(new File("libxxmain.so"), true); //

        module = dm.getModule();
 
        dm.callJNI_OnLoad(emulator); // 手动执行JNI_OnLoad函数

        NativeLibHelper = vm.resolveClass("xxxxx/android/xxxxxxxx/SecurityUtil").newObject(null);//你懂的

    }

    public void Sign(){

        String traceFile = "traceCode.txt";

        PrintStream traceStream;

        try {

            traceStream = new PrintStream(new FileOutputStream(traceFile), true);

            emulator.traceCode(module.base,module.base+module.size).setRedirect(traceStream);

        } catch (FileNotFoundException e) {

            e.printStackTrace();

        }

        byte[] bytes = {100,52,54,102,54,101,100,55,55,57,57,55,51,56,102,97,48,52,57,54,97,56,50,57,53,52,97,49,50,51,54,101};

        ByteArray barr = new ByteArray(vm,bytes);

        StringObject str = new StringObject(vm,"getdata");

        String stringObject = NativeLibHelper.callJniMethodObject(emulator,"s***leSign([BLjava/lang/String;)Ljava/lang/String;",barr,str).toString().replace("\"","");

        Inspector.inspect(stringObject.getBytes(StandardCharsets.UTF_8),"result");

        return;

    }

    public static void main(String[] args) {

        SecurityUtil securityUtil = new SecurityUtil();

        securityUtil.Sign();

    }
}

public class SecurityUtil extends AbstractJni {

private final AndroidEmulator emulator;

private final VM vm;

private final Module module;

private final DvmObject NativeLibHelper;

SecurityUtil() {

emulator = AndroidEmulatorBuilder.for32Bit()

.setProcessName("xxxxx.android.xxxx") //你懂的

.addBackendFactory(new Unicorn2Factory(true))

.build(); // 创建模拟器实例，要模拟32位或者64位，在这里区分

final Memory memory = emulator.getMemory(); // 模拟器的内存操作接口

memory.setLibraryResolver(new AndroidResolver(23)); // 设置系统类库解析

vm = emulator.createDalvikVM();

vm.setVerbose(true); // 设置是否打印Jni调用细节

vm.setJni(this);

new AndroidModule(emulator, vm).register(memory);

DalvikModule dm = vm.loadLibrary(new File("libxxmain.so"), true); //

module = dm.getModule();

dm.callJNI_OnLoad(emulator); // 手动执行JNI_OnLoad函数

NativeLibHelper = vm.resolveClass("xxxxx/android/xxxxxxxx/SecurityUtil").newObject(null);//你懂的

}

public void Sign(){

String traceFile = "traceCode.txt";

PrintStream traceStream;

try {

traceStream = new PrintStream(new FileOutputStream(traceFile), true);

emulator.traceCode(module.base,module.base+module.size).setRedirect(traceStream);

} catch (FileNotFoundException e) {

e.printStackTrace();

}

byte[] bytes = {100,52,54,102,54,101,100,55,55,57,57,55,51,56,102,97,48,52,57,54,97,56,50,57,53,52,97,49,50,51,54,101};

ByteArray barr = new ByteArray(vm,bytes);

StringObject str = new StringObject(vm,"getdata");

String stringObject = NativeLibHelper.callJniMethodObject(emulator,"s***leSign([BLjava/lang/String;)Ljava/lang/String;",barr,str).toString().replace("\"","");

Inspector.inspect(stringObject.getBytes(StandardCharsets.UTF_8),"result");

return;

}

public static void main(String[] args) {

SecurityUtil securityUtil = new SecurityUtil();

securityUtil.Sign();

}

from capstone import *

from keystone import *
 
cs = Cs(CS_ARCH_ARM, CS_MODE_THUMB)

ks = Ks(keystone.KS_ARCH_ARM, keystone.KS_MODE_THUMB)
 
def generate(code, addr):

    # 参数2是地址，很多指令是地址相关的，比如 B 指令，如果地址无关直接传 0 即可，比如 nop。

    encoding, _ = ks.asm(code, addr)

    return encoding
 
def get_opcode(machine_code, code_address):

    #利用capstone反汇编代码

    assembly = []

    for insn in cs.disasm(machine_code, code_address):

        if insn.mnemonic != "":

            assembly.append(insn.mnemonic)

            assembly.append(insn.op_str)
 
    return assembly

def patch_b(addr, target_addr):

    code = f"B {hex(target_addr)}"

    bCode = generate(code, addr)
 
    # 此处本意是在获取到真实跳转地址后立即patch 后在执行过程中发现 

    # 当前面被patch后会影响后面其他位置真实位置的计算 故作罢

    if (bCode != None):

        #ida_bytes.patch_bytes(addr, bytes(bCode))

        # print("patch:", hex(addr),"  code:",code)

        print(hex(addr)+“|”+code)
 
def patch(addr):

    if idc.get_wide_word(addr) == 0xb503:  # PUSH   {R0,R1,LR}

        addr_ = addr + 2

        if idc.get_wide_word(addr_) == 0x4801:  # ldr r0, [pc, #4] 从pc+4处取出

            lr = ""

            jump_code_addr = addr_ + 4 + 4

            if (jump_code_addr % 4 == 2):

                jump_code_addr = jump_code_addr - 2  # 做四字节对其

            jump_code = idc.get_wide_dword(jump_code_addr)
 
            # 开始判断ldr r0, [pc, #4]的下一句是否为BL

            addr_ = addr_ + 2

            code = idc.get_wide_dword(addr_).to_bytes(4, byteorder='little')  # 小端序取出四个字节

            opcode = get_opcode(code, addr_)
 
            if len(opcode) != 0 and opcode[0] == 'bl':  # 判断是否跳转语句

                jump_addr_1 = int(opcode[1][1:], 16)

                code = idc.get_wide_dword(jump_addr_1).to_bytes(4, byteorder='little')

                opcode = get_opcode(code, jump_addr_1)
 
                if len(opcode) != 0 and opcode[0] == 'bl':  # 判断是否有二次跳转

                    jump_addr_2 = int(opcode[1][1:], 16)

                    code = idc.get_wide_dword(jump_addr_2).to_bytes(4, byteorder='little')

                    opcode = get_opcode(code, jump_addr_2)

                    if len(opcode) != 0 and opcode[0] == 'bx':

                        lr = jump_addr_1 + 4
 
                elif len(opcode) != 0 and opcode[0] == 'bx':

                    lr = addr_ + 4

            # print("lr:"+lr)

            if lr != "":

                r1 = idc.get_wide_dword(lr + (jump_code << 2))

                real_jump_addr = lr + r1

                patch_b(addr, real_jump_addr)
 
if __name__ == '__main__':

    for i in range(0x9EB8, 0x218Acc):   #patch 整个.data段

        patch(i)

from capstone import *

from keystone import *

cs = Cs(CS_ARCH_ARM, CS_MODE_THUMB)

ks = Ks(keystone.KS_ARCH_ARM, keystone.KS_MODE_THUMB)

def generate(code, addr):

# 参数2是地址，很多指令是地址相关的，比如 B 指令，如果地址无关直接传 0 即可，比如 nop。

encoding, _ = ks.asm(code, addr)

return encoding

def get_opcode(machine_code, code_address):

#利用capstone反汇编代码

assembly = []

for insn in cs.disasm(machine_code, code_address):

if insn.mnemonic != "":

assembly.append(insn.mnemonic)

assembly.append(insn.op_str)

return assembly

def patch_b(addr, target_addr):

code = f"B {hex(target_addr)}"

bCode = generate(code, addr)

# 此处本意是在获取到真实跳转地址后立即patch 后在执行过程中发现

# 当前面被patch后会影响后面其他位置真实位置的计算故作罢

if (bCode != None):

#ida_bytes.patch_bytes(addr, bytes(bCode))

# print("patch:", hex(addr)," code:",code)

print(hex(addr)+“|”+code)

def patch(addr):

if idc.get_wide_word(addr) == 0xb503: # PUSH {R0,R1,LR}

addr_ = addr + 2

if idc.get_wide_word(addr_) == 0x4801: # ldr r0, [pc, #4] 从pc+4处取出

lr = ""

jump_code_addr = addr_ + 4 + 4

if (jump_code_addr % 4 == 2):

jump_code_addr = jump_code_addr - 2 # 做四字节对其

jump_code = idc.get_wide_dword(jump_code_addr)

# 开始判断ldr r0, [pc, #4]的下一句是否为BL

addr_ = addr_ + 2

code = idc.get_wide_dword(addr_).to_bytes(4, byteorder='little') # 小端序取出四个字节

opcode = get_opcode(code, addr_)

if len(opcode) != 0 and opcode[0] == 'bl': # 判断是否跳转语句

jump_addr_1 = int(opcode[1][1:], 16)

code = idc.get_wide_dword(jump_addr_1).to_bytes(4, byteorder='little')

opcode = get_opcode(code, jump_addr_1)

if len(opcode) != 0 and opcode[0] == 'bl': # 判断是否有二次跳转

jump_addr_2 = int(opcode[1][1:], 16)

code = idc.get_wide_dword(jump_addr_2).to_bytes(4, byteorder='little')

opcode = get_opcode(code, jump_addr_2)

if len(opcode) != 0 and opcode[0] == 'bx':

lr = jump_addr_1 + 4

elif len(opcode) != 0 and opcode[0] == 'bx':

lr = addr_ + 4

# print("lr:"+lr)

if lr != "":

r1 = idc.get_wide_dword(lr + (jump_code << 2))

real_jump_addr = lr + r1

patch_b(addr, real_jump_addr)

if __name__ == '__main__':

for i in range(0x9EB8, 0x218Acc): #patch 整个.data段

patch(i)

0xe556|B 0x17a82

0xeb02|B 0x16ca8

0xeb48|B 0x14972
...

0x1c03b2|B 0x1c0312

0x1c03d4|B 0x1c020c

0xe556|B 0x17a82

0xeb02|B 0x16ca8

0xeb48|B 0x14972

...

0x1c03b2|B 0x1c0312

0x1c03d4|B 0x1c020c

from keystone import *
 
ks = Ks(keystone.KS_ARCH_ARM, keystone.KS_MODE_THUMB)
 
def generate(code, addr):

    encoding, _ = ks.asm(code, addr)

    return encoding
 
def patch_b(code, addr):

    bCode = generate(code, addr)

    if (bCode != None):

        ida_bytes.patch_bytes(addr, bytes(bCode))
 
if __name__ == '__main__':

    with open('patch.txt', 'r') as file:

        for line in file:

            parts = line.split('|')

            addr = parts[0]

            code = parts[1].rstrip('\n')

            patch_b(code,int(addr,16))

from keystone import *

ks = Ks(keystone.KS_ARCH_ARM, keystone.KS_MODE_THUMB)

def generate(code, addr):

encoding, _ = ks.asm(code, addr)

return encoding

def patch_b(code, addr):

bCode = generate(code, addr)

if (bCode != None):

ida_bytes.patch_bytes(addr, bytes(bCode))

if __name__ == '__main__':

with open('patch.txt', 'r') as file:

for line in file:

parts = line.split('|')

addr = parts[0]

code = parts[1].rstrip('\n')

patch_b(code,int(addr,16))

src/main/java/com/github/unidbg/unix/UnixSyscallHandler.java
将类中gettimeofday()方法中的取时间固定
//   long currentTimeMillis = System.currentTimeMillis();

     long currentTimeMillis = 0000000000000L;

      
当时间固定后 发现结果也随之固定了
0000: 35 32 36 36 42 42 39 42 46 36 35 42 43 35 33 45    5266BB9BF65BC53E
0010: 46 32 33 39 39 44 33 41 30 44 30 30 42 44 35 44    F2399D3A0D00BD5D
0020: 30 30 30 30 30 30 30 30 31 32 43 34 31 41 46 36    0000000012C41AF6
0030: 34 35 43 42 30 32 44 38 35 38 44 33 45 42 44 35    45CB02D858D3EBD5
0040: 46 46 44 38 32 45 35 44 31 35 30                   FFD82E5D150
观察到结果的33-40位居然是00000000 这会跟上面固定的时间有关吗
 
再次改变时间：

    long currentTimeMillis = 1234567898765L;
 
0000: 32 44 41 43 41 37 41 31 36 43 44 44 41 34 31 37    2DACA7A16CDDA417
0010: 33 37 30 43 31 45 42 39 43 43 34 30 34 45 44 32    370C1EB9CC404ED2
0020: 44 41 30 32 39 36 34 39 31 32 43 43 42 31 33 43    DA02964912CCB13C
0030: 41 46 34 36 43 45 31 37 32 38 33 38 45 44 32 46    AF46CE172838ED2F
0040: 36 33 31 36 38 39 37 44 37 34 30                   6316897D740
 
发现这次结果的33-40位变成了DA029649 把他转为小端序再转16进制 发现是1234567898
即十位数的时间戳
 
综上可知 结果的33-40位是十位数的时间戳 41-43位则是固定的12c

          

src/main/java/com/github/unidbg/unix/UnixSyscallHandler.java

将类中gettimeofday()方法中的取时间固定

// long currentTimeMillis = System.currentTimeMillis();

long currentTimeMillis = 0000000000000L;

当时间固定后发现结果也随之固定了

0000: 35 32 36 36 42 42 39 42 46 36 35 42 43 35 33 45 5266BB9BF65BC53E

0010: 46 32 33 39 39 44 33 41 30 44 30 30 42 44 35 44 F2399D3A0D00BD5D

0020: 30 30 30 30 30 30 30 30 31 32 43 34 31 41 46 36 0000000012C41AF6

0030: 34 35 43 42 30 32 44 38 35 38 44 33 45 42 44 35 45CB02D858D3EBD5

0040: 46 46 44 38 32 45 35 44 31 35 30 FFD82E5D150

观察到结果的33-40位居然是00000000 这会跟上面固定的时间有关吗

再次改变时间：

long currentTimeMillis = 1234567898765L;

0000: 32 44 41 43 41 37 41 31 36 43 44 44 41 34 31 37 2DACA7A16CDDA417

0010: 33 37 30 43 31 45 42 39 43 43 34 30 34 45 44 32 370C1EB9CC404ED2

0020: 44 41 30 32 39 36 34 39 31 32 43 43 42 31 33 43 DA02964912CCB13C

0030: 41 46 34 36 43 45 31 37 32 38 33 38 45 44 32 46 AF46CE172838ED2F

0040: 36 33 31 36 38 39 37 44 37 34 30 6316897D740

发现这次结果的33-40位变成了DA029649 把他转为小端序再转16进制发现是1234567898

即十位数的时间戳

综上可知结果的33-40位是十位数的时间戳 41-43位则是固定的12c

// 截取部分

private static long II(long a, long b, long c, long d, long x, long s,

                       long ac) {

    a += (I(b, c, d)&0xFFFFFFFFL) + x + ac;

    a = ((a&0xFFFFFFFFL) << s) | ((a&0xFFFFFFFFL) >>> (32 - s));

    a += b;

    return (a&0xFFFFFFFFL);
}

private static long I(long x, long y, long z) {

    return y ^ (x | (~z));
}
 
由md5的最后一轮计算 可知

a = II(a, b, c, d, M4, 6, t[60])        //a = b+((a+I(b,c,d)+M4 +t[60])<<<6)   <<<6表示循环左移6

b = II(d, a, b, c, M11, 10, t[61])      //b = a+((d+I(a,b,c)+M11+t[61])<<<10)

c = II(c, d, a, b, M2, 15, t[62])       //c = d+((c+I(d,a,b)+M2 +t[62])<<<15)

d = II(b, c, d, a, M9, 21, t[63])       //d = c+((b+I(c,d,a)+M9 +t[63])<<<15)
 
因为逻辑运算左右移的互补关系 左移(x) = 右移(32-x)
 
把这些计算代入到上方ida中的伪c代码 流程十分的契合 结果的前32位大概率就是md5哈希校验了

// 截取部分

private static long II(long a, long b, long c, long d, long x, long s,

long ac) {

a += (I(b, c, d)&0xFFFFFFFFL) + x + ac;

a = ((a&0xFFFFFFFFL) << s) | ((a&0xFFFFFFFFL) >>> (32 - s));

a += b;

return (a&0xFFFFFFFFL);

}

private static long I(long x, long y, long z) {

return y ^ (x | (~z));

}

由md5的最后一轮计算可知

a = II(a, b, c, d, M4, 6, t[60]) //a = b+((a+I(b,c,d)+M4 +t[60])<<<6) <<<6表示循环左移6

b = II(d, a, b, c, M11, 10, t[61]) //b = a+((d+I(a,b,c)+M11+t[61])<<<10)

c = II(c, d, a, b, M2, 15, t[62]) //c = d+((c+I(d,a,b)+M2 +t[62])<<<15)

d = II(b, c, d, a, M9, 21, t[63]) //d = c+((b+I(c,d,a)+M9 +t[63])<<<15)

因为逻辑运算左右移的互补关系左移(x) = 右移(32-x)

把这些计算代入到上方ida中的伪c代码流程十分的契合结果的前32位大概率就是md5哈希校验了

// 第一轮

a = FF(a, b, c, d, M0, 7, 0xd76aa478)

b = FF(d, a, b, c, M1, 12, 0xe8c7b756)

c = FF(c, d, a, b, M2, 17, 0x242070db)

d = FF(b, c, d, a, M3, 22, 0xc1bdceee)

a = FF(a, b, c, d, M4, 7, 0xf57c0faf)

b = FF(d, a, b, c, M5, 12, 0x4787c62a)

c = FF(c, d, a, b, M6, 17, 0xa8304613)

d = FF(b, c, d, a, M7, 22, 0xfd469501)

a = FF(a, b, c, d, M8, 7, 0x698098d8)

b = FF(d, a, b, c, M9, 12, 0x8b44f7af)

c = FF(c, d, a, b, M10, 17, 0xffff5bb1)

d = FF(b, c, d, a, M11, 22, 0x895cd7be)

a = FF(a, b, c, d, M12, 7, 0x6b901122)

b = FF(d, a, b, c, M13, 12, 0xfd987193)

c = FF(c, d, a, b, M14, 17, 0xa679438e)

d = FF(b, c, d, a, M15, 22, 0x49b40821)
 
private static long F(long x, long y, long z) {

    return (x & y) | ((~x) & z);
}
 
// 其中 Mj表示明文块j
FF = a=b+((a+F(b,c,d)+Mj+ti)<<<s)
只需要找到Mj 即找到了明文

// 第一轮

a = FF(a, b, c, d, M0, 7, 0xd76aa478)

b = FF(d, a, b, c, M1, 12, 0xe8c7b756)

c = FF(c, d, a, b, M2, 17, 0x242070db)

d = FF(b, c, d, a, M3, 22, 0xc1bdceee)

a = FF(a, b, c, d, M4, 7, 0xf57c0faf)

b = FF(d, a, b, c, M5, 12, 0x4787c62a)

c = FF(c, d, a, b, M6, 17, 0xa8304613)

d = FF(b, c, d, a, M7, 22, 0xfd469501)

a = FF(a, b, c, d, M8, 7, 0x698098d8)

b = FF(d, a, b, c, M9, 12, 0x8b44f7af)

c = FF(c, d, a, b, M10, 17, 0xffff5bb1)

d = FF(b, c, d, a, M11, 22, 0x895cd7be)

a = FF(a, b, c, d, M12, 7, 0x6b901122)

b = FF(d, a, b, c, M13, 12, 0xfd987193)

c = FF(c, d, a, b, M14, 17, 0xa679438e)

d = FF(b, c, d, a, M15, 22, 0x49b40821)

private static long F(long x, long y, long z) {

return (x & y) | ((~x) & z);

}

// 其中 Mj表示明文块j

FF = a=b+((a+F(b,c,d)+Mj+ti)<<<s)

只需要找到Mj 即找到了明文

"ldr r4, [pc, #0x358]" => r4=0x1fdf9fdf 

"ldr r4, [pc, #0x358]" => r4=0x97571757 

"ldr r4, [pc, #0x35c]" => r4=0x68a8e8a8 
 
发现参与计算的几个数都是直接通过pc指针+偏移进行读取 为固定值 猜测这就是md5的初始模数
 
0x120360 "eors r4, r5" r4=0x1fdf9fdf r5=0x97571757 => r4=0x88888888
 
0x120364 "ands r4, r1" r4=0x88888888 r1=0x68a8e8a8 => r4=0x8888888
 
0x120366 "eors r4, r5" r4=0x8888888 r5=0x97571757 => r4=0x9fdf9fdf
 
(b & c) ^ ((~b) & d) => [(c ^ d) & b] ^ d 
发现上述计算等价于F(b,c,d) 
至此 确定了md5计算的初始模数a,b,c,d

A = 0xe0206020

B = 0x68a8e8a8

C = 0x1fdf9fdf

D = 0x97571757

"ldr r4, [pc, #0x358]" => r4=0x1fdf9fdf

"ldr r4, [pc, #0x358]" => r4=0x97571757

"ldr r4, [pc, #0x35c]" => r4=0x68a8e8a8

发现参与计算的几个数都是直接通过pc指针+偏移进行读取 为固定值 猜测这就是md5的初始模数

0x120360 "eors r4, r5" r4=0x1fdf9fdf r5=0x97571757 => r4=0x88888888

0x120364 "ands r4, r1" r4=0x88888888 r1=0x68a8e8a8 => r4=0x8888888

0x120366 "eors r4, r5" r4=0x8888888 r5=0x97571757 => r4=0x9fdf9fdf

(b & c) ^ ((~b) & d) => [(c ^ d) & b] ^ d

发现上述计算等价于F(b,c,d)

至此确定了md5计算的初始模数a,b,c,d

A = 0xe0206020

B = 0x68a8e8a8

C = 0x1fdf9fdf

D = 0x97571757

00000000  30 30 30 30 30 30 30 30 5a 62 30 61 64 76 4a 32  |00000000Zb0advJ2|
00000010  52 74 43 69 6a 4e 72 38 64 55 70 49 35 31 50 5a  |RtCijNr8dUpI51PZ|
00000020  6e 68 62 55 4b 67 33 57 4c 4d 68 4f 6d 53 33 41  |nhbUKg3WLMhOmS3A|
00000030  75 55 50 55 78 6b 39 35 68 77 37 68 64 4a 79 34  |uUPUxk95hw7hdJy4|
00000040  63 4a 72 4c 70 4d 50 64 78 54 72 49 68 31 67 37  |cJrLpMPdxTrIh1g7|
00000050  79 4c 50 75 48 44 4a 61 78 6b 36 6c 51 4d 71 6d  |yLPuHDJaxk6lQMqm|
00000060  80 76 79 31 00 00 00 00 00 00 00 00 00 00 00 00  |.vy1............|
00000070  00 00 00 00 00 00 00 00 00 00 03 18 00 00 00 00  |................|
 
转为大端序
00000000  30 30 30 30 30 30 30 30 61 30 62 5a 32 4a 76 64  |00000000a0bZ2Jvd|
00000010  69 43 74 52 38 72 4e 6a 49 70 55 64 5a 50 31 35  |iCtR8rNjIpUdZP15|
00000020  55 62 68 6e 57 33 67 4b 4f 68 4d 4c 41 33 53 6d  |UbhnW3gKOhMLA3Sm|
00000030  55 50 55 75 35 39 6b 78 68 37 77 68 34 79 4a 64  |UPUu59kxh7wh4yJd|
00000040  4c 72 4a 63 64 50 4d 70 49 72 54 78 37 67 31 68  |LrJcdPMpIrTx7g1h|
00000050  75 50 4c 79 61 4a 44 48 6c 36 6b 78 6d 71 4d 51  |uPLyaJDHl6kxmqMQ|
00000060  31 79 76 80 00 00 00 00 00 00 00 00 00 00 00 00  |1yv.............|
00000070  00 00 00 00 00 00 00 00 18 03 00 00 00 00 00 00  |................|
 
发现完美符合md5明文拓展后的特征(明文以0x80结尾,倒数第8字节存放明文长度*8)

00000000 30 30 30 30 30 30 30 30 5a 62 30 61 64 76 4a 32 |00000000Zb0advJ2|

00000010 52 74 43 69 6a 4e 72 38 64 55 70 49 35 31 50 5a |RtCijNr8dUpI51PZ|

00000020 6e 68 62 55 4b 67 33 57 4c 4d 68 4f 6d 53 33 41 |nhbUKg3WLMhOmS3A|

00000030 75 55 50 55 78 6b 39 35 68 77 37 68 64 4a 79 34 |uUPUxk95hw7hdJy4|

00000040 63 4a 72 4c 70 4d 50 64 78 54 72 49 68 31 67 37 |cJrLpMPdxTrIh1g7|

00000050 79 4c 50 75 48 44 4a 61 78 6b 36 6c 51 4d 71 6d |yLPuHDJaxk6lQMqm|

00000060 80 76 79 31 00 00 00 00 00 00 00 00 00 00 00 00 |.vy1............|

00000070 00 00 00 00 00 00 00 00 00 00 03 18 00 00 00 00 |................|

转为大端序

00000000 30 30 30 30 30 30 30 30 61 30 62 5a 32 4a 76 64 |00000000a0bZ2Jvd|

00000010 69 43 74 52 38 72 4e 6a 49 70 55 64 5a 50 31 35 |iCtR8rNjIpUdZP15|

00000020 55 62 68 6e 57 33 67 4b 4f 68 4d 4c 41 33 53 6d |UbhnW3gKOhMLA3Sm|

00000030 55 50 55 75 35 39 6b 78 68 37 77 68 34 79 4a 64 |UPUu59kxh7wh4yJd|

00000040 4c 72 4a 63 64 50 4d 70 49 72 54 78 37 67 31 68 |LrJcdPMpIrTx7g1h|

00000050 75 50 4c 79 61 4a 44 48 6c 36 6b 78 6d 71 4d 51 |uPLyaJDHl6kxmqMQ|

00000060 31 79 76 80 00 00 00 00 00 00 00 00 00 00 00 00 |1yv.............|

00000070 00 00 00 00 00 00 00 00 18 03 00 00 00 00 00 00 |................|

发现完美符合md5明文拓展后的特征(明文以0x80结尾,倒数第8字节存放明文长度*8)

0x500fe759L /* 1 */
0x6fa2f477L /* 2 */
0xa34533faL /* 3 */
...
0xadb2919aL /* 63 */
0x6ce390b0L /* 64 */

0x500fe759L /* 1 */

0x6fa2f477L /* 2 */

0xa34533faL /* 3 */

...

0xadb2919aL /* 63 */

0x6ce390b0L /* 64 */

public class magicMD5 {
 
    static final String[] hexs = {"0", "1", "2", "3", "4", "5", "6", "7", "8", "9", "A", "B", "C", "D", "E", "F"};
 
    private static final long A = 0xe0206020L;

    private static final long B = 0x68a8e8a8L;

    private static final long C = 0x1fdf9fdfL;

    private static final long D = 0x97571757L;
 
    //下面这些S11-S44实际上是一个4*4的矩阵，在四轮循环运算中用到

    static final int S11 = 7;

    static final int S12 = 12;

    static final int S13 = 17;

    static final int S14 = 22;
 
    static final int S21 = 5;

    static final int S22 = 9;

    static final int S23 = 14;

    static final int S24 = 20;
 
    static final int S31 = 4;

    static final int S32 = 11;

    static final int S33 = 16;

    static final int S34 = 23;
 
    static final int S41 = 6;

    static final int S42 = 10;

    static final int S43 = 15;

    static final int S44 = 21;
 
    //java不支持无符号的基本数据（unsigned）

    private long[] result = {A, B, C, D};//存储hash结果，共4×32=128位，初始化值为（幻数的级联）
 
    public static void main(String[] args) {

        magicMD5 md = new magicMD5();

        System.out.println(md.digest("30303030303030306130625a324a76646943745238724e6a497055645a5031355562686e5733674b4f684d4c4133536d5550557535396b786837776834794a644c724a6364504d70497254783767316875504c79614a44486c366b786d714d51317976"));

    }
 
    private String digest(String inputHexStr) {
 
        byte[] inputBytes = hexToByteArray(inputHexStr);

        int byteLen = inputBytes.length;//长度（字节）

        int groupCount = 0;//完整分组的个数

        groupCount = byteLen / 64;//每组512位（64字节）

        long[] groups = null;//每个小组(64字节)再细分后的16个小组(4字节)
 
        //处理每一个完整 分组

        for (int step = 0; step < groupCount; step++) {

            groups = divGroup(inputBytes, step * 64);

            trans(groups);//处理分组，核心算法

        }
 
        //处理完整分组后的尾巴

        int rest = byteLen % 64;//512位分组后的余数

        byte[] tempBytes = new byte[64];

        if (rest <= 56) {

            for (int i = 0; i < rest; i++)

            tempBytes[i] = inputBytes[byteLen - rest + i];

            if (rest < 56) {

                tempBytes[rest] = (byte) (1 << 7);

                for (int i = 1; i < 56 - rest; i++)

                tempBytes[rest + i] = 0;

            }

            long len = (long) (byteLen << 3);

            for (int i = 0; i < 8; i++) {

                tempBytes[56 + i] = (byte) (len & 0xFFL);

                len = len >> 8;

            }

            groups = divGroup(tempBytes, 0);

            trans(groups);//处理分组

        } else {

            for (int i = 0; i < rest; i++)

            tempBytes[i] = inputBytes[byteLen - rest + i];

            tempBytes[rest] = (byte) (1 << 7);

            for (int i = rest + 1; i < 64; i++)

            tempBytes[i] = 0;

            groups = divGroup(tempBytes, 0);

            trans(groups);//处理分组
 
            for (int i = 0; i < 56; i++)

            tempBytes[i] = 0;

            long len = (long) (byteLen << 3);

            for (int i = 0; i < 8; i++) {

                tempBytes[56 + i] = (byte) (len & 0xFFL);

                len = len >> 8;

            }

            groups = divGroup(tempBytes, 0);

            trans(groups);//处理分组

        }
 
        //将Hash值转换成十六进制的字符串

        String resStr = "";

        long temp = 0;

        for (int i = 0; i < 4; i++) {

            for (int j = 0; j < 4; j++) {

                temp = result[i] & 0x0FL;

                String a = hexs[(int) (temp)];

                result[i] = result[i] >> 4;

                temp = result[i] & 0x0FL;

                resStr += hexs[(int) (temp)] + a;

                result[i] = result[i] >> 4;

            }

        }

        return resStr;

    }
 
    /**

     * 从inputBytes的index开始取512位，作为新的分组

     * 将每一个512位的分组再细分成16个小组，每个小组64位（8个字节）

     *

     * @param inputBytes

     * @param index

     * @return

     */

    private static long[] divGroup(byte[] inputBytes, int index) {

        long[] temp = new long[16];

        for (int i = 0; i < 16; i++) {

            temp[i] = b2iu(inputBytes[4 * i + index]) |

                    (b2iu(inputBytes[4 * i + 1 + index])) << 8 |

                    (b2iu(inputBytes[4 * i + 2 + index])) << 16 |

                    (b2iu(inputBytes[4 * i + 3 + index])) << 24;

        }

        return temp;

    }
 
    /**

     * 这时不存在符号位（符号位存储不再是代表正负），所以需要处理一下

     *

     * @param b

     * @return

     */

    public static long b2iu(byte b) {

        return b < 0 ? b & 0x7F + 128 : b;

    }
 
    private void trans(long[] groups) {

        long a = result[0], b = result[1], c = result[2], d = result[3];

        /*第一轮*/

        a = FF(a, b, c, d, groups[0], S11,  0x500fe759L); /* 1 */

        d = FF(d, a, b, c, groups[1], S12,  0x6fa2f477L); /* 2 */

        c = FF(c, d, a, b, groups[2], S13,  0xa34533faL); /* 3 */

        b = FF(b, c, d, a, groups[3], S14,  0x46d88dcfL); /* 4 */

        a = FF(a, b, c, d, groups[4], S11,  0x72194c8eL); /* 5 */

        d = FF(d, a, b, c, groups[5], S12,  0xc0e2850bL); /* 6 */

        c = FF(c, d, a, b, groups[6], S13,  0x2f550532L); /* 7 */

        b = FF(b, c, d, a, groups[7], S14,  0x7a23d620L); /* 8 */

        a = FF(a, b, c, d, groups[8], S11,  0xeee5dbf9L); /* 9 */

        d = FF(d, a, b, c, groups[9], S12,  0x0c21b48eL); /* 10 */

        c = FF(c, d, a, b, groups[10], S13, 0x789a1890L); /* 11 */

        b = FF(b, c, d, a, groups[11], S14, 0x0e39949fL); /* 12 */

        a = FF(a, b, c, d, groups[12], S11, 0xecf55203L); /* 13 */

        d = FF(d, a, b, c, groups[13], S12, 0x7afd32b2L); /* 14 */

        c = FF(c, d, a, b, groups[14], S13, 0x211c00afL); /* 15 */

        b = FF(b, c, d, a, groups[15], S14, 0xced14b00L); /* 16 */
 
        /*第二轮*/

        a = GG(a, b, c, d, groups[1], S21,  0x717b6643L); /* 17 */

        d = GG(d, a, b, c, groups[6], S22,  0x4725f061L); /* 18 */

        c = GG(c, d, a, b, groups[11], S23, 0xa13b1970L); /* 19 */

        b = GG(b, c, d, a, groups[0], S24,  0x6ed3848bL); /* 20 */

        a = GG(a, b, c, d, groups[5], S21,  0x514a537cL); /* 21 */

        d = GG(d, a, b, c, groups[10], S22, 0x85215772L); /* 22 */

        c = GG(c, d, a, b, groups[15], S23, 0x5fc4a5a0L); /* 23 */

        b = GG(b, c, d, a, groups[4], S24,  0x60b6b8e9L); /* 24 */

        a = GG(a, b, c, d, groups[9], S21,  0xa6848ec7L); /* 25 */

        d = GG(d, a, b, c, groups[14], S22, 0x445244f7L); /* 26 */

        c = GG(c, d, a, b, groups[3], S23,  0x73b04ea6L); /* 27 */

        b = GG(b, c, d, a, groups[8], S24,  0xc23f57ccL); /* 28 */

        a = GG(a, b, c, d, groups[13], S21, 0x2e86aa24L); /* 29 */

        d = GG(d, a, b, c, groups[2], S22,  0x7b8ae0d9L); /* 30 */

        c = GG(c, d, a, b, groups[7], S23,  0xe00a41f8L); /* 31 */

        b = GG(b, c, d, a, groups[12], S24, 0x0a4f0fabL); /* 32 */
 
        /*第三轮*/

        a = HH(a, b, c, d, groups[5], S31,  0x789f7a63L); /* 33 */

        d = HH(d, a, b, c, groups[8], S32,  0x0014b5a0L); /* 34 */

        c = HH(c, d, a, b, groups[11], S33, 0xeaf82203L); /* 35 */

        b = HH(b, c, d, a, groups[14], S34, 0x7a807b2dL); /* 36 */

        a = HH(a, b, c, d, groups[1], S31,  0x23dba965L); /* 37 */

        d = HH(d, a, b, c, groups[4], S32,  0xccbb8c88L); /* 38 */

        c = HH(c, d, a, b, groups[7], S33,  0x71de0841L); /* 39 */

        b = HH(b, c, d, a, groups[10], S34, 0x39daff51L); /* 40 */

        a = HH(a, b, c, d, groups[13], S31, 0xaffe3de7L); /* 41 */

        d = HH(d, a, b, c, groups[0], S32,  0x6dc464dbL); /* 42 */

        c = HH(c, d, a, b, groups[3], S33,  0x538a73a4L); /* 43 */

        b = HH(b, c, d, a, groups[6], S34,  0x83ed5e24L); /* 44 */

        a = HH(a, b, c, d, groups[9], S31,  0x5eb19318L); /* 45 */

        d = HH(d, a, b, c, groups[12], S32, 0x61bedac4L); /* 46 */

        c = HH(c, d, a, b, groups[15], S33, 0x98c73fd9L); /* 47 */

        b = HH(b, c, d, a, groups[2], S34,  0x43c91544L); /* 48 */
 
        /*第四轮*/

        a = II(a, b, c, d, groups[0], S41,  0x734c6165L); /* 49 */

        d = II(d, a, b, c, groups[7], S42,  0xc44fbcb6L); /* 50 */

        c = II(c, d, a, b, groups[14], S43, 0x2cf16086L); /* 51 */

        b = II(b, c, d, a, groups[5], S44,  0x7bf6e318L); /* 52 */

        a = II(a, b, c, d, groups[12], S41, 0xe23e1ae2L); /* 53 */

        d = II(d, a, b, c, groups[3], S42,  0x08698fb3L); /* 54 */

        c = II(c, d, a, b, groups[10], S43, 0x788ab75cL); /* 55 */

        b = II(b, c, d, a, groups[1], S44,  0x02e11ef0L); /* 56 */

        a = II(a, b, c, d, groups[8], S41,  0xe8cd3d6eL); /* 57 */

        d = II(d, a, b, c, groups[15], S42, 0x7949a5c1L); /* 58 */

        c = II(c, d, a, b, groups[6], S43,  0x24640035L); /* 59 */

        b = II(b, c, d, a, groups[13], S44, 0xc96d5280L); /* 60 */

        a = II(a, b, c, d, groups[4], S41,  0x70363da3L); /* 61 */

        d = II(d, a, b, c, groups[11], S42, 0x3a5fb114L); /* 62 */

        c = II(c, d, a, b, groups[2], S43,  0xadb2919aL); /* 63 */

        b = II(b, c, d, a, groups[9], S44,  0x6ce390b0L); /* 64 */
 
        /*加入到之前计算的结果当中*/

        result[0] += a;

        result[1] += b;

        result[2] += c;

        result[3] += d;

        result[0] = result[0] & 0xFFFFFFFFL;

        result[1] = result[1] & 0xFFFFFFFFL;

        result[2] = result[2] & 0xFFFFFFFFL;

        result[3] = result[3] & 0xFFFFFFFFL;

    }
 
    /**

     * 下面是处理要用到的线性函数

     */

    private static long F(long x, long y, long z) {

        return (x & y) | ((~x) & z);

    }
 
    private static long G(long x, long y, long z) {

        return (x & z) | (y & (~z));

    }
 
    private static long H(long x, long y, long z) {

        return x ^ y ^ z;

    }
 
    private static long I(long x, long y, long z) {

        return y ^ (x | (~z));

    }
 
    private static long FF(long a, long b, long c, long d, long x, long s,

                           long ac) {

        a += (F(b, c, d) & 0xFFFFFFFFL) + x + ac;

        a = ((a & 0xFFFFFFFFL) << s) | ((a & 0xFFFFFFFFL) >>> (32 - s));

        a += b;

        return (a & 0xFFFFFFFFL);

    }
 
    private static long GG(long a, long b, long c, long d, long x, long s,

                           long ac) {

        a += (G(b, c, d) & 0xFFFFFFFFL) + x + ac;

        a = ((a & 0xFFFFFFFFL) << s) | ((a & 0xFFFFFFFFL) >>> (32 - s));

        a += b;

        return (a & 0xFFFFFFFFL);

    }
 
    private static long HH(long a, long b, long c, long d, long x, long s,

                           long ac) {

        a += (H(b, c, d) & 0xFFFFFFFFL) + x + ac;

        a = ((a & 0xFFFFFFFFL) << s) | ((a & 0xFFFFFFFFL) >>> (32 - s));

        a += b;

        return (a & 0xFFFFFFFFL);

    }
 
    private static long II(long a, long b, long c, long d, long x, long s,

                           long ac) {

        a += (I(b, c, d) & 0xFFFFFFFFL) + x + ac;

        a = ((a & 0xFFFFFFFFL) << s) | ((a & 0xFFFFFFFFL) >>> (32 - s));

        a += b;

        return (a & 0xFFFFFFFFL);

    }
 
    private static byte hexToByte(String inHex){

        return (byte)Integer.parseInt(inHex,16);

    }

    public static byte[] hexToByteArray(String inHex){

        int hexlen = inHex.length();

        byte[] result;

        if (hexlen % 2 == 1){

            //奇数

            hexlen++;

            result = new byte[(hexlen/2)];

            inHex="0"+inHex;

        }else {

            //偶数

            result = new byte[(hexlen/2)];

        }

        int j=0;

        for (int i = 0; i < hexlen; i+=2){

            result[j]=hexToByte(inHex.substring(i,i+2));

            j++;

        }

        return result;

    }
 
}

public class magicMD5 {

static final String[] hexs = {"0", "1", "2", "3", "4", "5", "6", "7", "8", "9", "A", "B", "C", "D", "E", "F"};

private static final long A = 0xe0206020L;

private static final long B = 0x68a8e8a8L;

private static final long C = 0x1fdf9fdfL;

private static final long D = 0x97571757L;

//下面这些S11-S44实际上是一个4*4的矩阵，在四轮循环运算中用到

static final int S11 = 7;

static final int S12 = 12;

static final int S13 = 17;

static final int S14 = 22;

static final int S21 = 5;

static final int S22 = 9;

static final int S23 = 14;

static final int S24 = 20;

static final int S31 = 4;

static final int S32 = 11;

static final int S33 = 16;

static final int S34 = 23;

static final int S41 = 6;

static final int S42 = 10;

static final int S43 = 15;

static final int S44 = 21;

//java不支持无符号的基本数据（unsigned）

private long[] result = {A, B, C, D};//存储hash结果，共4×32=128位，初始化值为（幻数的级联）

public static void main(String[] args) {

magicMD5 md = new magicMD5();

System.out.println(md.digest(

"30303030303030306130625a324a76646943745238724e6a497055645a5031355562686e5733674b4f684d4c4133536d5550557535396b786837776834794a644c724a6364504d70497254783767316875504c79614a44486c366b786d714d51317976"

));

}

private String digest(String inputHexStr) {

byte[] inputBytes = hexToByteArray(inputHexStr);

int byteLen = inputBytes.length;//长度（字节）

int groupCount = 0;//完整分组的个数

groupCount = byteLen / 64;//每组512位（64字节）

long[] groups = null;//每个小组(64字节)再细分后的16个小组(4字节)

//处理每一个完整分组

for (int step = 0; step < groupCount; step++) {

groups = divGroup(inputBytes, step * 64);

trans(groups);//处理分组，核心算法

}

//处理完整分组后的尾巴

int rest = byteLen % 64;//512位分组后的余数

byte[] tempBytes = new byte[64];

if (rest <= 56) {

for (int i = 0; i < rest; i++)

tempBytes[i] = inputBytes[byteLen - rest + i];

if (rest < 56) {

tempBytes[rest] = (byte) (1 << 7);

for (int i = 1; i < 56 - rest; i++)

tempBytes[rest + i] = 0;

}

long len = (long) (byteLen << 3);

for (int i = 0; i < 8; i++) {

tempBytes[56 + i] = (byte) (len & 0xFFL);

len = len >> 8;

}

groups = divGroup(tempBytes, 0);

trans(groups);//处理分组

} else {

for (int i = 0; i < rest; i++)

tempBytes[i] = inputBytes[byteLen - rest + i];

tempBytes[rest] = (byte) (1 << 7);

for (int i = rest + 1; i < 64; i++)

tempBytes[i] = 0;

groups = divGroup(tempBytes, 0);

trans(groups);//处理分组

for (int i = 0; i < 56; i++)

tempBytes[i] = 0;

long len = (long) (byteLen << 3);

for (int i = 0; i < 8; i++) {

tempBytes[56 + i] = (byte) (len & 0xFFL);

len = len >> 8;

}

groups = divGroup(tempBytes, 0);

trans(groups);//处理分组

}

//将Hash值转换成十六进制的字符串

String resStr = "";

long temp = 0;

for (int i = 0; i < 4; i++) {

for (int j = 0; j < 4; j++) {

temp = result[i] & 0x0FL;

String a = hexs[(int) (temp)];

result[i] = result[i] >> 4;

temp = result[i] & 0x0FL;

resStr += hexs[(int) (temp)] + a;

result[i] = result[i] >> 4;

}

return resStr;

}

/**

* 从inputBytes的index开始取512位，作为新的分组

* 将每一个512位的分组再细分成16个小组，每个小组64位（8个字节）

*

* @param inputBytes

* @param index

* @return

*/

private static long[] divGroup(byte[] inputBytes, int index) {

long[] temp = new long[16];

for (int i = 0; i < 16; i++) {

temp[i] = b2iu(inputBytes[4 * i + index]) |

(b2iu(inputBytes[4 * i + 1 + index])) << 8 |

(b2iu(inputBytes[4 * i + 2 + index])) << 16 |

(b2iu(inputBytes[4 * i + 3 + index])) << 24;

}

return temp;

}

/**

* 这时不存在符号位（符号位存储不再是代表正负），所以需要处理一下

*

* @param b

* @return

*/

public static long b2iu(byte b) {

return b < 0 ? b & 0x7F + 128 : b;

}

private void trans(long[] groups) {

long a = result[0], b = result[1], c = result[2], d = result[3];

/*第一轮*/

a = FF(a, b, c, d, groups[0], S11, 0x500fe759L); /* 1 */

d = FF(d, a, b, c, groups[1], S12, 0x6fa2f477L); /* 2 */

c = FF(c, d, a, b, groups[2], S13, 0xa34533faL); /* 3 */

b = FF(b, c, d, a, groups[3], S14, 0x46d88dcfL); /* 4 */

登录后可查看完整内容

[培训]内核驱动高级班，冲击BAT一流互联网大厂工作，每周日13:00-18:00直播授课

最后于 2024-10-1 13:17 被劫__编辑，原因：

#逆向分析 #NDK分析 #混淆加固

收藏・43

免费・12

支持

最新回复 (20)
你瞒我瞒雪币： 2442 活跃值： (10708) 能力值： ( LV2，RANK：10 ) 在线值：发帖 4 回帖 254 粉丝 2 关注私信	你瞒我瞒 2 楼厉害，小白表示看不懂 2024-9-23 09:22 0
肉蚌葱鸡雪币： 20 活跃值： (993) 能力值： ( LV2，RANK：10 ) 在线值：发帖 1 回帖 16 粉丝 5 关注私信	肉蚌葱鸡 3 楼 2024-9-23 09:23 0
mb_ldbucrik 雪币： 10 能力值： ( LV1，RANK：0 ) 在线值：发帖 0 回帖 226 粉丝 2 关注私信	mb_ldbucrik 4 楼学习了，感谢楼主 2024-9-23 09:26 0
mb_lpcoesnt 雪币： 1363 活跃值： (1994) 能力值： ( LV2，RANK：10 ) 在线值：发帖 0 回帖 30 粉丝 2 关注私信	mb_lpcoesnt 5 楼太干了，看到中间就看迷糊了 2024-9-23 13:52 0
kingking888 雪币： 1357 活跃值： (2805) 能力值： ( LV2，RANK：10 ) 在线值：发帖 0 回帖 30 粉丝 1 关注私信	kingking888 6 楼楼主吓我一跳，打开一看以为是l libsgmain.so 最后于 2024-9-24 22:41 被kingking888编辑，原因： 2024-9-24 22:40 1
劫__ 雪币： 351 活跃值： (1032) 能力值： ( LV3，RANK：20 ) 在线值：发帖 2 回帖 10 粉丝 43 关注私信	劫__ 7 楼差了一个字母哈哈 2024-9-24 23:11 0
Ive_406746 雪币： 145 能力值： ( LV1，RANK：0 ) 在线值：发帖 0 回帖 17 粉丝 0 关注私信	Ive_406746 8 楼好文 2024-9-26 10:22 0
jfztaq 雪币： 202 活跃值： (1250) 能力值： ( LV2，RANK：10 ) 在线值：发帖 5 回帖 344 粉丝 1 关注私信	jfztaq 9 楼大佬，能讲下i国网怎么开启webview调试吗？hook不到 2024-9-27 13:00 0
一颗小草雪币： 571 活跃值： (190) 能力值： ( LV2，RANK：10 ) 在线值：发帖 1 回帖 12 粉丝 5 关注私信	一颗小草 10 楼你好，可以发个样本对照学习一下吗 2024-9-27 14:06 0
劫__ 雪币： 351 活跃值： (1032) 能力值： ( LV3，RANK：20 ) 在线值：发帖 2 回帖 10 粉丝 43 关注私信	劫__ 11 楼一颗小草你好，可以发个样本对照学习一下吗某trip软件豌豆荚可找到历史版本 2024-9-27 14:57 0
孤独的街雪币： 1996 活跃值： (2337) 能力值： ( LV2，RANK：10 ) 在线值：发帖 0 回帖 12 粉丝 2 关注私信	孤独的街 12 楼大佬，版本确定是8.0.65吗 2024-9-27 23:41 0
mb_hzvwwcvh 雪币：能力值： ( LV1，RANK：0 ) 在线值：发帖 0 回帖 1 粉丝 0 关注私信	mb_hzvwwcvh 13 楼不清楚是哪个软件 2024-9-30 19:13 0
mb_zfqvurgb 雪币： 6 能力值： ( LV1，RANK：0 ) 在线值：发帖 0 回帖 31 粉丝 1 关注私信	mb_zfqvurgb 14 楼 (b & c) ^ ((~b) & d) => [(c ^ d) & b] ^ d 这个看不懂，为啥 [(c ^ d) & b] ^ d 可以成为 (b & c) ^ ((~b) & d)呢，后者是 f(b,c,d) 2024-10-1 16:27 0
墨穹呢雪币： 2314 活跃值： (3057) 能力值： ( LV3，RANK：20 ) 在线值：发帖 2 回帖 66 粉丝 16 关注私信	墨穹呢 15 楼最新版的算法有变化吗 2024-10-2 20:49 0
寻梦之璐雪币： 731 活跃值： (1637) 能力值： ( LV2，RANK：10 ) 在线值：发帖 2 回帖 46 粉丝 9 关注私信	寻梦之璐 16 楼你好，龙哥的HexSearch大法这个是哪篇文章的哇，我咋找不到。。hexdatasearch这个类从哪找来的。。。哭死 2024-10-21 23:04 0
寻梦之璐雪币： 731 活跃值： (1637) 能力值： ( LV2，RANK：10 ) 在线值：发帖 2 回帖 46 粉丝 9 关注私信	寻梦之璐 17 楼寻梦之璐你好，龙哥的HexSearch大法这个是哪篇文章的哇，我咋找不到。。hexdatasearch这个类从哪找来的。。。哭死问了龙哥，龙哥也想不起来了，哭死 2024-10-21 23:05 0
寻梦之璐雪币： 731 活跃值： (1637) 能力值： ( LV2，RANK：10 ) 在线值：发帖 2 回帖 46 粉丝 9 关注私信	寻梦之璐 18 楼 mb_zfqvurgb (b & c) ^ ((~b) & d) => [(c ^ d) & b] ^ d 这个看不懂，为啥 [(c ^ d) & b] ^ d 可以成为 (b ... 佬，这里你推出来了嘛 2024-10-24 11:04 0
0xEA 雪币： 3098 活跃值： (4222) 能力值： ( LV2，RANK：10 ) 在线值：发帖 4 回帖 88 粉丝 1 关注私信	0xEA 19 楼 mb_zfqvurgb (b & c) ^ ((~b) & d) => [(c ^ d) & b] ^ d 这个看不懂，为啥 [(c ^ d) & b] ^ d 可以成为 (b ... F(b,c,d) = (b & c) \| ((~b) & d) 中间是｜楼主写成^了 2024-10-24 14:48 1
calleng 雪币： 31 活跃值： (3269) 能力值： ( LV2，RANK：10 ) 在线值：发帖 38 回帖 123 粉丝 60 关注私信	calleng 20 楼老板，非常好的文章，谢谢。有样本非常nice 2024-10-27 20:08 0
陈某人雪币： 694 活跃值： (3214) 能力值： ( LV2，RANK：10 ) 在线值：发帖 0 回帖 58 粉丝 2 关注私信	陈某人 21 楼感谢分享最后于 2024-11-14 20:54 被陈某人编辑，原因： 2024-11-14 20:53 0
	游客登录 \| 注册方可回帖回帖表情雪币赚取及消费高级回复

劫__

发帖

回帖

RANK

关注

私信

他的文章

关于我们

联系我们

企业服务

看雪公众号

最新回复 (20)
你瞒我瞒雪币： 2442 活跃值： (10708) 能力值： ( LV2，RANK：10 ) 在线值：发帖 4 回帖 254 粉丝 2 关注私信	你瞒我瞒 2 楼厉害，小白表示看不懂 2024-9-23 09:22 0
肉蚌葱鸡雪币： 20 活跃值： (993) 能力值： ( LV2，RANK：10 ) 在线值：发帖 1 回帖 16 粉丝 5 关注私信	肉蚌葱鸡 3 楼 2024-9-23 09:23 0
mb_ldbucrik 雪币： 10 能力值： ( LV1，RANK：0 ) 在线值：发帖 0 回帖 226 粉丝 2 关注私信	mb_ldbucrik 4 楼学习了，感谢楼主 2024-9-23 09:26 0
mb_lpcoesnt 雪币： 1363 活跃值： (1994) 能力值： ( LV2，RANK：10 ) 在线值：发帖 0 回帖 30 粉丝 2 关注私信	mb_lpcoesnt 5 楼太干了，看到中间就看迷糊了 2024-9-23 13:52 0
kingking888 雪币： 1357 活跃值： (2805) 能力值： ( LV2，RANK：10 ) 在线值：发帖 0 回帖 30 粉丝 1 关注私信	kingking888 6 楼楼主吓我一跳，打开一看以为是l libsgmain.so 最后于 2024-9-24 22:41 被kingking888编辑，原因： 2024-9-24 22:40 1
劫__ 雪币： 351 活跃值： (1032) 能力值： ( LV3，RANK：20 ) 在线值：发帖 2 回帖 10 粉丝 43 关注私信	劫__ 7 楼差了一个字母哈哈 2024-9-24 23:11 0
Ive_406746 雪币： 145 能力值： ( LV1，RANK：0 ) 在线值：发帖 0 回帖 17 粉丝 0 关注私信	Ive_406746 8 楼好文 2024-9-26 10:22 0
jfztaq 雪币： 202 活跃值： (1250) 能力值： ( LV2，RANK：10 ) 在线值：发帖 5 回帖 344 粉丝 1 关注私信	jfztaq 9 楼大佬，能讲下i国网怎么开启webview调试吗？hook不到 2024-9-27 13:00 0
一颗小草雪币： 571 活跃值： (190) 能力值： ( LV2，RANK：10 ) 在线值：发帖 1 回帖 12 粉丝 5 关注私信	一颗小草 10 楼你好，可以发个样本对照学习一下吗 2024-9-27 14:06 0
劫__ 雪币： 351 活跃值： (1032) 能力值： ( LV3，RANK：20 ) 在线值：发帖 2 回帖 10 粉丝 43 关注私信	劫__ 11 楼一颗小草你好，可以发个样本对照学习一下吗某trip软件豌豆荚可找到历史版本 2024-9-27 14:57 0
孤独的街雪币： 1996 活跃值： (2337) 能力值： ( LV2，RANK：10 ) 在线值：发帖 0 回帖 12 粉丝 2 关注私信	孤独的街 12 楼大佬，版本确定是8.0.65吗 2024-9-27 23:41 0
mb_hzvwwcvh 雪币：能力值： ( LV1，RANK：0 ) 在线值：发帖 0 回帖 1 粉丝 0 关注私信	mb_hzvwwcvh 13 楼不清楚是哪个软件 2024-9-30 19:13 0
mb_zfqvurgb 雪币： 6 能力值： ( LV1，RANK：0 ) 在线值：发帖 0 回帖 31 粉丝 1 关注私信	mb_zfqvurgb 14 楼 (b & c) ^ ((~b) & d) => [(c ^ d) & b] ^ d 这个看不懂，为啥 [(c ^ d) & b] ^ d 可以成为 (b & c) ^ ((~b) & d)呢，后者是 f(b,c,d) 2024-10-1 16:27 0
墨穹呢雪币： 2314 活跃值： (3057) 能力值： ( LV3，RANK：20 ) 在线值：发帖 2 回帖 66 粉丝 16 关注私信	墨穹呢 15 楼最新版的算法有变化吗 2024-10-2 20:49 0
寻梦之璐雪币： 731 活跃值： (1637) 能力值： ( LV2，RANK：10 ) 在线值：发帖 2 回帖 46 粉丝 9 关注私信	寻梦之璐 16 楼你好，龙哥的HexSearch大法这个是哪篇文章的哇，我咋找不到。。hexdatasearch这个类从哪找来的。。。哭死 2024-10-21 23:04 0
寻梦之璐雪币： 731 活跃值： (1637) 能力值： ( LV2，RANK：10 ) 在线值：发帖 2 回帖 46 粉丝 9 关注私信	寻梦之璐 17 楼寻梦之璐你好，龙哥的HexSearch大法这个是哪篇文章的哇，我咋找不到。。hexdatasearch这个类从哪找来的。。。哭死问了龙哥，龙哥也想不起来了，哭死 2024-10-21 23:05 0
寻梦之璐雪币： 731 活跃值： (1637) 能力值： ( LV2，RANK：10 ) 在线值：发帖 2 回帖 46 粉丝 9 关注私信	寻梦之璐 18 楼 mb_zfqvurgb (b & c) ^ ((~b) & d) => [(c ^ d) & b] ^ d 这个看不懂，为啥 [(c ^ d) & b] ^ d 可以成为 (b ... 佬，这里你推出来了嘛 2024-10-24 11:04 0
0xEA 雪币： 3098 活跃值： (4222) 能力值： ( LV2，RANK：10 ) 在线值：发帖 4 回帖 88 粉丝 1 关注私信	0xEA 19 楼 mb_zfqvurgb (b & c) ^ ((~b) & d) => [(c ^ d) & b] ^ d 这个看不懂，为啥 [(c ^ d) & b] ^ d 可以成为 (b ... F(b,c,d) = (b & c) \| ((~b) & d) 中间是｜楼主写成^了 2024-10-24 14:48 1
calleng 雪币： 31 活跃值： (3269) 能力值： ( LV2，RANK：10 ) 在线值：发帖 38 回帖 123 粉丝 60 关注私信	calleng 20 楼老板，非常好的文章，谢谢。有样本非常nice 2024-10-27 20:08 0
陈某人雪币： 694 活跃值： (3214) 能力值： ( LV2，RANK：10 ) 在线值：发帖 0 回帖 58 粉丝 2 关注私信	陈某人 21 楼感谢分享最后于 2024-11-14 20:54 被陈某人编辑，原因： 2024-11-14 20:53 0
	游客登录 \| 注册方可回帖回帖表情雪币赚取及消费高级回复

[原创] 记一次xxmain.so 从去花到魔改算法还原

libsgmain.so