CANN/pyasc: get_basic_config API Documentation
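asc.language.adv.get_basic_config

pyasc: This project gives Python users an operator programming interface for accelerating computation on Ascend AI processors. Its interfaces correspond one-to-one with Ascend C and follow native Python syntax.
Project address: https://gitcode.com/cann/pyasc

asc.language.adv.get_basic_config(basic_m: int, basic_n: int, basic_k: int, intrinsics_limit: bool | None = False, batch_loop: bool | None = False, bmm_mode: BatchMode | None = BatchMode.BATCH_LESS_THAN_L1) → MatmulConfig

Configures the parameters of the BasicBlock template and returns the resulting custom BasicBlock template.

Corresponding Ascend C function prototype

    __aicore__ constexpr MatmulConfig GetBasicConfig(const uint32_t basicM, const uint32_t basicN, const uint32_t basicK, const bool intrinsicsLimit = false, const bool batchLoop = false, const BatchMode bmmMode = BatchMode::BATCH_LESS_THAN_L1)

Parameters

- basic_m: Sets the basicM parameter. It has the same meaning as baseM in the TCubeTiling struct: the length of the base block along the M axis in the Matmul computation, in elements.
- basic_n: Sets the basicN parameter. It has the same meaning as baseN in the TCubeTiling struct: the length of the base block along the N axis in the Matmul computation, in elements.
- basic_k: Sets the basicK parameter. It has the same meaning as baseK in the TCubeTiling struct: the length of the base block along the K axis in the Matmul computation, in elements.
- intrinsics_limit: Sets the intrinsicsCheck parameter. Possible values:
    - False: when the inner axis of the left or right matrix on a single core is greater than or equal to 65535, data is not loaded in a loop (default).
    - True: when the inner axis of the left or right matrix on a single core is greater than or equal to 65535, data is loaded in a loop.
- batch_loop: Sets the isNBatch parameter. Possible values:
    - False: multi-batch is disabled (default).
    - True: multi-batch is enabled.
- bmm_mode: Sets the batchMode parameter. Possible values:
    - BatchMode::BATCH_LESS_THAN_L1: the total size of the multi-batch data is < L1 Buffer Size.
    - BatchMode::BATCH_LARGE_THAN_L1: the total size of the multi-batch data is > L1 Buffer Size.
    - BatchMode::SINGLE_LARGE_THAN_L1: the size of a single batch of data is > L1 Buffer Size.

Return value

A MatmulConfig struct.

Example

    # a_type, b_type, c_type, bias_type, pipe, workspace, tiling and the gm_*
    # tensors are assumed to be defined by the surrounding kernel code.
    # Build a BasicBlock config with a 128 x 256 x 64 (M x N x K) base block.
    mm_cfg = asc.adv.get_basic_config(128, 256, 64)
    mm = asc.adv.Matmul(a_type, b_type, c_type, bias_type, mm_cfg)
    asc.adv.register_matmul(pipe, workspace, mm, tiling)
    # Bind the inputs, then compute the full result into gm_c.
    mm.set_tensor_a(gm_a)
    mm.set_tensor_b(gm_b)
    mm.set_bias(gm_bias)
    mm.iterate_all(gm_c)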
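The value descriptions for intrinsics_limit and bmm_mode amount to simple threshold checks, which the following minimal sketch spells out. It is illustrative only and is not part of pyasc: `BatchModeSketch`, `needs_intrinsics_limit`, `pick_batch_mode`, and the `l1_bytes` argument are hypothetical names introduced here so the code runs without the package, and the handling of the "exactly equal to L1 Buffer Size" case is an assumption, since the parameter description does not cover it.

```python
from enum import Enum


class BatchModeSketch(Enum):
    """Hypothetical stand-in for pyasc's BatchMode (member names taken from the doc)."""
    BATCH_LESS_THAN_L1 = "BATCH_LESS_THAN_L1"
    BATCH_LARGE_THAN_L1 = "BATCH_LARGE_THAN_L1"
    SINGLE_LARGE_THAN_L1 = "SINGLE_LARGE_THAN_L1"


# Threshold quoted in the intrinsics_limit description.
INNER_AXIS_LIMIT = 65535


def needs_intrinsics_limit(a_inner_axis: int, b_inner_axis: int) -> bool:
    """Return True when either matrix's single-core inner axis is >= 65535.

    This is the situation in which intrinsics_limit=True enables looped loading.
    """
    return a_inner_axis >= INNER_AXIS_LIMIT or b_inner_axis >= INNER_AXIS_LIMIT


def pick_batch_mode(batch_count: int, bytes_per_batch: int, l1_bytes: int) -> BatchModeSketch:
    """Pick a mode by comparing batch data sizes against the L1 buffer size.

    Assumption: a size exactly equal to l1_bytes is treated as "not larger".
    """
    if bytes_per_batch > l1_bytes:
        # Even one batch does not fit in L1.
        return BatchModeSketch.SINGLE_LARGE_THAN_L1
    if batch_count * bytes_per_batch > l1_bytes:
        # Each batch fits, but all batches together do not.
        return BatchModeSketch.BATCH_LARGE_THAN_L1
    return BatchModeSketch.BATCH_LESS_THAN_L1


if __name__ == "__main__":
    l1 = 512 * 1024  # hypothetical L1 size in bytes, not a real device figure
    print(pick_batch_mode(4, 64 * 1024, l1).name)
    print(needs_intrinsics_limit(70000, 128))
```

In real kernel code the chosen member would presumably be mapped to the corresponding BatchMode value and passed as bmm_mode, with the boolean passed as intrinsics_limit; the actual L1 buffer size depends on the target device and is not stated in this document.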