Contents: Currently, several versions of subroutine libraries have been published, each with its own characteristics. The square root algorithm in this library is a fast approximation algorithm that can achieve the same accuracy as the Newton iteration method, but the speed is about twenty times faster, exceeding the speed of double-byte fixed-point division. This subroutine library has made major revisions to the subroutine library in the appendix of the book "Microcontroller Application Programming Technology": (1) According to the current popular IBM PC-based development system's regulations on assembly language, readers do not need to modify it and can use it directly. (2) The floating-point operation subroutine library has been further tested and optimized, and the conversion subroutines between decimal floating-point numbers and binary floating-point numbers have been completely rewritten to improve the operation accuracy and reliability. (3) Several new floating-point subroutines (transfer, comparison, clearing, zero judgment, etc.) have been added to make the work of writing data processing programs simpler and more intuitive. The instructions for use list the most important items: label, entry condition, exit information, affected resources, and stack requirements. For the meaning of each item, please refer to the content of Chapter 6, Section 6.3.7 of "Microcontroller Application Programming Technology". The program listing has four columns: label, instruction, operand, and comment. The comments are as detailed as possible to facilitate the reader's understanding. The use of the subroutine library is as follows: 1. Link all the contents of the subroutine library after the application program and compile it uniformly. The advantage is that it is simple and convenient, but the disadvantage is that the program is too long and a large number of irrelevant subroutines are also included in it. 2. Only link the relevant parts of the subroutine library after the application program and compile it uniformly. Some subroutines need to call some low-level subroutines, which should also be included. The advantage is that the program is compact, but the disadvantage is that the subroutine library needs to be carefully edited.