Architecture Guide
# Table of Contents

TABLE OF CONTENTS .......................................................................................................................... 2
ABOUT THIS GUIDE ............................................................................................................................ 5
OVERVIEW ............................................................................................................................................ 5

FEATURES ........................................................................................................................................... 5
Hardware Features .............................................................................................................................. 5

WISHBONE BUS COMPLIANT INTERFACE ......................................................................................... 5

Software Development Tools ............................................................................................................. 5
BLOCK DIAGRAM .................................................................................................................................. 6

CONFIGURATION OPTIONS .................................................................................................................. 6

REGISTERS ........................................................................................................................................... 7

GENERAL PURPOSE REGISTERS ........................................................................................................ 7

SPECIAL FUNCTION REGISTERS ........................................................................................................ 7

PSW (Program Status Word) (R/W) SFR0 ............................................................................................. 8
RA (R/W) SFR1 .................................................................................................................................... 8
IPC (R/W) SFR2 .................................................................................................................................... 8

TIMER (R/W) SFR3 .............................................................................................................................. 8
IBASE(R/W) SFR4 ............................................................................................................................... 9
USREG(R/W) SFR5 .............................................................................................................................. 9
HWDBG(R/W) SFR6 ............................................................................................................................ 9
HWBP0(R/W) SFR7 .............................................................................................................................. 9
HWBP1(R/W) SFR8 .............................................................................................................................. 9

INSTRUCTIONS ...................................................................................................................................... 9

INSTRUCTION TYPES ......................................................................................................................... 9

ALU Instructions .................................................................................................................................. 10
Load/store Instructions ......................................................................................................................... 10
Jump Instructions ................................................................................................................................. 11
Multiply and Shift Instructions ........................................................................................................... 11
Compare Instructions ........................................................................................................................ 12
Conditional instructions ..................................................................................................................... 12
User Defined Instructions ................................................................................................................ 13

PIPELINE ............................................................................................................................................. 13

Multiply/Shift. ...................................................................................................................................... 14

CACHE OPERATION .......................................................................................................................... 14

INTERRUPT HANDLING .................................................................................................................. 15

HARDWARE DEBUG AID .................................................................................................................... 17

A) HARDWARE SINGLE STEPPING ................................................................................................ 17
B) HARDWARE BREAKPOINTS ........................................................................................................ 17
C) HARDWARE WATCHPOINTS ....................................................................................................... 18

TIMER .................................................................................................................................................. 18

POWER DOWN MODE ....................................................................................................................... 19

www.niktech.com 2
USER INSTRUCTION INTERFACE................................................................. 20
MANIK WISHBONE BUS INTERFACE.......................................................... 22
APPLICATION BINARY INTERFACE ............................................................... 22

Function calling convention ........................................................................ 22
Layout of data in memory ................................................................................ 22

APPENDIX – A. HDL INSTANTIATION TEMPLATE ........................................... 24
APPENDIX – B. ALPHABETIC LIST OF INSTRUCTIONS .................................... 26

ADD  ADD, UPDATE CARRY ........................................................................ 26
ADDC ADD WITH CARRY, UPDATE CARRY.................................................. 26
ADDF CONDITIONAL ADD IF FALSE .......................................................... 26
ADDI ADD WITH IMMEDIATE ...................................................................... 27
ADDIF CONDITIONAL ADD WITH IMMEDIATE, IF FALSE.......................... 27
ADDIT CONDITIONAL ADD WITH IMMEDIATE, IF TRUE............................ 27
ADDT CONDITIONAL ADD, IF TRUE ......................................................... 28
AND  LOGICAL Bitwise AND ....................................................................... 28
ANDI LOGICAL AND WITH IMMEDIATE .................................................... 29
ASR  ARITHMETIC Shift Right (DYNAMIC) ................................................ 29
ASRI ARITHMETIC WITH IMMEDIATE (STATIC) ........................................ 29
CMPEQ COMPARE FOR EQUAL .................................................................. 30
CMPEQI COMPARE WITH IMMEDIATE FOR EQUAL ................................. 30
CMPGT COMPARE FOR GREATER THAN (SIGNED) ..................................... 31
CMPGTI COMPARE WITH IMMEDIATE FOR GREATER THAN (SIGNED)..... 31
CMPS COMPARE FOR HIGHER OR SAME (UNSIGNED) .............................. 31
CMPSI COMPARE FOR LESS THAN OR SAME (UNSIGNED) ....................... 32
CMPLT COMPARE FOR LESS THAN (SIGNED) .......................................... 32
CMPLTI COMPARE WITH IMMEDIATE FOR LESS THAN (SIGNED) .......... 33
J   UNCONDITIONAL Branch (PC relative) .................................................. 33
JF  CONDITIONAL Branch if False .............................................................. 33
JL  Branch to subroutine; Update RA ........................................................ 34
JR  Branch Register Indirect ....................................................................... 34
JRL  Branch to subroutine; Register Indirect; Update RA ......................... 35
JSFR Branch Special Function Register Indirect ....................................... 35
JT  CONDITIONAL Branch if True .............................................................. 36
LDR BH LOAD Register FROM MEMORY .................................................. 36
LDRC LOAD Word(32bits) FROM literal pool (PC relative) ......................... 37
LSL  LOGICAL Shift Left (DYNAMIC) ......................................................... 37
LSLI LOGICAL Shift Left with IMMEDIATE (STATIC) ............................... 37
LSR  LOGICAL Shift Right (DYNAMIC) ..................................................... 38
LSRI LOGICAL Shift Right with IMMEDIATE (STATIC) ............................. 38
MFSFR Move FROM SFR to GPR .................................................................. 38
MOV  UNCONDITIONAL move FROM GPR TO GPR .................................. 39
MOVF CONDITIONAL Move FROM GPR TO GPR, IF FALSE....................... 39
MOVI UNCONDITIONAL Move IMMEDIATE TO GPR ............................... 40
MOVT CONDITIONAL Move FROM GPR TO GPR, IF TRUE ......................... 40
MTSFR Move FROM SFR to GPR .................................................................. 40
MULT MULTIPLY ......................................................................................... 41
MULTI MULTIPLY WITH IMMEDIATE ....................................................... 41
OR  LOGICAL OR ....................................................................................... 41
ST(R)BH Store Word(32 bits) TO MEMORY ................................................ 42
SUB  SUBTRACT; Update CARRY .................................................................. 42
SUBC SUBTRACT WITH CARRY; Update CARRY ........................................ 43
SUBF CONDITIONAL SUBTRACT; IF FALSE .............................................. 43
SUBT CONDITIONAL SUBTRACT; IF TRUE ............................................... 43
<table>
<thead>
<tr>
<th>Code</th>
<th>Instruction</th>
<th>Page</th>
</tr>
</thead>
<tbody>
<tr>
<td>SWINT</td>
<td>SOFTWARE INTERRUPT</td>
<td>44</td>
</tr>
<tr>
<td>SXB</td>
<td>SIGN EXTEND BYTE</td>
<td>44</td>
</tr>
<tr>
<td>SXH</td>
<td>SIGN EXTEND HALF WORD</td>
<td>44</td>
</tr>
<tr>
<td>UDI[0-3]</td>
<td>USER DEFINED INSTRUCTIONS</td>
<td>45</td>
</tr>
<tr>
<td>XHW</td>
<td>EXCHANGE HALF WORD</td>
<td>45</td>
</tr>
<tr>
<td>XOR</td>
<td>LOGICAL XOR</td>
<td>46</td>
</tr>
<tr>
<td>ZXH</td>
<td>ZERO EXTEND</td>
<td>46</td>
</tr>
</tbody>
</table>
About this guide.
This guide describes the Hardware architecture, Application Binary Interface and the instruction set of MANIK – a 32 bit RISC microprocessor.

Overview
MANIK is a 32 bit RISC Microprocessor. The salient features of the processor are listed below.

Features
Hardware Features
- Data Path Width 32 bits, with Four stage pipeline.
- Mixed 16/32 bit instructions for code density
- Von Neumann Architecture (Data and Instruction in the same address space).
- Sixteen, 32 bit General Purpose Registers.
- Four USER defined instructions (with Register File Write back capability).
- In-order issue Out-of-order completion.
- Some Conditional Instructions (Reduces branches & increases code density).
- Built in 32 bit Timer, (count down and interrupt modes)
- Power Down Mode.
- Multiplier and Barrel shifter (2 configurations, size Vs Performance trade off)
- Configurable Data & Instruction cache sizes.

Wishbone bus compliant interface.
- Six External interrupts
- Hardware single step, Breakpoint & Watch point facility

Software Development Tools
- GNU Assembler, Linker (binutils)
- GCC (C,C++ Compiler)
- GDB (Debugger) and Instruction Set Simulator
- Standalone C-Library (RedHat newlib)
### Block Diagram

![Block Diagram](image)

### Configuration Options

<table>
<thead>
<tr>
<th>Option Name</th>
<th>Default Value</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>WIDTH</td>
<td>32</td>
<td>Data Width (should not be changed)</td>
</tr>
<tr>
<td>ADDR_WIDTH</td>
<td>32</td>
<td>Address Width should not exceed Data Width</td>
</tr>
<tr>
<td>USREG_ENABLED</td>
<td>True</td>
<td>Enable USREG special function register</td>
</tr>
<tr>
<td>TIMER_WIDTH</td>
<td>32</td>
<td>Width of the built-in timer counter</td>
</tr>
<tr>
<td>TIMER_CLK_DIV</td>
<td>0</td>
<td>Timer counter decremented every <code>this</code> cycles</td>
</tr>
<tr>
<td>INTR_VECBASE</td>
<td>0</td>
<td>Configuration value of <code>ibase</code> (vector base)</td>
</tr>
<tr>
<td>INTR_SWIVEC</td>
<td>4</td>
<td>Offset of swi vector in the vector table</td>
</tr>
<tr>
<td>INTR_TMRVEC</td>
<td>8</td>
<td>Offset of Timer vector in the vector table</td>
</tr>
<tr>
<td>INTR_EXTVEC</td>
<td>12</td>
<td>Offset of external interrupt vector</td>
</tr>
<tr>
<td>INTR_BERVEC</td>
<td>16</td>
<td>Offset of Buserror interrupt vector</td>
</tr>
<tr>
<td>USER_INST</td>
<td>True</td>
<td>Logic for User defined instruction enabled</td>
</tr>
</tbody>
</table>
ICACHE_ENABLED | True | Instruction Cache enabled
DCACHE_ENABLED | True | Data Cache enabled
ICACHE_ADDR_WIDTH | 10 | Address of icache memory (determines icache size)
DCACHE_ADDR_WIDTH | 10 | Address of dcache memory (determines dcache size)
ICACHE_LINE_WORDS | 1 | Number of words(32 bit) per cache line (Number of read requests issued for a icache miss). Valid values (1,2,4,8)
DCACHE_LINE_WORDS | 1 | Number of words(32 bit) per cache line (Number of read requests issued for a dcache miss). Valid values (1,2,4,8)
ICACHE_SETS | 1 | Number of icache sets. Valid values (1,2,4)
DCACHE_SETS | 1 | Number of dcache sets. Valid values (1,2,4)
SHIFT_SWIDTH | 3 | Number of bits the barrel shifter will shift per cycle (max – 5, min – 1)
MULT_BWIDTH | 32 | Multiplier will take this/32 cycles to complete. (max – 32, min – 2)
HW_BPENB | True | Enable 2 Hardware break points
HW_WPENB | True | Enable 2 Hardware watch points

### Registers

There are sixteen 32-bit General Purpose registers in MANIK and four Special Function Registers.

**General Purpose Registers**

All sixteen General Purpose registers in MANIK are 32 bits wide, and are numbered from R0 to R15. The RESET value of all of them is UNDEFINED. The General Purpose Registers are orthogonal; there is no special or implied usage of any GPR by the hardware.

<table>
<thead>
<tr>
<th>Register Name</th>
<th>Compiler Usage</th>
</tr>
</thead>
<tbody>
<tr>
<td>R0</td>
<td>Stack Pointer</td>
</tr>
<tr>
<td>R1</td>
<td>1st Argument Register / Return Value</td>
</tr>
<tr>
<td>R2</td>
<td>2nd Argument Register / Return Value</td>
</tr>
<tr>
<td>R3</td>
<td>3rd Argument Register</td>
</tr>
<tr>
<td>R4</td>
<td>4th Argument Register</td>
</tr>
<tr>
<td>R5</td>
<td>Temporary register</td>
</tr>
<tr>
<td>R6</td>
<td>Temporary register</td>
</tr>
<tr>
<td>R7</td>
<td>Temporary register</td>
</tr>
<tr>
<td>R8</td>
<td>Register Variable (callee saved)</td>
</tr>
<tr>
<td>R9</td>
<td>Register Variable (callee saved)</td>
</tr>
<tr>
<td>R10</td>
<td>Register Variable (callee saved)</td>
</tr>
<tr>
<td>R11</td>
<td>Register Variable (callee saved)</td>
</tr>
<tr>
<td>R12</td>
<td>Register Variable (callee saved)</td>
</tr>
<tr>
<td>R13</td>
<td>Register Variable (callee saved)</td>
</tr>
<tr>
<td>R14</td>
<td>Register Variable (callee saved)</td>
</tr>
<tr>
<td>R15</td>
<td>Register Variable (callee saved)</td>
</tr>
</tbody>
</table>

**Special Function Registers.**

MANIK core has ten Special Function Registers. These registers can be read using the “mfsfr” instruction, and written to using the “mtsfr” instruction. The registers can be
referred to using their number (SFR0 – SFR10) or using their synonyms, PSW, RA, IPC, TIMER, IBASE, USREG, HWDBG, BP0, BP1, WP0 and WP1.

PSW (Program Status Word) (R/W) SFR0

<table>
<thead>
<tr>
<th>Bit Position(s)</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>SW – Soft Interrupt in progress</td>
</tr>
<tr>
<td>1</td>
<td>EI – External Interrupt in progress</td>
</tr>
<tr>
<td>2</td>
<td>TI – Timer interrupt in progress</td>
</tr>
<tr>
<td>3</td>
<td>IP – Some interrupt in progress</td>
</tr>
<tr>
<td>4</td>
<td>TE – Enable timer interrupt</td>
</tr>
<tr>
<td>5</td>
<td>IE – Interrupt enable (global interrupt enable)</td>
</tr>
<tr>
<td>6</td>
<td>BIP – Backup IP flag. See General Interrupt handling for more details</td>
</tr>
<tr>
<td>7</td>
<td>PD – Power Down flag. See section on Power Down Mode for more details</td>
</tr>
<tr>
<td>8</td>
<td>TF – True false flag, 1 if last compare instruction is true. See Compare Instructions for more details</td>
</tr>
<tr>
<td>9</td>
<td>CY – Carry flag</td>
</tr>
<tr>
<td>10</td>
<td>BD – Flag Bypass Data Cache. See Cache operations for details.</td>
</tr>
<tr>
<td>12</td>
<td>TR – Timer reload flag. See Section Timer for more details.</td>
</tr>
<tr>
<td>12</td>
<td>TU – Timer underflow flag. See Section Timer for more details.</td>
</tr>
<tr>
<td>14</td>
<td>DBER – Data bus error, flag is set when a BUS error is detected during data load or store operation. See section Bus Error for more details.</td>
</tr>
<tr>
<td>15</td>
<td>IBER – Instruction bus Error, flag is set when a BUS error is detected during an instruction fetch. See section Bus Error for more details.</td>
</tr>
<tr>
<td>19:16</td>
<td>SWI – Bits 3:0 of SWI instruction</td>
</tr>
<tr>
<td>20</td>
<td>E10 – Enable External interrupt 0. See External Interrupt for details.</td>
</tr>
<tr>
<td>21</td>
<td>E11 – Enable External interrupt 1. See External Interrupt for details.</td>
</tr>
<tr>
<td>22</td>
<td>E12 – Enable External interrupt 2. See External Interrupt for details.</td>
</tr>
<tr>
<td>26</td>
<td>ES0 – External interrupt 0 status. See External Interrupt for details.</td>
</tr>
<tr>
<td>27</td>
<td>ES1 – External interrupt 1 status. See External Interrupt for details.</td>
</tr>
<tr>
<td>28</td>
<td>ES2 – External interrupt 2 status. See External Interrupt for details.</td>
</tr>
<tr>
<td>29</td>
<td>ES3 – External interrupt 3 status. See External Interrupt for details.</td>
</tr>
<tr>
<td>31</td>
<td>ES5 – External interrupt 5 status. See External Interrupt for details.</td>
</tr>
</tbody>
</table>

RA (R/W) SFR1
Return address register, updated when a jump and link instruction is executed; it can also be updated using the “mtsfr ra,Rn” instruction, in this case the value is available after 2 cycles; there should not be a branch and link instruction within two instructions after a “mtsfr ra, Rn” instruction. See Jump Instructions for more details.

IPC (R/W) SFR2
Address of the instruction that would have executed when the core started processing an interrupt, the special function register can also be updated by software with the “mtsfr ipc,Rn” instruction.

TIMER (R/W) SFR3
Timer counter register, determines the frequency of timer interrupts see Timer for more details. Read returns the cycles remaining to zero.
IBASE(R/W) SFR4
This register determines the base address for the interrupt vector. The register can be configured with a value during system generation, or can be assigned a value by software, see section Interrupt handling for details.

USREG(R/W) SFR5
The register is available when the configuration parameter USREG_ENB is set to true. The register is not used by the processor and can be used a global register by the user application program.

HWDBG(R/W) SFR6
Hardware break/watch point control/status register.

<table>
<thead>
<tr>
<th>Bit Position(s)</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>BPENB (R/O) - Hardware break point is available. ‘1’ when HW_BPENB = true</td>
</tr>
<tr>
<td>1</td>
<td>WPENB (R/O) - Hardware watch point is available. ‘1’ when HW_WPENB = true</td>
</tr>
<tr>
<td>2</td>
<td>BP0_ENB (R/W) - enable BP0, the PC is compared with BP0 register</td>
</tr>
<tr>
<td>3</td>
<td>BP1_ENB (R/W) - enable BP1, the PC is compared with BP1 register</td>
</tr>
<tr>
<td>4</td>
<td>WP0_ENB (R/W) - enable WP0, the load/store address is compared with WP0 register</td>
</tr>
<tr>
<td>5</td>
<td>WP1_ENB (R/W) - enable WP1, the load/store address is compared with WP1 register</td>
</tr>
<tr>
<td>6</td>
<td>BP_HIT (R/O) - Hardware Break point is hit</td>
</tr>
<tr>
<td>7</td>
<td>WP0_HIT (R/O) - Watchpoint 0 is hit</td>
</tr>
<tr>
<td>8</td>
<td>WP1_HIT (R/O) - Watchpoint 1 is hit</td>
</tr>
<tr>
<td>9:31</td>
<td>(Reserved) - zeros</td>
</tr>
</tbody>
</table>

HWBP0(R/W) SFR7
Hardware Breakpoint address , available when HP_BPENB is true

HWBP1(R/W) SFR8
Hardware Breakpoint address , available when HP_BPENB is true

HWWP0(R/W) SFR9
Hardware Watchpoint address , available when HP_WPENB is true

HWWP1(R/W) SFR10
Hardware Watchpoint address , available when HP_WPENB is true

Instructions

Instruction Types
MANIK has six different types of instructions. Most of the instructions are 16 bits wide; the PC relative jump instructions are 32 bit wide. Following is a table of notations used in describing the instruction set.

www.niktech.com
### Notation

<table>
<thead>
<tr>
<th>Notation</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>Rd</td>
<td>General Purpose Register operand R0 – R15</td>
</tr>
<tr>
<td>Rb</td>
<td>General Purpose Register operand R0 – R15</td>
</tr>
<tr>
<td>UIMM4</td>
<td>Unsigned 4 bit immediate operand [0..15]</td>
</tr>
<tr>
<td>SIMM4</td>
<td>Signed 4 bit immediate operand [-8..+7]</td>
</tr>
<tr>
<td>UIMM8</td>
<td>Unsigned 8 bit immediate operand [0..255]</td>
</tr>
<tr>
<td>SIMM8</td>
<td>Signed 8 bit immediate operand [-128..127]</td>
</tr>
<tr>
<td>SFRn</td>
<td>Special Function Register SFR0 – SFR3</td>
</tr>
<tr>
<td>EA</td>
<td>Effective address</td>
</tr>
<tr>
<td>S1I027</td>
<td>Signed 27 bit immediate operand [-256 MB ... +256 MB]</td>
</tr>
<tr>
<td>sxt(x)</td>
<td>Sign Extend x to 32 bit</td>
</tr>
<tr>
<td>zxt(x)</td>
<td>Zero Extend x to 32 bit</td>
</tr>
</tbody>
</table>

### ALU Instructions

All ALU instructions are executed by the arithmetic and logic unit and take one clock cycle to execute. MANIK has two formats for ALU instructions RR (Register, Register) and RI (Register, Immediate)

#### RR Form (Register, Register)

<table>
<thead>
<tr>
<th>Syntax</th>
<th>15:12</th>
<th>11:8</th>
<th>7:4</th>
<th>3:0</th>
<th>Semantics</th>
</tr>
</thead>
<tbody>
<tr>
<td>add Rd, Rb</td>
<td>0x2</td>
<td>0x1</td>
<td>Rd</td>
<td>Rb</td>
<td>Rd = Rd + Rb;</td>
</tr>
<tr>
<td>addc Rd, Rb</td>
<td>0x2</td>
<td>0x3</td>
<td>Rd</td>
<td>Rb</td>
<td>Rd = Rd + Rb + CY;</td>
</tr>
<tr>
<td>sub Rd, Rb</td>
<td>0x2</td>
<td>0x0</td>
<td>Rd</td>
<td>Rb</td>
<td>Rd = Rd - Rb;</td>
</tr>
<tr>
<td>subc Rd, Rb</td>
<td>0x2</td>
<td>0x2</td>
<td>Rd</td>
<td>Rb</td>
<td>Rd = Rd - Rb - CY;</td>
</tr>
<tr>
<td>mov Rd, Rb</td>
<td>0x2</td>
<td>0x4</td>
<td>Rd</td>
<td>Rb</td>
<td>Rd = Rb;</td>
</tr>
<tr>
<td>and Rd, Rb</td>
<td>0x2</td>
<td>0x5</td>
<td>Rd</td>
<td>Rb</td>
<td>Rd = Rd &amp; Rb;</td>
</tr>
<tr>
<td>or Rd, Rb</td>
<td>0x2</td>
<td>0x6</td>
<td>Rd</td>
<td>Rb</td>
<td>Rd = Rd</td>
</tr>
<tr>
<td>xor Rd, Rb</td>
<td>0x2</td>
<td>0x7</td>
<td>Rd</td>
<td>Rb</td>
<td>Rd = Rd ^ Rb;</td>
</tr>
<tr>
<td>sxb Rd, Rb</td>
<td>0x2</td>
<td>0x8</td>
<td>Rd</td>
<td>Rb</td>
<td>Rd[31:8] = Rb[7]; Rd[7:0] = Rb[7:0];</td>
</tr>
<tr>
<td>xhw Rd, Rb</td>
<td>0x2</td>
<td>0x9</td>
<td>Rd</td>
<td>Rb</td>
<td>Rd = (Rb &lt;&lt; 16)</td>
</tr>
<tr>
<td>sxh Rd, Rb</td>
<td>0x2</td>
<td>0xa</td>
<td>Rd</td>
<td>Rb</td>
<td>Rd[31:16] = Rb[15]; Rd[15:0] = Rb[15:0];</td>
</tr>
<tr>
<td>zxh Rd, Rb</td>
<td>0x2</td>
<td>0xb</td>
<td>Rd</td>
<td>Rb</td>
<td>Rd[31:16] = 0; Rd[15:0] = Rb[15:0];</td>
</tr>
</tbody>
</table>

#### RI Form (Register, Immediate)

<table>
<thead>
<tr>
<th>Syntax</th>
<th>15:12</th>
<th>11:8</th>
<th>7:4</th>
<th>3:0</th>
<th>Semantics</th>
</tr>
</thead>
<tbody>
<tr>
<td>andi Rd, UIMM8</td>
<td>0x5</td>
<td>UIMM8</td>
<td>7:4</td>
<td>Rd</td>
<td>UIMM8</td>
</tr>
<tr>
<td>addi Rd, SIMM8</td>
<td>0x6</td>
<td>SIMM8</td>
<td>7:4</td>
<td>Rd</td>
<td>SIMM8</td>
</tr>
<tr>
<td>movi Rd, SIMM8</td>
<td>0x7</td>
<td>SIMM8</td>
<td>7:4</td>
<td>Rd</td>
<td>SIMM8</td>
</tr>
</tbody>
</table>

### Load/store Instructions

MANIK can load and store data in 32 bit (word), 16 bit (half word) or 8 bit (byte) chunks. The core aligns the address depending on the data size of the operation; four byte aligned for 32 bit (word) access (lower order 2 bits of address forced to zero); two byte aligned for 16 bit (half word) access (lowest order bit of address forced to zero). When loading data narrower than 32 bits, the upper (MS bits) of the destination register are set to zero, i.e. for half word load bits 31 - 16 of the destination register are set to zero, similarly for 8 bits loads bit positions 31 – 8 of the destination register are set to zero.

<table>
<thead>
<tr>
<th>Syntax</th>
<th>15-12</th>
<th>11-8</th>
<th>7-4</th>
<th>3-0</th>
<th>Semantics</th>
</tr>
</thead>
<tbody>
<tr>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
</tbody>
</table>

www.niktech.com
In addition to the above mentioned Register Indexed loads, the MANIK instruction set also provides a PC Relative load, this allows for loads of large constants that cannot be loaded into a register using the ALU instructions. The assembler generates a literal pool and puts address of LABEL in the literal pool; the offset of the entry in the literal pool is then inserted into the instruction. The LABEL can be replaced by a constant, in which case the assembler will put the constant value in the literal pool.

<table>
<thead>
<tr>
<th>Syntax</th>
<th>15:12</th>
<th>11:8</th>
<th>7:3</th>
<th>0:0</th>
<th>Semantics</th>
</tr>
</thead>
<tbody>
<tr>
<td>ldrpc</td>
<td>Rd, LABEL</td>
<td>0x4</td>
<td>UIOM8[7:4]</td>
<td>Rd</td>
<td>UIOM8[3:0]</td>
</tr>
</tbody>
</table>

**Jump Instructions**

MANIK has three types of jump instructions a) short PC relative , b) long PC relative and c) Register indirect. Short PC relative jump instructions are 16 bits wide and have a range of +/- 1KB, the Long PC relative jumps are 32 bit wide have a range of +/- 64MB; when the sjt, sjf or sj mnemonics are used the assembler determines the branch distance and generates the appropriate jump instruction, the jt, jf & j instruction forces the assembler to use long branch. Register indirect jump instructions are 16 bits wide. The register indirect form can use both General purpose registers and Special function registers as its source. The assembler inserts a NOP instruction after all “jrl” instruction to compensate for the size difference between the size of a “jl” and “jrl” instruction.

**PC Relative Jumps**

<table>
<thead>
<tr>
<th>Syntax</th>
<th>15:10</th>
<th>9:0</th>
<th>Semantics</th>
</tr>
</thead>
<tbody>
<tr>
<td>SJF Label</td>
<td>0xe4</td>
<td>SIMM10</td>
<td>If TF = 0 then PC += sxt(SIMM10*2);</td>
</tr>
<tr>
<td>SJT Label</td>
<td>0xed</td>
<td>SIMM10</td>
<td>If TF = 1 then PC += sxt(SIMM10*2);</td>
</tr>
<tr>
<td>SJ  Label</td>
<td>0xf4</td>
<td>SIMM10</td>
<td>PC += sxt(SIMM27*2);</td>
</tr>
</tbody>
</table>

<table>
<thead>
<tr>
<th>Syntax</th>
<th>31:27</th>
<th>26:0</th>
<th>Semantics</th>
</tr>
</thead>
<tbody>
<tr>
<td>JF Label</td>
<td>0xe0</td>
<td>SIMM26</td>
<td>If TF = 0 then PC += sxt(SIMM26*2);</td>
</tr>
<tr>
<td>JT Label</td>
<td>0xe8</td>
<td>SIMM26</td>
<td>If TF = 1 then PC += sxt(SIMM26*2);</td>
</tr>
<tr>
<td>J  Label</td>
<td>0xf0</td>
<td>SIMM26</td>
<td>PC += sxt(SIMM26*2);</td>
</tr>
<tr>
<td>JL Label</td>
<td>0xf8</td>
<td>SIMM26</td>
<td>RA = PC + 4; PC += sxt(SIMM26*2);</td>
</tr>
</tbody>
</table>

**Register Indirect Jumps**

<table>
<thead>
<tr>
<th>Syntax</th>
<th>15:8</th>
<th>7:3</th>
<th>0:0</th>
<th>Semantics</th>
</tr>
</thead>
<tbody>
<tr>
<td>JRL Rb</td>
<td>0x1c</td>
<td>0</td>
<td>Rb</td>
<td>RA = PC + 4; PC = Rb;</td>
</tr>
<tr>
<td>JR Rb</td>
<td>0x1d</td>
<td>0</td>
<td>Rb</td>
<td>PC = Rb;</td>
</tr>
<tr>
<td>JSFR SFRn</td>
<td>0x1e</td>
<td>0</td>
<td>SFRn</td>
<td>PC = SFRn</td>
</tr>
</tbody>
</table>
Multiply and Shift Instructions.

Multiply and shift operations are performed by a separate unit. The instructions have two forms RR (Register, Register) and RI (Register Immediate). The Register Immediate form allows immediate values 0...15, to shift by more than 15 bits use the “xhw” instruction in combination with the shift instruction.

### Syntax

<table>
<thead>
<tr>
<th></th>
<th>15:12</th>
<th>11:8</th>
<th>7:4</th>
<th>3:0</th>
<th>Semantics</th>
</tr>
</thead>
<tbody>
<tr>
<td>lsl</td>
<td>Rd, Rb</td>
<td>0x2 0xc</td>
<td>Rd</td>
<td>Rb</td>
<td>Rd = Rd &lt;&lt; Rb;</td>
</tr>
<tr>
<td>lsr</td>
<td>Rd, Rb</td>
<td>0x2 0xd</td>
<td>Rd</td>
<td>Rb</td>
<td>Rd = Rd &gt;&gt; Rb; (Logical)</td>
</tr>
<tr>
<td>asr</td>
<td>Rd, Rb</td>
<td>0x2 0xe</td>
<td>Rd</td>
<td>Rb</td>
<td>Rd = Rd &gt;&gt; Rb; (Arithmetic)</td>
</tr>
<tr>
<td>mult</td>
<td>Rd, Rb</td>
<td>0x2 0xf</td>
<td>Rd</td>
<td>Rb</td>
<td>Rd = Rd * Rb;</td>
</tr>
<tr>
<td>lsl</td>
<td>UIMM4</td>
<td>0x0 0xc</td>
<td>Rd</td>
<td>UIMM4</td>
<td>Rd = Rd &lt;&lt; UIMM4;</td>
</tr>
<tr>
<td>lsri</td>
<td>UIMM4</td>
<td>0x0 0xd</td>
<td>Rd</td>
<td>UIMM4</td>
<td>Rd = Rd &gt;&gt; UIMM4; (Logical)</td>
</tr>
<tr>
<td>asri</td>
<td>UIMM4</td>
<td>0x0 0xe</td>
<td>Rd</td>
<td>UIMM4</td>
<td>Rd = Rd &gt;&gt; UIMM4; (Arithmetic)</td>
</tr>
<tr>
<td>multi</td>
<td>UIMM4</td>
<td>0x0 0xf</td>
<td>Rd</td>
<td>UIMM4</td>
<td>Rd = Rd * UIMM4;</td>
</tr>
</tbody>
</table>

#### Compare Instructions

Compare instructions update the True/False flag (TF) in the PSW. These instructions will implicitly subtract the second operand from the first operand, and will set the True/False flag depending on the Carry, Overflow and Negative flag (Overflow and Negative flags are intermediate values and are not accessible by software). The compare instructions also have two forms RR (Register, Register) and RI (Register, Immediate); the RI form is only available for “equal-to” and the signed compares.

### Syntax

<table>
<thead>
<tr>
<th></th>
<th>15:12</th>
<th>11:8</th>
<th>7:4</th>
<th>3:0</th>
<th>Semantics</th>
</tr>
</thead>
<tbody>
<tr>
<td>cmpeqi</td>
<td>Rd, SIMM4</td>
<td>0x3 0x0</td>
<td>Rd</td>
<td>SIMM4</td>
<td>TF = (Rd == sxt(SIMM4))</td>
</tr>
<tr>
<td>cmpeq</td>
<td>Rd, Rb</td>
<td>0x3 0x1</td>
<td>Rd</td>
<td>Rb</td>
<td>TF = (Rd == Rb)</td>
</tr>
<tr>
<td>cmphs</td>
<td>Rd, Rb</td>
<td>0x3 0x2</td>
<td>Rd</td>
<td>Rb</td>
<td>TF = ((unsigned) Rd &gt;= (unsigned) Rb)</td>
</tr>
<tr>
<td>cmpls</td>
<td>Rd, Rb</td>
<td>0x3 0x3</td>
<td>Rd</td>
<td>Rb</td>
<td>TF = ((unsigned) Rd &lt;= (unsigned) Rb)</td>
</tr>
<tr>
<td>cmpgt</td>
<td>Rd, Rb</td>
<td>0x3 0x4</td>
<td>Rd</td>
<td>Rb</td>
<td>TF = ((signed) Rd &gt; (signed) Rb)</td>
</tr>
<tr>
<td>cmpgti</td>
<td>Rd, SIMM4</td>
<td>0x3 0x5</td>
<td>Rd</td>
<td>SIMM4</td>
<td>TF = ((signed) Rd &gt; sxt(SIMM4))</td>
</tr>
<tr>
<td>cmpleqi</td>
<td>Rd, SIMM4</td>
<td>0x3 0x6</td>
<td>Rd</td>
<td>SIMM4</td>
<td>TF = ((signed) Rd &lt;= sxt(SIMM4))</td>
</tr>
<tr>
<td>cmpleqi</td>
<td>Rd, SIMM4</td>
<td>0x3 0x7</td>
<td>Rd</td>
<td>SIMM4</td>
<td>TF = ((signed) Rd &lt; sxt(SIMM4))</td>
</tr>
</tbody>
</table>

#### Conditional instructions.

The MANIK instruction set provides conditional forms for some of the ALU instructions. The results of these instructions are written back to the register file depending on the value of the TF flag in the PSW.

### Syntax

<table>
<thead>
<tr>
<th></th>
<th>15:12</th>
<th>11:8</th>
<th>7:4</th>
<th>3:0</th>
<th>Semantics</th>
</tr>
</thead>
<tbody>
<tr>
<td>addit</td>
<td>Rd, SIMM4</td>
<td>0x3 0xe</td>
<td>Rd</td>
<td>SIMM4</td>
<td>If TF = 1 then Rd += sxt(SIMM4)</td>
</tr>
<tr>
<td>addif</td>
<td>Rd, SIMM4</td>
<td>0x3 0xa</td>
<td>Rd</td>
<td>SIMM4</td>
<td>If TF = 0 then Rd += sxt(SIMM4)</td>
</tr>
<tr>
<td>movt</td>
<td>Rd, Rb</td>
<td>0x3 0xc</td>
<td>Rd</td>
<td>Rb</td>
<td>If TF = 1 then Rd = Rb</td>
</tr>
<tr>
<td>movf</td>
<td>Rd, Rb</td>
<td>0x3 0xa</td>
<td>Rd</td>
<td>Rb</td>
<td>If TF = 0 then Rd = Rb</td>
</tr>
<tr>
<td>addt</td>
<td>Rd, Rb</td>
<td>0x3 0xf</td>
<td>Rd</td>
<td>Rb</td>
<td>If TF = 1 then Rd = Rb</td>
</tr>
<tr>
<td>addf</td>
<td>Rd, Rb</td>
<td>0x3 0xb</td>
<td>Rd</td>
<td>Rb</td>
<td>If TF = 0 then Rd = Rb</td>
</tr>
<tr>
<td>subt</td>
<td>Rd, Rb</td>
<td>0x3 0xd</td>
<td>Rd</td>
<td>Rb</td>
<td>If TF = 1 then Rd = Rb</td>
</tr>
<tr>
<td>subf</td>
<td>Rd, Rb</td>
<td>0x3 0x9</td>
<td>Rd</td>
<td>Rb</td>
<td>If TF = 0 then Rd = Rb</td>
</tr>
</tbody>
</table>

www.niktech.com
**User Defined Instructions.**

MANIK allows four user defined instructions, each of these instructions can write back data to the register file. See [USER Instruction interface](#) for details.

<table>
<thead>
<tr>
<th>Syntax</th>
<th>15:12</th>
<th>11:8</th>
<th>7:4</th>
<th>3:0</th>
<th>Sematics</th>
</tr>
</thead>
<tbody>
<tr>
<td>udi0 Rd, Rb</td>
<td>0</td>
<td>0x8</td>
<td>Rd</td>
<td>Rb</td>
<td>Rd = User Defined Function (Rd, Rb)</td>
</tr>
<tr>
<td>udi1 Rd, Rb</td>
<td>0</td>
<td>0x9</td>
<td>Rd</td>
<td>Rb</td>
<td>Rd = User Defined Function (Rd, Rb)</td>
</tr>
<tr>
<td>udi2 Rd, Rb</td>
<td>0</td>
<td>0xa</td>
<td>Rd</td>
<td>Rb</td>
<td>Rd = User Defined Function (Rd, Rb)</td>
</tr>
<tr>
<td>udi3 Rd, Rb</td>
<td>0</td>
<td>0xb</td>
<td>Rd</td>
<td>Rb</td>
<td>Rd = User Defined Function (Rd, Rb)</td>
</tr>
</tbody>
</table>

**Pipeline.**

The following sections describe the current pipeline implementation, it is likely to change in future releases of the processor core. The MANIK core has a four stage bypassed and interlocked pipeline. The pipeline stages are:

a) Instruction Fetch, Decode (IF)

b) Decode Stage (DE)

c) Register File Read (RF)

d) Execute (EX)

The execute (EX) stage consists of four units.

a) Arithmetic and Logic unit. This unit executes addition, subtraction and bitwise logic operations. All instructions executing in this unit has a *latency* of one cycle.

b) Multiply and Shift unit. This unit executes multiply and shift instructions, the MANIK pipeline can continue executing instructions following it in the pipeline as long they do not depend on the result of the multiply or shift instruction.

c) Load / Store unit, load and store instructions take two cycles when accessing data in the internal (on chip) storage, for external memory the number of cycles is variable. The core can continue executing instructions that do not depend on the result of the load instruction.

d) User defined unit. The interface to the User instruction is described in [USER Instruction interface](#). All user instructions are assumed to write back to the register file, the pipeline will continue to execute instructions that do not depend on the result of the user instruction.

The pipeline will generally issue and retire one instruction per cycle; the order of retiring an instruction may not be same as the order of issue. In case a multi-cycle instruction is executing in the MULT/SHIFT, LOAD/STORE or User Defined Unit, instructions following it may proceed to completion if they do not depend on the result of the multi cycle instruction. If a dependent instruction is detected; i.e. an instruction that requires
the result of an incomplete multi-cycle instruction; the pipeline is stalled and no other instructions are allowed to proceed till the dependency has been resolved.

More than one multi cycle instruction may be executing simultaneously, e.g. a load instruction may be issued while a multiply /shift operation is in progress. More than one multi cycle instruction may complete in the same cycle; since the register file has only one write back port the pipeline is stalled till all the values are written back into the register file.

Branch Instructions take 2 cycles for a taken branch, 3 cycles for a not-taken branch. These latencies are when the code is resident in cache; if a cache miss occurs add the number of cycles taken to fetch the instruction from external memory into cache.

Multiply/Shift.
Multiply and Shift instructions are executed by the same unit. This unit has two configurations, a) for size and for b) for performance.

a) In the size optimized configuration multiply is performed 32x2 bits per clock cycle, it can take up to 16 cycles to complete. The shift operation is performed at the rate of one bit per clock cycle.
b) In the performance optimized configuration, the multiplication is performed with a 32x32 bit multiplier and takes 2 cycles to complete. The shifter is capable of shifting 8 bits per cycle; this implies that a shift operation can take between 2 to 5 cycles depending on the amount of shift. Shift amounts 0-7 completes in 2 cycles, 8-15 completes in 3 cycles, 16-23 takes 4 cycles, and shift amounts greater than 23 will take 5 cycles.

**Cache operation.**

MANIK has separate data & instruction caches. The data cache uses a write through policy. The cache sizes can be configured by changing the address line width. Each cache line can be configured to have 1,2,4 or 8 entries. The cache line size determines the number of 32 bit read requests a cache refill operation will perform. Increasing the cache line size will increase the size of the processor and can have adverse effect on the maximum frequency.
Both the data & instruction cache can be configured to have 1, 2 or 4 sets. The cache size is doubled or quadrupled when sets sizes of 2 or 4 chosen.

Although the instruction and data caches are separate, the MANIK core has one Wishbone master bus. In case of a simultaneous cache miss on the data and the instruction caches, the instruction cache is given priority over the data cache. Every store operation will write data into both the cache and the external memory.

The memory space is divided into two regions cacheable and I/O space. The size of the regions are configurable.
By default both the **Data** and **Instruction** cache are enabled. The data cache can be bypassed by setting the BD flag (bit position 10) in the PSW. The contents of the data cache are **not** preserved when the data cache bypass is activated.

The Instruction cache cannot be bypassed, specific instruction cache lines can be invalidated. This is done by setting the II Flag (bit position 11) in the PSW and performing a load or store operation; the instruction cache line corresponding to the address of the load or store will be invalidated. This facility can be used to download programs into memory as well as set break points, or any other situation that requires modifying code.

Fetching data or instructions from I/O space will bypass the corresponding cache.

### Interrupt handling

**MANIK** has five interrupt vectors. The upper 24 Bits of the interrupt vector address is obtained from the **IBASE** (Special Function Register). The lower 8 bits can be configured during system generation the default values are listed below. By writing to the **IBASE** register the software could change the location of the interrupt vectors to a new location (useful during operating system boot process). The operation to complete a write to **ibase** register takes 2 cycles. The configuration value for the **ibase** can be set by the **INTR_VECBASE**.

<table>
<thead>
<tr>
<th>Vector Address</th>
<th>Configuration Value</th>
<th>Description</th>
<th>PSW Bits affected</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>-</td>
<td>Reset Vector</td>
<td>None</td>
</tr>
<tr>
<td>4</td>
<td>INTR_SWIVEC</td>
<td>Software/Single step instruction.</td>
<td>SS flag, SW flag, IP flag</td>
</tr>
<tr>
<td>8</td>
<td>INTR_TMRVEC</td>
<td>Timer Interrupt</td>
<td>TI flag, IP flag</td>
</tr>
<tr>
<td>12</td>
<td>INTR_EXTVEC</td>
<td>External Interrupt</td>
<td>E10-E15 flags</td>
</tr>
<tr>
<td>16</td>
<td>INTR_BERVEC</td>
<td>Bus Error</td>
<td>IBER and DBER Flags</td>
</tr>
</tbody>
</table>

When the processor recognizes an interrupt and branches to the **Interrupt vector** address, the IP flag (PSW bit 3) is copied to the BIP (PSW bit 6) flag and the IP flag is set to ‘1’. When the processor executes a “jsfr ipc” instruction, the BIP flag (PSW bit 6) is copied to the IP flag, and the BIP flag is set to zero. The BIP & IP flags can be updated by writing to the corresponding bit positions in the PSW.

The processor will branch to the interrupt service routine within 4 cycles of the detecting an interrupt. If there were instructions pending in the USER, Multiply/Shift or Load/Store units, they will continue to execute, the processor will stall if the Interrupt service routine accesses any of the registers being defined by any of the pending instructions.

The Timer & External interrupts are not recognized if the global interrupt enable flag IE (PSW bit 5) is zero, or if the processor is already servicing an interrupt; IP flag (PSW bit 3) is set.
The interrupt handler software is responsible for saving and restoring the PSW and any other registers it might use (including the Return Address register, RA (R/W) SFR1). Stack space can be created to save the registers can be created by using the “addi” instruction (this instruction does not update the PSW).

Nested Timer or External interrupts can be handled, by saving the IPC (R/W) SFR2, register on the stack, and re-enabling the interrupts by setting the IP & BIP flags in the PSW to zero.

Reset/Power up
On Reset / Power Up the processor will branch to the address 0 and start executing code from that address. This is a non-maskable interrupt. After Power up the values in General and Special purpose registers are undefined. The software is response for assigning initial values to the registers. The cache values remain unchanged after a reset interrupt.

Software Interrupt
When processor executes a SWI instruction, it executes a branch to address 0x4. The address of the next instruction is put into IPC (R/W) SFR2, register and the SW flag (bit position 0 in the PSW) is set to ‘1’, the IP flag (bit position 3 in the PSW) is also set to ‘1’. It is recommended that the instruction “swint 0x0f” be used as a break point instruction. Bits 3:0 of the instruction are copied to bit positions 19:16 of the PSW.

The software interrupt (“swint”) is NOT disabled by the IP flag, i.e. if an interrupt handler executes an “swint” instruction the processor will branch to the corresponding interrupt vector, and the IPC (R/W) SFR2, register will be updated. The TI & EI0-EI5 will retain their value upon return from the software interrupt. This allows the user to set breakpoints in the interrupt handler, the breakpoint should NOT be set before the interrupt service routine has had a chance to save the registers including the special function registers on the stack. Setting a breakpoint or single stepping in the software interrupt handler will have unpredictable results.

Timer Interrupt
The processor will jump to timer interrupt vector at address 0x8 when a Timer Interrupt occurs; see section Timer for more details. The address of the next instruction to be executed is stored in IPC (R/W) SFR2, and flags TI (PSW bit 2) and IP (PSW bit 3) are set to ‘1’. Note that the global interrupt enable bit IE (PSW bit 5) must be set for the timer to generate interrupts.

External Interrupt
MANIK Core has six External interrupt lines, each of the interrupt lines can be individually disabled by setting the EI0-EI5 bits in the PSW (bit positions 20 through 25); They can be collectively enable or disabled global interrupt enable flag, IE (bit position 5 in the PSW).
All the interrupts are level triggered, when the EXTRN_int line goes high the processor will jump to address 0x0c. The address of the next instruction to be executed is stored in IPC (R/W) SFR2, and flags EI (bit position 1 in the PSW) and IP (bit position 3 in the PSW) are set to ‘1’, the External interrupt status flags ES0-ES5 (bit positions 26 through 31) are set depending on the interrupt being processed. The interrupts are prioritized, with EI0 as the highest priority and interrupt EI5 as the lowest priority.

**Bus Error Interrupt.**

The core will branch to the Bus Error interrupt vector if the WBM_ERR_I (Wishbone bus error) signal is asserted during an instruction fetch or a data load/store operation. This is a non-maskable interrupt. The PSW bit 15 (IBER flag) is set if the exception occurred during an instruction fetch; the DBER flag (bit position 14 in the PSW) is set if the interrupt was generated during a load/store operation. This interrupt is an *imprecise* exception, i.e. the IPC register *may not* point to the instruction following the instruction that caused the exception, the IPC will point to the general vicinity of the instruction that caused the exception. It is recommended that the core do a software restart after a BUS error is signaled.

**Hardware Debug aid**

*a) Hardware Single Stepping.*

This facility is provided to help software debugging. When the Single Step flag in the PSW (Bit position 10) is set, the processor will generate an interrupt for every instruction executed, after it executes a “jsfr ipc” instruction. The Single step flag in the PSW would normally be set in an interrupt handling routine. The flag has no effect on execution till the “jsfr ipc” instruction is executed, the core will generate an interrupt when the instruction at the addresses pointed to by ipc is executed. The processor will branch to the same address as Software Interrupt, bit positions 23:16 of the PSW are all set to ‘1’. The software interrupt handler can determine the cause by examining bit position 0 in the PSW, a ‘1’ in this field means it was invoked by a swint instruction, else it was entered due to a single step instruction. The IPC register will contain the address of the instruction that would have executed next. External and Timer interrupts are disabled during the Single Step operation.

*b) Hardware Breakpoints*

The configuration option HW_BPENB enables two hardware breakpoint registers. This allows the debugger to place breakpoints in the application without modifying the code space. This is useful for setting breakpoints in readonly program area such as flash memory. Hardware breakpoints can be set using the following steps.
a) Enable HW_BPENB configuration option
b) Set the address for the breakpoint in one or both of the two hardware breakpoint registers (BP0 and/or BP1)
c) Enable the corresponding bit(s) in the HWDBG register. BP0_ENB for BP0, or BP1_ENB for BP1.

When the RF stage PC matches one the BP registers (BP0 or BP1), and the corresponding enable bits are set in the HWDBG register, the processor core will branch to the address of Software Interrupt, bit positions 23:16 of the PSW are all set to ‘1’. The software interrupt handler can determine the cause of the interrupt by examining the BP_HIT (bit 6) in the HWDBG register.

Note the interrupt is generated before the instruction with the address match is executed. The IPC register will contain the address of the instruction.

c) Hardware Watchpoints

The configuration option HW_WPENB will enable two hardware watchpoints registers. The debugger can use these registers to be notified when a certain memory address is read(load) or written(store) to. The following steps are required to use this feature.

a) Enable HW_WPENB configuration option
b) Set the address(es) of the memory location in either or both of the watchpoint registers (WP0 and/or WP1).
c) Enable the corresponding bit(s) in the HWDBG register. WP0_ENB for WP0, or WP1_ENB for WP1.

The processor will compare the load/store address to the WP0 and/or WP1 register, if they match and the corresponding enable bit(s) (WP0_ENB and/or WP1_ENB) are set in the HWDBG register, the processor will branch to the address of Software Interrupt, bit positions 23:16 of the PSW are all set to ‘1’. The processor will also set bits WP0_HIT (if address in WP0 matched), and/or WP1_HIT (if address in WP0 matched). The software interrupt handler can use the WP0_HIT, WP1_HIT flags in the HWDBG registers to determine the cause of the interrupt.

Note that the interrupt is generated after the load/store instruction with the matching address is executed.

Timer.

The MANIK core includes a built in 32 bit timer. The Timer has two modes of operations.

a) Interrupt Mode. In this mode the Timer is started by enabling the TE flag (bit position 4 in the PSW) and by writing a value in the TIMER (R/W) SFR3 register. The timer is implemented as a down counter, the value written to the timer is copied to a register, this register is then decremented every clock (“coreclk”) cycle. A Timer interrupt is generated when the value of this register becomes zero. Note the IE flag (bit position 5 in the PSW) must be set for the timer to generate interrupts.
The TIMER (R/W) SFR3, can be updated either in normal operation mode (IP flag is zero) or when the core is processing a timer interrupt (IP flag is one and TI flag is one). Reading the Timer register returns the current value. The timer interrupt can occur in Power Down Mode.

b) **Counter Mode.** In this mode the Timer register can be used to count the number of cycles. To operate the **Timer** in this mode an initial value must be written to the **Timer** register, then a ‘1’ is written into the **TR** bit (position 12) in the PSW. The Timer will start the down counter, it will stop when the counter reaches zero. To determine the number of cycles executed the **Timer** register must be read and subtracted from the reload value. If the **TU** flag (Bit position 12) in the PSW is set to ‘1’ then there was an underflow, to get a more accurate cycle count the reload value should be increased. The largest reload value is 0xffffffff (4294967295 cycles).

**Power Down Mode.**

The processor can be put into a Power Down mode by setting the PD flag (bit position 7 in the PSW). In this mode the Timer & External interrupt are enabled but no instructions are fetched. On receiving an interrupt the processor will branch to the corresponding vector, IPC (R/W) SFR2 register will contain the address of the instruction following the “mtsfr PSW, Rn” instruction that caused the processor to go into Power Down mode. The following assembly language snippet illustrates the usage of Power Down Mode.

```assembly
.text
.global _start
_start:
j main
j null_isr
j timer_isr
j null_isr
main:
    ldrpc r0, _SP_START
    # timer expires after 16 cycles
    movi r1, 16
    mtsfr timer, r1
    # enable timer & interrupt
    movi r1, 0x30
    mfsfr r2, psw
    or r2, r1
    mtsfr psw, r2
stop:
    movi r2, -1
    andi r2, 0x80
    mtsfr r1, psw
    or r1, r2
    mtsfr psw, r1
    # Set Power Down Flag
    sj stop
    # will come here when timer_isr finishes
timer_isr:
    addi r0, -8
    str r1, 0(r0)
    str r2, 4(r0)
    # re enable timer interrupt
    mfsfr r1, psw
    movi r2, 0x10
    or r1, r2
    mtsfr psw, r1
    ldr r2, 4(r0)
    ldr r1, 0(r0)
```
null_isr:
    jsfr ipc

_addi r0,8

USER Instruction interface.

The User defined extensions to the microprocessor can take two 32-bit values as input and write back one 32-bit value to the register file. The contents of the registers Rd, and Rb used in the user instruction are given as two operands to the User extension, the output value from the user extension is written back to the Rd register. The current implementation of the pipeline allows for the core to continue execution of other independent instructions, however only one User instruction can be outstanding at any time. Eight signals are used to interface the User extension to the core of the microprocessor; these are described in the following table. All signals are synchronized on clock signal “coreclk”.

<table>
<thead>
<tr>
<th>Name</th>
<th>Width (Bits)</th>
<th>Direction</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>UINST_uiop</td>
<td>2</td>
<td>Out</td>
<td>Has values 0-3 corresponding to udi0-3</td>
</tr>
<tr>
<td>UINST_uinst</td>
<td>1</td>
<td>Out</td>
<td>1 indicates that UINST_uiop signal is valid. The operands may not be valid.</td>
</tr>
<tr>
<td>UINST_Nce</td>
<td>1</td>
<td>Out</td>
<td>The operand values are valid when ‘0’. The user extension should latch UINST_uiop and the operands when this signal is ‘0’ and UINST_uinst is ‘1’.</td>
</tr>
<tr>
<td>UINST_UIopA</td>
<td>32</td>
<td>Out</td>
<td>1st operand. Value of Rd</td>
</tr>
<tr>
<td>UINST_UIopB</td>
<td>32</td>
<td>Out</td>
<td>2nd operand. Value of Rb</td>
</tr>
<tr>
<td>UINST_ip</td>
<td>1</td>
<td>In</td>
<td>Should be set to ‘1’ when user core is busy. UINST_out is considered valid when this signal is low.</td>
</tr>
<tr>
<td>UINST_out</td>
<td>32</td>
<td>In</td>
<td>Value to be written back to Rd.</td>
</tr>
<tr>
<td>UINST_wbc</td>
<td>1</td>
<td>Out</td>
<td>UINST_out and UINST_ip signals should be held valid till this signal goes high. The core will set this signal to high when the value is written back to the register file.</td>
</tr>
</tbody>
</table>

The following VHDL snippet shows an example state machine which implements the USER interface.

```vhdl
ARCHITECTURE sample OF UDI_sample IS
BEGIN
    -- simulate simple USER instruction
    UDI_PROC : PROCESS (coreclk)
    BEGIN
        IF RISING_EDGE(coreclk) THEN
            CASE curr_udistate IS
                WHEN S0 =>
                    -- idle then latch operands when we should
                    IF UINST_uiop = "00" AND UINST_uinst = '1' AND UINST_Nce = '0' THEN
                        opa_latch <= UINST_uiopA;
                        opb_latch <= UINST_uiopB;
                        UINST_uiop <= '1';
            END CASE;
        END IF;
    END PROCESS UDI_PROC;
END ARCHITECTURE sample;
```
curr_udistate <= S1;
else
  curr_udistate <= S0;
  UINST_uip <= '0';
end if;

when S1 =>
  --- Do Processing WITH opa_latch & opb_latch
  ---
  ...
when SWRITE_BACK =>
  UINST_out <= opa_latch;
  UINST_uip <= '0';
  curr_udistate <= SLAST;
when SLAST =>
  if UINST_wbc = '1' then
    curr_udistate <= S0;
  else
    curr_udistate <= SLAST;
  end if;
  when others => null;
end case;
end if;
end process udi_proc;
MANIK Wishbone Bus Interface.

MANIK uses the open standard Wishbone Bus to interface with external memory and I/O devices. The Wishbone bus specifications can be found at http://www.opencores.org/projects.cgi/web/wishbone/wbspec_b3.pdf. MANIK’s bus interface signals are listed in the following table.

<table>
<thead>
<tr>
<th>Signal Name</th>
<th>In/Out</th>
<th>Width</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>WBM_DAT_I</td>
<td>In</td>
<td>32</td>
<td>Input data from the external device</td>
</tr>
<tr>
<td>WBM_ACK_I</td>
<td>In</td>
<td>1</td>
<td>Input data is valid</td>
</tr>
<tr>
<td>WBM_SEL_O</td>
<td>Out</td>
<td>32</td>
<td>Data output to the external device</td>
</tr>
<tr>
<td>WBM_DAT_O</td>
<td>Out</td>
<td>4</td>
<td>Byte lane select (see following table)</td>
</tr>
<tr>
<td>WBM_WB_O</td>
<td>Out</td>
<td>1</td>
<td>Write enable output</td>
</tr>
<tr>
<td>WBM_STB_O</td>
<td>Out</td>
<td>1</td>
<td>Write/Read Strobe High through entire cycle</td>
</tr>
<tr>
<td>WBM_LOCK_O</td>
<td>Out</td>
<td>1</td>
<td>Cycle in progress no arbitration allowed</td>
</tr>
<tr>
<td>WBM_ADDR_O</td>
<td>Out</td>
<td>32</td>
<td>Address</td>
</tr>
<tr>
<td>WBM_CTI_O</td>
<td>Out</td>
<td>3</td>
<td>Cycle Type</td>
</tr>
<tr>
<td>WBM_BTE_O</td>
<td>Out</td>
<td>2</td>
<td>Burst Type</td>
</tr>
</tbody>
</table>

In the current implementation the Bus transactions generated, does not use the CTI_O and the BTE_O signals. Future implementations will support variable cache line lengths and will use these signals.

The following table shows the values of WBM_SEL_O signal depending on the Address and the size of the load/store requested.

<table>
<thead>
<tr>
<th></th>
<th></th>
<th></th>
<th></th>
<th></th>
<th></th>
<th></th>
</tr>
</thead>
<tbody>
<tr>
<td>xx</td>
<td>Word</td>
<td>1111</td>
<td>Y</td>
<td>Y</td>
<td>Y</td>
<td>y</td>
</tr>
<tr>
<td>1x</td>
<td>Half</td>
<td>0011</td>
<td>N</td>
<td>N</td>
<td>Y</td>
<td>y</td>
</tr>
<tr>
<td>0x</td>
<td>Half</td>
<td>1100</td>
<td>Y</td>
<td>Y</td>
<td>N</td>
<td>N</td>
</tr>
<tr>
<td>0b</td>
<td>Byte</td>
<td>1000</td>
<td>Y</td>
<td>N</td>
<td>N</td>
<td>N</td>
</tr>
<tr>
<td>01</td>
<td>Byte</td>
<td>0100</td>
<td>Y</td>
<td>N</td>
<td>N</td>
<td>N</td>
</tr>
<tr>
<td>10</td>
<td>Byte</td>
<td>0010</td>
<td>N</td>
<td>Y</td>
<td>N</td>
<td>N</td>
</tr>
<tr>
<td>11</td>
<td>Byte</td>
<td>0001</td>
<td>N</td>
<td>N</td>
<td>Y</td>
<td>N</td>
</tr>
</tbody>
</table>

Application Binary Interface.

This section describes the ABI for the MANIK processor. It describes

- Function calling convention. Includes parameter passing and return values
- Layout of data in memory

Function calling convention

The ABI usage of registers are described in the section General Purpose Registers. The first four integral parameters are passed in registers R1 through R4, the rest of the parameters are passed on the stack. Types that are shorter than 32 bits (bytes & chars) are sign or zero extended to 32 bits when passed as parameters. When a structure is passed as
a parameter a copy of it passed to the called function, the copy of the structure may be passed partially in registers are partially on the stack.

Integral return values are all returned in register R1. Return values less the 64 bits in width are returned in the register pair R1-R2. The lower order word is returned in R1. For return values greater than 64 bits in width the caller must provide a pointer to the return area as the first parameter.

**Layout of data in memory**
MANIK is a BIG Endian machine. The layout of three data types supported shown in the following tables.

**Word Data Type (32 bits) (4 bytes)**

<table>
<thead>
<tr>
<th></th>
<th>N</th>
<th>N+1</th>
<th>N+2</th>
<th>N+3</th>
</tr>
</thead>
<tbody>
<tr>
<td>31</td>
<td></td>
<td></td>
<td></td>
<td>0</td>
</tr>
<tr>
<td>MSB</td>
<td></td>
<td></td>
<td></td>
<td>LSB</td>
</tr>
</tbody>
</table>

**Half Data Type (16 bits) (2 bytes)**

<table>
<thead>
<tr>
<th></th>
<th>N</th>
<th>N+1</th>
</tr>
</thead>
<tbody>
<tr>
<td>15</td>
<td></td>
<td>0</td>
</tr>
<tr>
<td>MSB</td>
<td></td>
<td>LSB</td>
</tr>
</tbody>
</table>

**Byte (8 bits)**

<table>
<thead>
<tr>
<th></th>
<th>N</th>
</tr>
</thead>
<tbody>
<tr>
<td>7</td>
<td>0</td>
</tr>
<tr>
<td>Msbit</td>
<td>Lsbit</td>
</tr>
</tbody>
</table>
Appendix – A . HDL Instantiation template.

The signal names prefixed with WBM_ are the Wishbone Bus Master interface. See section MANIK Bus Interface. The signals names prefixed with UINST_ are meant for the User Instruction interface. See section USER Instruction interface.

```vhdl
component manik2top
port (
    sysclk : in  std_logic;
    coreclk : out std_logic;
    EXTRN_int : in  std_logic_vector(NUM_INTRS-1 downto 0) := (others => '0');
    RESET_int : in  std_logic := '0';
    INTR_ack : out std_logic;
    WBM_DAT_I : in  std_logic_vector (WIDTH-1 downto 0) := (others => '0');
    WBM_ACK_I : in  std_logic := '0';
    WBM_ERR_I : in  std_logic := '0';
    WBM_DAT_O : out std_logic_vector (WIDTH-1 downto 0);
    WBM_SEL_O : out std_logic_vector (3 downto 0);
    WBM_WE_O : out std_logic;
    WBM_STB_O : out std_logic;
    WBM_CYC_O : out std_logic;
    WBM_LOCK_O : out std_logic;
    WBM_ADR_O : out std_logic_vector (ADDR_WIDTH-1 downto 0) := (others => '0');
    WBM_CTI_O : out std_logic_vector (2 downto 0);
    WBM_BTE_O : out std_logic_vector (1 downto 0);
    UINST_uiop : out std_logic_vector(UINST_WIDTH-1 downto 0);
    UINST_uiopB : out std_logic_vector(UINST_WIDTH-1 downto 0);
    UINST_uip : in  std_logic := '0';
    UINST_out : in  std_logic_vector(UINST_WIDTH-1 downto 0) := (others => '0'));
end component;
```

The “sysclk” input signal is the primary input clock to the core. The output signal “coreclk” is the same as “sysclk” in this implementation.

External interrupts should be connected to the “EXTRN_int” signal, these are level triggered signals; see section External Interrupt for more details.
Appendix – B. Alphabetic list of Instructions.

**ADD** **Add , update Carry**

**Operation:**
\[ \text{Rd} = \text{Rd} + \text{Rb}; \ \text{CY} = \text{Carryout} \]

**Assembler Syntax:** `add Rd,Rb`

**Description:**
Add the contents of Register D and the contents of Register B, and put the result in Register D.

**PSW Flags updated:** CY = Carryout;

**Instruction Format:**
```
0 0 1 0 0 0 0 1 Rd Rd Rd Rd Rb Rb Rb Rb
```

**ADDC** **Add with Carry, update Carry**

**Operation:**
\[ \text{Rd} = \text{Rd} + \text{Rb} + \text{CY}; \ \text{CY} = \text{Carryout} \]

**Assembler Syntax:** `addc Rd,Rb`

**Description:**
Add the contents of Register B, the CY bit (in the PSW), and the contents of Register D; and store the result in Register D.

**PSW Flags updated:** CY = Carryout.

**Instruction Format:**
```
0 0 1 0 0 0 1 1 Rd Rd Rd Rd Rb Rb Rb Rb
```

**ADDF** **Conditional Add if False**

**Operation:**
\[ \text{If (TF=0) then Rd} = \text{Rd} + \text{Rb}; \]

**Assembler Syntax:** `addf Rd,Rb`

**Description:**
Add the contents of Register Rd and Rb, and store the result in Rd; perform the operation only if TF = 0.
**ADDI** Add with immediate

**Operation:**
\[ R_d = R_d + \text{sign extended}(S\text{imm}8) \]

**Assembler Syntax:** `addi R_d, S\text{imm}8`

**Description:**
Sign extend 8 bit immediate value to 32 bits and add it to the contents of register Rd, store the result in register Rd. Immediate value can range between –127 to +127.

**PSW Flags updated:** None.

---

**ADDIF Conditional Add with immediate, if False**

**Operation:**
If (TF=0) then
\[ R_d = R_d + \text{sign extended}(S\text{imm}4) \]

**Assembler Syntax:** `addif R_d, S\text{imm}4`

**Description:**
Sign extend 4 bit immediate to 32 bits and add to the contents of Register D, store the result into Register D, if the TF flag in the PSW is equal to 0.

**PSW Flags updated:** None.

---

**ADDIT Conditional Add with immediate, if True**

**Operation:**
If (TF=1) then
\[ R_d = R_d + \text{sign extended}(S\text{imm}4) \]

**Assembler Syntax:** `addit R_d, S\text{imm}4`

---

www.niktech.com
**Description:**
Sign extend 4 bit immediate to 32 bits and add to the contents of Register D, store the result into Register D, if the TF flag in the PSW is equal to 1.

**PSW Flags updated:** None.

**Instruction Format:**

```
<table>
<thead>
<tr>
<th>15</th>
<th>14</th>
<th>13</th>
<th>12</th>
<th>11</th>
<th>10</th>
<th>9</th>
<th>8</th>
<th>7</th>
<th>6</th>
<th>5</th>
<th>4</th>
<th>3</th>
<th>2</th>
<th>1</th>
<th>0</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>1</td>
<td>1</td>
<td>1</td>
<td>0</td>
<td>Rd</td>
<td>Rd</td>
<td>Rd</td>
<td>Rd</td>
<td>S</td>
<td>S</td>
<td>S</td>
<td>S</td>
</tr>
</tbody>
</table>
```

**ADDT**  
*Conditional Add, if True*

**Operation:**

\[
\text{If } (TF=1) \text{ then } \\
Rd = Rd + Rb;
\]

**Assembler Syntax:** `addt Rd,Rb`

**Description:**
Add the contents of register D, and contents of Register B and put the result into Register D, if the TF flag in the PSW = 1.

**PSW Flags updated:** None.

**Instruction Format:**

```
<table>
<thead>
<tr>
<th>15</th>
<th>14</th>
<th>13</th>
<th>12</th>
<th>11</th>
<th>10</th>
<th>9</th>
<th>8</th>
<th>7</th>
<th>6</th>
<th>5</th>
<th>4</th>
<th>3</th>
<th>2</th>
<th>1</th>
<th>0</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>1</td>
<td>1</td>
<td>1</td>
<td>1</td>
<td>Rd</td>
<td>Rd</td>
<td>Rd</td>
<td>Rd</td>
<td>Rb</td>
<td>Rb</td>
<td>Rb</td>
<td>Rb</td>
</tr>
</tbody>
</table>
```

**AND**  
*Logical Bitwise And*

**Operation:**

\[
Rd = Rd \& Rb
\]

**Assembler Syntax:** `and Rd,Rb`

**Description:**
Logically AND the contents of Register D with the contents of Register B, and store the results in Register D.

**PSW Flags updated:** None.

**Instruction Format:**

```
<table>
<thead>
<tr>
<th>15</th>
<th>14</th>
<th>13</th>
<th>12</th>
<th>11</th>
<th>10</th>
<th>9</th>
<th>8</th>
<th>7</th>
<th>6</th>
<th>5</th>
<th>4</th>
<th>3</th>
<th>2</th>
<th>1</th>
<th>0</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>0</td>
<td>1</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>1</td>
<td>0</td>
<td>1</td>
<td>Rd</td>
<td>Rd</td>
<td>Rd</td>
<td>Rd</td>
<td>Rb</td>
<td>Rb</td>
<td>Rb</td>
</tr>
</tbody>
</table>
```
**ANDI**  
*Logical And with immediate*

**Operation:**
\[ \text{Rd} = \text{Rd} \& \text{zero extend(UIMM8)}; \]

**Assembler Syntax:** `andi Rd,Uimm8`

**Description:**
Zero extend the 8 bit immediate to 32 bits and perform a logical AND with the contents of Register D, the result is store into Register D. The immediate must be a positive integer between 0 and 255.

**PSW Flags updated:** None.

**Instruction Format:**

<table>
<thead>
<tr>
<th>15</th>
<th>14</th>
<th>13</th>
<th>12</th>
<th>11</th>
<th>10</th>
<th>9</th>
<th>8</th>
<th>7</th>
<th>6</th>
<th>5</th>
<th>4</th>
<th>3</th>
<th>2</th>
<th>1</th>
<th>0</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>1</td>
<td>0</td>
<td>1</td>
<td>U</td>
<td>U</td>
<td>U</td>
<td>U</td>
<td>Rd</td>
<td>Rd</td>
<td>Rd</td>
<td>Rd</td>
<td>U</td>
<td>U</td>
<td>U</td>
<td>U</td>
</tr>
</tbody>
</table>

**ASR**  
*Arithmetic Shift Right (Dynamic)*

**Operation:**
\[ \text{Rd} = (\text{signed}) \text{Rd} >> \text{Rb}[5:0] \]

**Assembler Syntax:** `asr Rd,Rb`

**Description:**
Arithmetic shift right the contents of Rd by Rb[5:0] bits; store the result into register D. If Rb[5:0] is greater than 31 then the result is either 0, or –1 depending on the initial value of bit 31 of register D. Bits 31 through 6 of Register B are ignored.

**PSW Flags updated:** None.

**Instruction Format:**

<table>
<thead>
<tr>
<th>15</th>
<th>14</th>
<th>13</th>
<th>12</th>
<th>11</th>
<th>10</th>
<th>9</th>
<th>8</th>
<th>7</th>
<th>6</th>
<th>5</th>
<th>4</th>
<th>3</th>
<th>2</th>
<th>1</th>
<th>0</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>0</td>
<td>1</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>1</td>
<td>1</td>
<td>Rd</td>
<td>Rd</td>
<td>Rd</td>
<td>Rd</td>
<td>U</td>
<td>U</td>
<td>U</td>
<td>U</td>
</tr>
</tbody>
</table>

**ASRI**  
*Arithmetic with immediate (Static)*

**Operation:**
\[ \text{Rd} = \text{Rd} >> (\text{Uimm4}); \]

**Assembler Syntax:** `asri Rd,Uimm`

**Description:**
Arithmetic shift right the contents of Register D by uimm4 bits, and store the result in Register D. Uimm4 must be a positive integer between 0 and 15.
PSW Flags updated: None.

**Instruction Format:**

```
<table>
<thead>
<tr>
<th>15</th>
<th>14</th>
<th>13</th>
<th>12</th>
<th>11</th>
<th>10</th>
<th>9</th>
<th>8</th>
<th>7</th>
<th>6</th>
<th>5</th>
<th>4</th>
<th>3</th>
<th>2</th>
<th>1</th>
<th>0</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>1</td>
<td>1</td>
<td>Rd</td>
<td>Rd</td>
<td>Rd</td>
<td>Rd</td>
<td>U</td>
<td>U</td>
<td>U</td>
<td>U</td>
</tr>
</tbody>
</table>
```

**CMPEQ**  
**Compare for Equal**

**Operation:**

```
if Rd == Rb then
  TF = 1
else
  TF = 0
```

**Assembler Syntax:**  
`cmpeq Rd,Rb`

**Description:**

Compare the contents of Register D, with the contents of Register B; if they are equal set the TF in the PSW to 1 else set the flag to 0.

**PSW Flags updated:** TF flag.

**Instruction Format:**

```
<table>
<thead>
<tr>
<th>15</th>
<th>14</th>
<th>13</th>
<th>12</th>
<th>11</th>
<th>10</th>
<th>9</th>
<th>8</th>
<th>7</th>
<th>6</th>
<th>5</th>
<th>4</th>
<th>3</th>
<th>2</th>
<th>1</th>
<th>0</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>1</td>
<td>0</td>
<td>0</td>
<td>1</td>
<td>Rd</td>
<td>Rd</td>
<td>Rd</td>
<td>Rd</td>
<td>Rb</td>
<td>Rb</td>
<td>Rb</td>
<td>Rb</td>
</tr>
</tbody>
</table>
```

**CMPEQI**  
**Compare with Immediate for Equal**

**Operation:**

```
if Rd == sign extend(4 bit immediate) then
  TF = 1
else
  TF = 0
```

**Assembler Syntax:**  
`cmpeqi Rd,simm4`

**Description:**

Sign extend 4 bit immediate to 32 bits and compare with the contents of Register D, if they are equal set the TF flag in the PSW to 1, else set it to 0. The immediate value must be an integer between –8 and +7.

**PSW Flags updated:** TF flag.

**Instruction Format:**

```
<table>
<thead>
<tr>
<th>15</th>
<th>14</th>
<th>13</th>
<th>12</th>
<th>11</th>
<th>10</th>
<th>9</th>
<th>8</th>
<th>7</th>
<th>6</th>
<th>5</th>
<th>4</th>
<th>3</th>
<th>2</th>
<th>1</th>
<th>0</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>Rd</td>
<td>Rd</td>
<td>Rd</td>
<td>Rd</td>
<td>S</td>
<td>S</td>
<td>S</td>
<td>S</td>
</tr>
</tbody>
</table>
```
**CMPGT**  Compare for Greater Than (Signed)

*Operation:*

\[
\begin{align*}
\text{if } \text{Rd} & > \text{Rb} \text{ (Signed compare) then} \\
\text{TF} & = 1 \\
\text{Else} \\
\text{TF} & = 0
\end{align*}
\]

*Assembler Syntax: cmpgt Rd,Rb*

*Description:*

Set TF flag in the PSW to 1 if contents of Register D, is greater than contents of Register B else set TF flag to 0 ; the contents of both registers are treated as signed numbers.

*PSW Flags updated: TF flag.*

*Instruction Format:*

<table>
<thead>
<tr>
<th>15</th>
<th>14</th>
<th>13</th>
<th>12</th>
<th>11</th>
<th>10</th>
<th>9</th>
<th>8</th>
<th>7</th>
<th>6</th>
<th>5</th>
<th>4</th>
<th>3</th>
<th>2</th>
<th>1</th>
<th>0</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>0</td>
<td>1</td>
<td>0</td>
<td>1</td>
<td>Rd</td>
<td>Rd</td>
<td>Rd</td>
<td>Rd</td>
<td>Rd</td>
<td>S</td>
<td>S</td>
<td>S</td>
</tr>
</tbody>
</table>

**CMPGTI**  Compare with Immediate for Greater than (signed)

*Operation:*

\[
\begin{align*}
\text{If } \text{Rd} & > \text{Sign extended (simm4) then} \\
\text{TF} & = 1 \\
\text{Else} \\
\text{TF} & = 0
\end{align*}
\]

*Assembler Syntax: cmpgti Rd,simm4*

*Description:*

Set TF flag to 1 if the contents of Register D, is greater than sign extended 4 bit immediate value, else set TF flag to 0. The contents of Register D is treated as an unsigned number; the Simm4 must be an integer between –8 and +7.

*PSW Flags updated: TF flag.*

*Instruction Format:*

<table>
<thead>
<tr>
<th>15</th>
<th>14</th>
<th>13</th>
<th>12</th>
<th>11</th>
<th>10</th>
<th>9</th>
<th>8</th>
<th>7</th>
<th>6</th>
<th>5</th>
<th>4</th>
<th>3</th>
<th>2</th>
<th>1</th>
<th>0</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>1</td>
<td>Rd</td>
<td>Rd</td>
<td>Rd</td>
<td>Rd</td>
<td>Rd</td>
<td>S</td>
<td>S</td>
<td>S</td>
</tr>
</tbody>
</table>

**CMPS**  Compare for Higher or Same (Unsigned)

*Operation:*

\[
\begin{align*}
\text{If } \text{Rd} & \geq \text{Rb} \text{ then} \\
\text{TF} & = 1 \\
\text{Else} \\
\text{TF} & = 0
\end{align*}
\]

*Assembler Syntax: cmphs Rd,Rb*
**Description:**
Set TF flag to 1 if the contents of Register D is the equal to or greater than contents of Register B. The contents of both registers are treated as unsigned quantities.

**PSW Flags updated: TF Flag.**

**Instruction Format:**

<table>
<thead>
<tr>
<th>15</th>
<th>14</th>
<th>13</th>
<th>12</th>
<th>11</th>
<th>10</th>
<th>9</th>
<th>8</th>
<th>7</th>
<th>6</th>
<th>5</th>
<th>4</th>
<th>3</th>
<th>2</th>
<th>1</th>
<th>0</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>0</td>
<td>0</td>
<td>1</td>
<td>0</td>
<td>Rd</td>
<td>Rd</td>
<td>Rd</td>
<td>Rd</td>
<td>Rd</td>
<td>Rb</td>
<td>Rb</td>
<td>Rb</td>
</tr>
</tbody>
</table>

**CMPLS**  
**Compare for Less than or Same (Unsigned)**

**Operation:**

\[
\text{If } \text{Rd} \leq \text{Rb} \text{ then} \\
\text{TF} = 1 \\
\text{Else} \\
\text{TF} = 0
\]

**Assembler Syntax:**  
\text{cmpls} \hspace{1mm} \text{Rd,Rb}

**Description:**
Set TF flag to 1 if the contents of Register D is the equal to or less than contents of Register B. The contents of both registers are treated as unsigned quantities.

**PSW Flags updated: TF flag.**

**Instruction Format:**

<table>
<thead>
<tr>
<th>15</th>
<th>14</th>
<th>13</th>
<th>12</th>
<th>11</th>
<th>10</th>
<th>9</th>
<th>8</th>
<th>7</th>
<th>6</th>
<th>5</th>
<th>4</th>
<th>3</th>
<th>2</th>
<th>1</th>
<th>0</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>0</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>Rd</td>
<td>Rd</td>
<td>Rd</td>
<td>Rd</td>
<td>Rd</td>
<td>Rb</td>
<td>Rb</td>
<td>Rb</td>
</tr>
</tbody>
</table>

**CMPLT**  
**Compare for Less Than (Signed)**

**Operation:**

\[
\text{If } \text{Rd} < \text{Rb} \text{ then} \\
\text{TF} = 1; \\
\text{Else} \\
\text{TF} = 0;
\]

**Assembler Syntax:**  
\text{cmplt} \hspace{1mm} \text{Rd,Rb}

**Description:**
Set the TF flag to 1 if the contents of Register D is less than the contents to Register B, else set the TF flag to 0. The contents of both registers are treated as signed numbers.

**PSW Flags updated: TF flag.**

**Instruction Format:**

<table>
<thead>
<tr>
<th>15</th>
<th>14</th>
<th>13</th>
<th>12</th>
<th>11</th>
<th>10</th>
<th>9</th>
<th>8</th>
<th>7</th>
<th>6</th>
<th>5</th>
<th>4</th>
<th>3</th>
<th>2</th>
<th>1</th>
<th>0</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>0</td>
<td>0</td>
<td>1</td>
<td>0</td>
<td>Rd</td>
<td>Rd</td>
<td>Rd</td>
<td>Rd</td>
<td>Rd</td>
<td>Rb</td>
<td>Rb</td>
<td>Rb</td>
</tr>
</tbody>
</table>
**CMPLTI**  
*Compare with Immediate for Less than (Signed)*

**Operation:**

\[
\text{If } Rd < \text{Sign Extend(simm4)} \text{ then} \\
\quad \text{TF} = 1; \\
\text{Else} \\
\quad \text{TF} = 0;
\]

**Assembler Syntax:** `cmplti Rd, simm4`

**Description:**
Set TF flag to 1 if the contents of Registers D, is less than the sign extended 4 bit immediate value. The contents of Register D is treated as a signed number. The 4 bit immediate value must be an integer between \(-8\) and \(+7\).

**PSW Flags updated:** TF flag.

**Instruction Format:**

```
0 0 1 1 0 1 1 0 Rd Rd Rd Rd S S S S
```

**J**  
*Unconditional Branch (PC relative)*

**Operation:**

\[
\text{PC} = \text{PC} + (\text{Sign extended(Simm27)} << 1);
\]

**Assembler Syntax:** `j [Assembler Label]`

**Description:**
The PC is updated by adding its contents to a scaled sign extended 27 bit displacement field. The displacement represents the offset to the destination address in half words from the branch instruction. This gives the branch instruction a range of +/- 255 MB.

**PSW Flags updated:** None.

**Instruction Format:**

```
1 1 1 1 0 S S S S S S S S S S S S S S
```

**JF**  
*Conditional Branch if False*

**Operation:**

\[
\text{If } \text{TF} = 0 \text{ then} \\
\quad \text{PC} = \text{PC} + (\text{sign extend (simm27)}) << 1; \\
\text{Else} \\
\quad \text{PC} = \text{PC} + 4;
\]

**Assembler Syntax:** `jt <Assembler Label>`
Description:
If the TF flag in the PSW is zero then the PC is updated by adding its contents to scaled sign extended 27 bit displacement field; otherwise the PC is incremented by 4. The displacement represents the offset to the destination address in half words from the branch instruction. This gives the branch instruction a range of +/- 255 MB.

PSW Flags updated: None.

Instruction Format:

<table>
<thead>
<tr>
<th>31</th>
<th>30</th>
<th>29</th>
<th>28</th>
<th>27</th>
<th>26</th>
<th>25</th>
<th>24</th>
<th>23</th>
<th>22</th>
<th>21</th>
<th>20</th>
<th>19</th>
<th>18</th>
<th>17</th>
<th>16</th>
</tr>
</thead>
<tbody>
<tr>
<td>1</td>
<td>1</td>
<td>1</td>
<td>0</td>
<td>1</td>
<td>S</td>
<td>S</td>
<td>S</td>
<td>S</td>
<td>S</td>
<td>S</td>
<td>S</td>
<td>S</td>
<td>S</td>
<td>S</td>
<td>S</td>
</tr>
</tbody>
</table>

<table>
<thead>
<tr>
<th>15</th>
<th>14</th>
<th>13</th>
<th>12</th>
<th>11</th>
<th>10</th>
<th>9</th>
<th>8</th>
<th>7</th>
<th>6</th>
<th>5</th>
<th>4</th>
<th>3</th>
<th>2</th>
<th>1</th>
<th>0</th>
</tr>
</thead>
<tbody>
<tr>
<td>S</td>
<td>S</td>
<td>S</td>
<td>S</td>
<td>S</td>
<td>S</td>
<td>S</td>
<td>S</td>
<td>S</td>
<td>S</td>
<td>S</td>
<td>S</td>
<td>S</td>
<td>S</td>
<td>S</td>
<td>S</td>
</tr>
</tbody>
</table>

JL  
Branch to subroutine; Update RA

Operation:
RA = PC + 4;
PC = PC + ((sign extend (simm27)) << 1;

Assembler Syntax:  jl   <Assembler Label>

Description:
Jump and Link ; The return address (PC+4) is saved in the Special Function Register 1 (RA). The PC is updated by adding its contents to scaled sign extended 27 bit displacement field. The displacement represents the offset to the destination address in half words from the branch instruction. This gives the branch instruction a range of +/- 255 MB.

PSW Flags updated: None.

Instruction Format:

<table>
<thead>
<tr>
<th>31</th>
<th>30</th>
<th>29</th>
<th>28</th>
<th>27</th>
<th>26</th>
<th>25</th>
<th>24</th>
<th>23</th>
<th>22</th>
<th>21</th>
<th>20</th>
<th>19</th>
<th>18</th>
<th>17</th>
<th>16</th>
</tr>
</thead>
<tbody>
<tr>
<td>1</td>
<td>1</td>
<td>1</td>
<td>1</td>
<td>1</td>
<td>S</td>
<td>S</td>
<td>S</td>
<td>S</td>
<td>S</td>
<td>S</td>
<td>S</td>
<td>S</td>
<td>S</td>
<td>S</td>
<td>S</td>
</tr>
</tbody>
</table>

| 15 | 14 | 13 | 12 | 11 | 10 | 9  | 8  | 7  | 6  | 5  | 4  | 3  | 2  | 1  | 0  |
|----|----|----|----|----|----|----|----|----|----|----|----|----|----|----|----|----|
| S  | S  | S  | S  | S  | S  | S  | S  | S  | S  | S  | S  | S  | S  | S  | S  |

JR  
Branch Register Indirect

Operation:
PC = (Rb) & 0xFFFFFFFF;

Assembler Syntax:  jr   Rb

Description:
The PC is updated with the contents of Register B. The lowest order bit of Register B is ignored.
PSW Flags updated: None.

Instruction Format:

```
<table>
<thead>
<tr>
<th>15</th>
<th>14</th>
<th>13</th>
<th>12</th>
<th>11</th>
<th>10</th>
<th>9</th>
<th>8</th>
<th>7</th>
<th>6</th>
<th>5</th>
<th>4</th>
<th>3</th>
<th>2</th>
<th>1</th>
<th>0</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>0</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>1</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>Rb</td>
<td>Rb</td>
<td>Rb</td>
<td>Rb</td>
<td></td>
</tr>
</tbody>
</table>
```

### JRL  
**Branch to Subroutine; Register Indirect; Update RA**

*Operation:*  
$$RA = PC + 4;$$  
$$PC = (Rb) & 0xFFFFFFFE;$$

*Assembler Syntax:*  
`jrl Rb`

*Description:*  
The return address address (PC+2) is saved in the Special Function Register 1 (RA). The PC is updated with the contents of Register B, the lowest order bit of the register is ignored. The assembler inserts a NOP instruction after all “jrl” instruction to compensate for the size difference between the size of a “jl” and “jrl” instruction.

PSW Flags updated: None.

Instruction Format:

```
<table>
<thead>
<tr>
<th>15</th>
<th>14</th>
<th>13</th>
<th>12</th>
<th>11</th>
<th>10</th>
<th>9</th>
<th>8</th>
<th>7</th>
<th>6</th>
<th>5</th>
<th>4</th>
<th>3</th>
<th>2</th>
<th>1</th>
<th>0</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>0</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>1</td>
<td>0</td>
<td>1</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>Rb</td>
<td>Rb</td>
<td>Rb</td>
<td>Rb</td>
</tr>
</tbody>
</table>
```

### JSFR  
**Branch Special Function Register Indirect**

*Operation:*  
$$PC = (SFRn) & 0xFFFFFFFE;$$  
If SFRn == SFR2 (IPC) then  
$$IP_flag = 0 ;$$

*Assembler Syntax:*  
`jsfr SFRn`

*Description:*  
The PC is updated with the contents of Special Function Register (n); the least significant bit of the SFRn is ignored. This instruction is typically used to return from leaf functions; or from return from interrupt routines. If the SFR number is 2 (i.e. Interrupt PC) then the ip_flag in the PSW is also cleared.

PSW Flags updated: None.

Instruction Format:

```
<table>
<thead>
<tr>
<th>15</th>
<th>14</th>
<th>13</th>
<th>12</th>
<th>11</th>
<th>10</th>
<th>9</th>
<th>8</th>
<th>7</th>
<th>6</th>
<th>5</th>
<th>4</th>
<th>3</th>
<th>2</th>
<th>1</th>
<th>0</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>0</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>1</td>
<td>1</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>Sfrn</td>
<td>Sfrn</td>
<td></td>
</tr>
</tbody>
</table>
```

www.niktech.com
**JT**  
**Conditional Branch if True**

**Operation:**

If TF = 1 then  
PC = PC + (sign extend (simm27)) << 1;  
Else  
PC = PC + 4;

**Assembler Syntax:**  
`jt <Assembler Label>`

**Description:**

If the TF flag in the PSW is set then the PC is updated by adding its contents to scaled sign extended 27 bit displacement field; otherwise the PC is incremented by 4. The displacement represents the offset to the destination address in half words from the branch instruction. This gives the branch instruction a range of +/- 255 MB.

**PSW Flags updated:** None.

**Instruction Format:**

<table>
<thead>
<tr>
<th>31</th>
<th>30</th>
<th>29</th>
<th>28</th>
<th>27</th>
<th>26</th>
<th>25</th>
<th>24</th>
<th>23</th>
<th>22</th>
<th>21</th>
<th>20</th>
<th>19</th>
<th>18</th>
<th>17</th>
<th>16</th>
</tr>
</thead>
<tbody>
<tr>
<td>1</td>
<td>1</td>
<td>1</td>
<td>0</td>
<td>1</td>
<td>S</td>
<td>S</td>
<td>S</td>
<td>S</td>
<td>S</td>
<td>S</td>
<td>S</td>
<td>S</td>
<td>S</td>
<td>S</td>
<td>S</td>
</tr>
</tbody>
</table>

**LDR[BH]**  
**Load Register from memory**

**Operation:**

Rd = MEM[Rb + unsigned IMM4 << {0,1,2}]

**Assembler Syntax:**

- `ldr  Rd, Uimm4(Rb)`  
- `ldrb Rd, Uimm4(Rb)`  
- `ldrh Rd, Uimm4(Rb)`

**Description:**

The effective address of the load operation is computed by adding the scaled unsigned immediate value to the contents of Register B. The scaling is performed by left shifting the unsigned immediate 4 bit value by the size of the load operation. The effective address is aligned to the size boundary of the load operation (i.e. for word loads the lowest two bits are forced to zero; for half word loads the lowest order bit is forced to zero). For load sizes less than word the Destination register (Rd) is zero extended.

**PSW Flags updated:** None.

**Instruction Format:**

<table>
<thead>
<tr>
<th>15</th>
<th>14</th>
<th>13</th>
<th>12</th>
<th>11</th>
<th>10</th>
<th>9</th>
<th>8</th>
<th>7</th>
<th>6</th>
<th>5</th>
<th>4</th>
<th>3</th>
<th>2</th>
<th>1</th>
<th>0</th>
</tr>
</thead>
<tbody>
<tr>
<td>1</td>
<td>Sz</td>
<td>Sz</td>
<td>0</td>
<td>U</td>
<td>U</td>
<td>U</td>
<td>U</td>
<td>Rd</td>
<td>Rd</td>
<td>Rd</td>
<td>Rd</td>
<td>Rd</td>
<td>Rd</td>
<td>Rd</td>
<td>Rd</td>
</tr>
</tbody>
</table>

\[Sz = \begin{array}{l} 00 \rightarrow \text{Word} \quad \text{Unsigned IMM4} \ll 2; \\ 01 \rightarrow \text{Half Word} \quad \text{Unsigned IMM4} \ll 1; \end{array}\]
**LDRPC**  Load Word(32bits) from literal pool (PC relative)

**Operation:**

\[ \text{Rd} = \text{MEM}[(\text{PC} + \text{UIMM8} \ll 2 \& 0x\text{FFFFFFFC})] \]

**Assembler Syntax:**

<table>
<thead>
<tr>
<th>ldrpc Rd,&lt;assembler Label&gt;</th>
</tr>
</thead>
<tbody>
<tr>
<td>ldrpc Rd,&lt;Immediate value&gt;</td>
</tr>
</tbody>
</table>

**Description:**

The assembler creates a literal table entry containing the address of the label or the immediate value and sets the eight bit immediate value to point to the table entry. The effective address is computed by left shifting the 8 bit immediate by two and zero extending the result, this is added to the contents of PC and the lower order two bits are forced to zero.

**PSW Flags updated:** None.

**Instruction Format:**

<table>
<thead>
<tr>
<th>15</th>
<th>14</th>
<th>13</th>
<th>12</th>
<th>11</th>
<th>10</th>
<th>9</th>
<th>8</th>
<th>7</th>
<th>6</th>
<th>5</th>
<th>4</th>
<th>3</th>
<th>2</th>
<th>1</th>
<th>0</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>1</td>
<td>0</td>
<td>0</td>
<td>U</td>
<td>U</td>
<td>U</td>
<td>U</td>
<td>Rd</td>
<td>Rd</td>
<td>Rd</td>
<td>Rd</td>
<td>U</td>
<td>U</td>
<td>U</td>
<td>U</td>
</tr>
</tbody>
</table>

**LSL**  Logical Shift Left (Dynamic)

**Operation:**

\[ \text{Rd} = \text{Rd} \ll \text{Rb}[5:0]; \]

**Assembler Syntax:**

<table>
<thead>
<tr>
<th>lsl Rd,Rb</th>
</tr>
</thead>
</table>

**Description:**

The contents of the Register D, is left shifted by Rb[5:0] bits, and the results are stored in Register B; if Rb[5:0] > 31 then Rd = 0.

**PSW Flags updated:** None.

**Instruction Format:**

<table>
<thead>
<tr>
<th>15</th>
<th>14</th>
<th>13</th>
<th>12</th>
<th>11</th>
<th>10</th>
<th>9</th>
<th>8</th>
<th>7</th>
<th>6</th>
<th>5</th>
<th>4</th>
<th>3</th>
<th>2</th>
<th>1</th>
<th>0</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>0</td>
<td>1</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>0</td>
<td>0</td>
<td>Rd</td>
<td>Rd</td>
<td>Rd</td>
<td>Rd</td>
<td>Rb</td>
<td>Rb</td>
<td>Rb</td>
<td>Rb</td>
</tr>
</tbody>
</table>

**LSLI**  Logical Shift Left with immediate (Static)

**Operation:**

\[ \text{Rd} = \text{Rd} \ll \text{Uimm4}; \]

**Assembler Syntax:**

<table>
<thead>
<tr>
<th>lsl Rd, Uimm4</th>
</tr>
</thead>
</table>

**Description:**

The contents of Register D is left shifted by Uimm4 bits and the results are stored back into Register D. The Uimm4 value must be an integer between 0 and 15.
PSW Flags updated: None.

Instruction Format:

<table>
<thead>
<tr>
<th>15</th>
<th>14</th>
<th>13</th>
<th>12</th>
<th>11</th>
<th>10</th>
<th>9</th>
<th>8</th>
<th>7</th>
<th>6</th>
<th>5</th>
<th>4</th>
<th>3</th>
<th>2</th>
<th>1</th>
<th>0</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>0</td>
<td>0</td>
<td>Rd</td>
<td>Rd</td>
<td>Rd</td>
<td>Rd</td>
<td>U</td>
<td>U</td>
<td>U</td>
<td>U</td>
</tr>
</tbody>
</table>

**LSR**  Logical Shift Right (Dynamic)

Operation: 

```
Rd = Rd >> Rb[5:0];
```

Assembler Syntax:  

```
lsr Rd, Rb
```

Description: 

The contents of Register D is right shifted by Rb[5:0] bits; the result is stored back into Register D. If Rb[5:0] is greater than 31 then Rd = 0;

PSW Flags updated: None.

Instruction Format:

<table>
<thead>
<tr>
<th>15</th>
<th>14</th>
<th>13</th>
<th>12</th>
<th>11</th>
<th>10</th>
<th>9</th>
<th>8</th>
<th>7</th>
<th>6</th>
<th>5</th>
<th>4</th>
<th>3</th>
<th>2</th>
<th>1</th>
<th>0</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>0</td>
<td>1</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>0</td>
<td>1</td>
<td>Rd</td>
<td>Rd</td>
<td>Rd</td>
<td>Rd</td>
<td>Rb</td>
<td>Rb</td>
<td>Rb</td>
<td>Rb</td>
</tr>
</tbody>
</table>

**LSRI**  Logical Shift Right with immediate (Static)

Operation: 

```
Rd = Rd >> UIMM4;
```

Assembler Syntax:  

```
lsri Rd, Uimm4
```

Description: 

The contents of Register D is right shifted by Uimm4 bits and the results are stored back into Register D. The Uimm4 value must be an integer between 0 and 15.

PSW Flags updated: None.

Instruction Format:

<table>
<thead>
<tr>
<th>15</th>
<th>14</th>
<th>13</th>
<th>12</th>
<th>11</th>
<th>10</th>
<th>9</th>
<th>8</th>
<th>7</th>
<th>6</th>
<th>5</th>
<th>4</th>
<th>3</th>
<th>2</th>
<th>1</th>
<th>0</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>0</td>
<td>1</td>
<td>Rd</td>
<td>Rd</td>
<td>Rd</td>
<td>Rd</td>
<td>U</td>
<td>U</td>
<td>U</td>
<td>U</td>
</tr>
</tbody>
</table>

**MFSFR**  Move From SFR to GPR

Operation: 

```
Rd = SFRn;
```

Assembler Syntax:  

```
mfsfr Rd, SFRn
```

www.niktech.com
**Description:**
The contents of the Special Function Register n is stored into the Register D.

*PSW Flags updated: None.*

**Instruction Format:**

```
0 0 0 1 0 0 1 0 Rd Rd Rd Rd 0 0 SFR SFR
SFR 00 SFR0 PSW
01 SFR1 RA
10 SFR2 IPC
11 SFR3 TIMER
```

**MOV**  
*Unconditional move from GPR to GPR*

**Operation:**

\[
Rd = Rb;
\]

**Assembler Syntax:**  
\**mov Rd, Rb**

**Description:**
The contents of Register B is stored into Register D.

*PSW Flags updated: None.*

**Instruction Format:**

```
0 0 1 0 0 1 0 0 Rd Rd Rd Rd Rb Rb Rb Rb
```

**MOVF**  
*Conditional Move from GPR to GPR, if False*

**Operation:**

\[
\text{If } TF==0 \text{ then} \\
Rd = Rb;
\]

**Assembler Syntax:**  
\**movf Rd, Rb**

**Description:**
The contents of Register B are conditionally moved to Register D if TF flag is 0.

*PSW Flags updated: None.*

**Instruction Format:**

```
0 0 1 0 1 1 1 0 0 0 0 0 Rd Rd Rd Rd Rb Rb Rb Rb
```
**MOVI**  \_ Unconditional Move immediate to GPR

*Operation:*  \[ Rd = \text{sign extend}(\text{SIMM8}); \]

*Assembler Syntax:*  \text{movi~Rd, Simm8}\n
*Description:*  
The eight immediate value is sign extended and stored into Register D.

*PSW Flags updated:*  None.

*Instruction Format:*

<table>
<thead>
<tr>
<th>15</th>
<th>14</th>
<th>13</th>
<th>12</th>
<th>11</th>
<th>10</th>
<th>9</th>
<th>8</th>
<th>7</th>
<th>6</th>
<th>5</th>
<th>4</th>
<th>3</th>
<th>2</th>
<th>1</th>
<th>0</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>1</td>
<td>1</td>
<td>1</td>
<td>S</td>
<td>S</td>
<td>S</td>
<td>S</td>
<td>Rd</td>
<td>Rd</td>
<td>Rd</td>
<td>Rd</td>
<td>S</td>
<td>S</td>
<td>S</td>
<td>S</td>
</tr>
</tbody>
</table>

**MOVT**  \_ Conditional Move from GPR to GPR, if True

*Operation:*  
\[ \text{If TF == 1 then} \]
\[ Rd = Rb; \]

*Assembler Syntax:*  \text{movt~Rd, Rb}\n
*Description:*  
The contents of Register B are conditionally moved to Register D if TF flag is 1.

*PSW Flags updated:*  None.

*Instruction Format:*

<table>
<thead>
<tr>
<th>15</th>
<th>14</th>
<th>13</th>
<th>12</th>
<th>11</th>
<th>10</th>
<th>9</th>
<th>8</th>
<th>7</th>
<th>6</th>
<th>5</th>
<th>4</th>
<th>3</th>
<th>2</th>
<th>1</th>
<th>0</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>1</td>
<td>1</td>
<td>0</td>
<td>0</td>
<td>Rd</td>
<td>Rd</td>
<td>Rd</td>
<td>Rd</td>
<td>Rb</td>
<td>Rb</td>
<td>Rb</td>
<td>Rb</td>
</tr>
</tbody>
</table>

**MTSFR**  \_ Move from SFR to GPR

*Operation:*  \[ \text{SFRn} = \text{Rb}; \]

*Assembler Syntax:*  \text{mtsfr~SFRn, Rb}\n
*Description:*  
The value of SFRn is updated with the contents of Register B. Currently only PSW (SFR0), IPC (SFR2) and TIMER (SFR3) can be updated by software.

*PSW Flags updated:*  None.

*Instruction Format:*

<table>
<thead>
<tr>
<th>15</th>
<th>14</th>
<th>13</th>
<th>12</th>
<th>11</th>
<th>10</th>
<th>9</th>
<th>8</th>
<th>7</th>
<th>6</th>
<th>5</th>
<th>4</th>
<th>3</th>
<th>2</th>
<th>1</th>
<th>0</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>0</td>
<td>0</td>
<td>1</td>
<td>0</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>0</td>
<td>0</td>
<td>Sfr</td>
<td>Sfr</td>
<td>Rb</td>
<td>Rb</td>
<td>Rb</td>
<td>Rb</td>
</tr>
</tbody>
</table>
**MULT**  **Multiply**

**Operation:**
\[ \text{Rd} = \text{Rd} \times \text{Rb}; \]

**Assembler Syntax:**  \text{mult} \text{ Rd,Rb}

**Description:**
The contents of Register D and the contents of Register B are multiplied and the lower order 32 bits are stored in the Register D. The results are the same regardless of whether the source operands are considered signed or unsigned.

**PSW Flags updated:** None.

**Instruction Format:**

<table>
<thead>
<tr>
<th>15</th>
<th>14</th>
<th>13</th>
<th>12</th>
<th>11</th>
<th>10</th>
<th>9</th>
<th>8</th>
<th>7</th>
<th>6</th>
<th>5</th>
<th>4</th>
<th>3</th>
<th>2</th>
<th>1</th>
<th>0</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>1</td>
<td>1</td>
<td>0</td>
<td>Rd</td>
<td>Rd</td>
<td>Rd</td>
<td>Rd</td>
<td>Rb</td>
<td>Rb</td>
<td>Rb</td>
<td>Rb</td>
<td>Rb</td>
</tr>
</tbody>
</table>

**MULTI**  **Multiply with immediate**

**Operation:**
\[ \text{Rd} = \text{Rb} \times \text{Uimm4}; \]

**Assembler Syntax:**  \text{multi} \text{ Rd, Uimm4}

**Description:**
The contents of Register D is multiplied by the zero extended unsigned 4 bit immediate value and the results are stored back in Register D. The Uimm4 must be an integer in between 0 and 15.

**PSW Flags updated:** None.

**Instruction Format:**

<table>
<thead>
<tr>
<th>15</th>
<th>14</th>
<th>13</th>
<th>12</th>
<th>11</th>
<th>10</th>
<th>9</th>
<th>8</th>
<th>7</th>
<th>6</th>
<th>5</th>
<th>4</th>
<th>3</th>
<th>2</th>
<th>1</th>
<th>0</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>1</td>
<td>0</td>
<td>Rd</td>
<td>Rd</td>
<td>Rd</td>
<td>Rd</td>
<td>U</td>
<td>U</td>
<td>U</td>
<td>U</td>
</tr>
</tbody>
</table>

**OR**  **Logical OR**

**Operation:**
\[ \text{Rd} = \text{Rd} \mid \text{Rb}; \]

**Assembler Syntax:**  \text{or} \text{ Rd, Rb}

**Description:**
A bitwise logical “or” operation is performed with the contents of Register D & the contents of Register B; the results are stored back into Register D.

**PSW Flags updated:** None.
**Instruction Format:**

<table>
<thead>
<tr>
<th>15</th>
<th>14</th>
<th>13</th>
<th>12</th>
<th>11</th>
<th>10</th>
<th>9</th>
<th>8</th>
<th>7</th>
<th>6</th>
<th>5</th>
<th>4</th>
<th>3</th>
<th>2</th>
<th>1</th>
<th>0</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>0</td>
<td>1</td>
<td>0</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>0</td>
<td>Rd</td>
<td>Rd</td>
<td>Rd</td>
<td>Rd</td>
<td>Rb</td>
<td>Rb</td>
<td>Rb</td>
<td>Rb</td>
</tr>
</tbody>
</table>

**STR[BH]**  
**Store Word(32 bits) to memory**

**Operation:**

\[ \text{MEM}[\text{Rb} + (\text{Uimm4}) \ll \{0,1,2\}] = \text{Rd}; \]

**Assembler Syntax:**

- `str Rd, Uimm4(Rb)`
- `strh Rd, Uimm4(Rb)`
- `strb Rd, Uimm4(Rb)`

**Description:**

Store the contents of Register D into memory. The store operation can be performed in three sizes Word, Half word (h), or Byte (b). The effective address is computed by scaling the Uimm4 value (left shift by 2 for word, left shift by 1 for half word) and adding the result to the contents of Register B. The effective address is aligned to the size boundary of the store operation (i.e. for word loads the lowest two bits are forced to zero; for half word loads the lowest order bit is forced to zero).

**PSW Flags updated:** None.

**Instruction Format:**

<table>
<thead>
<tr>
<th>15</th>
<th>14</th>
<th>13</th>
<th>12</th>
<th>11</th>
<th>10</th>
<th>9</th>
<th>8</th>
<th>7</th>
<th>6</th>
<th>5</th>
<th>4</th>
<th>B</th>
<th>2</th>
<th>1</th>
<th>0</th>
</tr>
</thead>
<tbody>
<tr>
<td>1</td>
<td>S$</td>
<td>S$</td>
<td>1</td>
<td>U</td>
<td>U</td>
<td>U</td>
<td>U</td>
<td>Rd</td>
<td>Rd</td>
<td>Rd</td>
<td>Rd</td>
<td>Rb</td>
<td>Rb</td>
<td>Rb</td>
<td>Rb</td>
</tr>
</tbody>
</table>

- **SZ -> 10 Byte**
  - 00 Word  Uimm4 \ll 2
  - 01 Half word Uimm4 \ll 1

**SUB**  
**Subtract; Update carry**

**Operation:**

\[ \text{Rd} = \text{Rd} - \text{Rb}; \]

\[ \text{CY} = \text{carry out from the subtract}; \]

**Assembler Syntax:**

- `sub Rd, Rb`

**Description:**

The contents of Register B is subtracted from the contents of Register D and the result stored in Register D. The CY flag is updated with the carry out from this operation.

**PSW Flags updated:** CY flag carryout from the subtract.

**Instruction Format:**

<table>
<thead>
<tr>
<th>15</th>
<th>14</th>
<th>13</th>
<th>12</th>
<th>11</th>
<th>10</th>
<th>9</th>
<th>8</th>
<th>7</th>
<th>6</th>
<th>5</th>
<th>4</th>
<th>3</th>
<th>2</th>
<th>1</th>
<th>0</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>0</td>
<td>1</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>Rd</td>
<td>Rd</td>
<td>Rd</td>
<td>Rd</td>
<td>Rb</td>
<td>Rb</td>
<td>Rb</td>
<td>Rb</td>
</tr>
</tbody>
</table>

[www.niktech.com](http://www.niktech.com)
**SUBC**  **Subtract with carry; update carry**  
Operation:  
\[
Rd = Rd - Rb - C;  
\]
\[CY = \text{Carry out from the subtract} \]

Assembler Syntax:  \textit{subc}  \textit{Rd, Rb}

**Description:**  
The contents of Register B and the CY bit is subtracted from the contents of Register D and the result stored in Register D. The CY flag is updated with the carry out from this operation.

**PSW Flags updated:** CY flag carryout from the subtract.

**Instruction Format:**

```
<table>
<thead>
<tr>
<th>15</th>
<th>14</th>
<th>13</th>
<th>12</th>
<th>11</th>
<th>10</th>
<th>9</th>
<th>8</th>
<th>7</th>
<th>6</th>
<th>5</th>
<th>4</th>
<th>3</th>
<th>2</th>
<th>1</th>
<th>0</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>0</td>
<td>1</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>1</td>
<td>0</td>
<td>Rd</td>
<td>Rd</td>
<td>Rd</td>
<td>Rd</td>
<td>Rb</td>
<td>Rb</td>
<td>Rb</td>
<td>Rb</td>
</tr>
</tbody>
</table>
```

**SUBF**  **Conditional Subtract; if false**  
Operation:  
\[
\text{If TF == 0 then}  
Rd = Rd - Rb;  
\]

Assembler Syntax:  \textit{subf}  \textit{Rd, Rb}

**Description:**  
If the TF flag is zero subtract the contents of Register B from the contents of Register B and store the result in Register D. The operation is not performed if the TF is set.

**PSW Flags updated:** None.

**Instruction Format:**

```
<table>
<thead>
<tr>
<th>15</th>
<th>14</th>
<th>13</th>
<th>12</th>
<th>11</th>
<th>10</th>
<th>9</th>
<th>8</th>
<th>7</th>
<th>6</th>
<th>5</th>
<th>4</th>
<th>3</th>
<th>2</th>
<th>1</th>
<th>0</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>1</td>
<td>0</td>
<td>0</td>
<td>1</td>
<td>Rd</td>
<td>Rd</td>
<td>Rd</td>
<td>Rd</td>
<td>Rb</td>
<td>Rb</td>
<td>Rb</td>
<td>Rb</td>
</tr>
</tbody>
</table>
```

**SUBT**  **Conditional Subtract; if true**  
Operation:  
\[
\text{If TF == 1 then}  
Rd = Rd - Rb;  
\]

Assembler Syntax:  \textit{subt}  \textit{Rd, Rb}

**Description:**  
If the TF is set then subtract the contents of Register B from the contents of Register B and store the result in Register D. The operation is not performed if the TF is set.
**PSW Flags updated:** None.

**Instruction Format:**

```
  15 14 13 12 11 10  9  8  7  6  5  4  3  2  1  0
0 0 0 1 1 1 1 0 0 0 0 0 0 0 0 0 0
```

---

**SWINT Software Interrupt**

**Operation:**

- IPC = PC;
- PC = 0x8;
- PSW[23:16] = SW {swint code from instruction field};
- IP & SW flag = 1;

**Assembler Syntax:** `swint [0-255]`

**Description:**

Software interrupt; the address of the instruction following the `swint` instruction is captured in the IPC (SFR); IP & SW flags in the PSW are set to 1. The lowest order 8 bits from the instruction field are copied into the PSW bits [23:16].

**PSW Flags updated:** SW, IP flags & SWINT.

**Instruction Format:**

```
  15 14 13 12 11 10  9  8  7  6  5  4  3  2  1  0
0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0
```

---

**SXB Sign extend byte**

**Operation:**

- `Rd = sign extend(Rb[7:0])`

**Assembler Syntax:** `sxtb Rd, Rb`

**Description:**

Sign extend the lower order byte (eight bits), and put the result in Register D.

**PSW Flags updated:** None.

**Instruction Format:**

```
  15 14 13 12 11 10  9  8  7  6  5  4  3  2  1  0
0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0
```

---

**SXH Sign Extend half word**

**Operation:**

- `Rd = sign extend(Rb[15:0])`

---

www.niktech.com
Assembler Syntax:  \textit{sxh} \quad Rd, Rb

\textbf{Description:}
Sign extend the lower order half word of Register \( B \) (16 bits) and put the result in Register \( D \).

\textit{PSW Flags updated: None.}

\textbf{Instruction Format:}

\begin{verbatim}
<table>
<thead>
<tr>
<th>15</th>
<th>14</th>
<th>13</th>
<th>12</th>
<th>11</th>
<th>10</th>
<th>9</th>
<th>8</th>
<th>7</th>
<th>6</th>
<th>5</th>
<th>4</th>
<th>3</th>
<th>2</th>
<th>1</th>
<th>0</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>0</td>
<td>0</td>
<td>1</td>
<td>0</td>
<td>1</td>
<td>0</td>
<td>1</td>
<td>0</td>
<td>Rd</td>
<td>Rd</td>
<td>Rd</td>
<td>Rd</td>
<td>Rb</td>
<td>Rb</td>
<td>Rb</td>
</tr>
</tbody>
</table>
\end{verbatim}

\textbf{UDI[0-3] \quad User Defined instructions}

\textbf{Operation:}
\( Rd = \text{result of user defined operation}(Rd, Rb) \)

Assembler Syntax:  \textit{udi[0,1,2,3]} \quad Rd, Rb

\textbf{Description:}
For more details see section \textit{USER Instruction interface}.

\textit{PSW Flags updated: None.}

\textbf{Instruction Format:}

\begin{verbatim}
<table>
<thead>
<tr>
<th>15</th>
<th>14</th>
<th>13</th>
<th>12</th>
<th>11</th>
<th>10</th>
<th>9</th>
<th>8</th>
<th>7</th>
<th>6</th>
<th>5</th>
<th>4</th>
<th>3</th>
<th>2</th>
<th>1</th>
<th>0</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>1</td>
<td>0</td>
<td>FN</td>
<td>FN</td>
<td>Rd</td>
<td>Rd</td>
<td>Rd</td>
<td>Rd</td>
<td>Rb</td>
<td>Rb</td>
<td>Rb</td>
<td>Rb</td>
</tr>
</tbody>
</table>
\end{verbatim}

\textbf{XHW \quad Exchange half word}

\textbf{Operation:}
\( Rd = (Rb << 16) | ((Rb >> 16) \& 0x0000FFFF); \)

Assembler Syntax:  \textit{xhw} \quad Rd, Rb

\textbf{Description:}
The Upper half word Register \( B \) is placed in the Lower half word of Register \( D \); the lower half word of Register \( B \) is placed in the Upper half word of Register \( D \).

\textit{PSW Flags updated: None.}

\textbf{Instruction Format:}

\begin{verbatim}
<table>
<thead>
<tr>
<th>15</th>
<th>14</th>
<th>13</th>
<th>12</th>
<th>11</th>
<th>10</th>
<th>9</th>
<th>8</th>
<th>7</th>
<th>6</th>
<th>5</th>
<th>4</th>
<th>3</th>
<th>2</th>
<th>1</th>
<th>0</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>0</td>
<td>1</td>
<td>0</td>
<td>1</td>
<td>0</td>
<td>0</td>
<td>1</td>
<td>0</td>
<td>Rd</td>
<td>Rd</td>
<td>Rd</td>
<td>Rd</td>
<td>Rb</td>
<td>Rb</td>
<td>Rb</td>
</tr>
</tbody>
</table>
\end{verbatim}
**XOR**  Logical XOR

*Operation:*  \[ Rd = Rd \oplus Rb; \]

*Assembler Syntax:*  `xor Rd, Rb`

*Description:* The contents of Register B is bitwise exclusive or ed with the contents of Register D and the results put into Register D.

*PSW Flags updated:* None.

*Instruction Format:*

<table>
<thead>
<tr>
<th>15</th>
<th>14</th>
<th>13</th>
<th>12</th>
<th>11</th>
<th>10</th>
<th>9</th>
<th>8</th>
<th>7</th>
<th>6</th>
<th>5</th>
<th>4</th>
<th>3</th>
<th>2</th>
<th>1</th>
<th>0</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>0</td>
<td>1</td>
<td>0</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>1</td>
<td>Rd</td>
<td>Rd</td>
<td>Rd</td>
<td>Rd</td>
<td>Rb</td>
<td>Rb</td>
<td>Rb</td>
<td>Rb</td>
</tr>
</tbody>
</table>

**ZXH**  Zero Extend

*Operation:*  \[ Rd = \text{zero extend}(Rb[15:0]) \]

*Assembler Syntax:*  `zxh Rd, Rb`

*Description:* Zero extend the lower half word of Register B and put the result in Register D.

*PSW Flags updated:* None.

*Instruction Format:*

<table>
<thead>
<tr>
<th>15</th>
<th>14</th>
<th>13</th>
<th>12</th>
<th>11</th>
<th>10</th>
<th>9</th>
<th>8</th>
<th>7</th>
<th>6</th>
<th>5</th>
<th>4</th>
<th>3</th>
<th>2</th>
<th>1</th>
<th>0</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>0</td>
<td>1</td>
<td>0</td>
<td>1</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>Rd</td>
<td>Rd</td>
<td>Rd</td>
<td>Rd</td>
<td>Rb</td>
<td>Rb</td>
<td>Rb</td>
<td>Rb</td>
</tr>
</tbody>
</table>