Friday, March 28, 2008

DM6437 Architecture

The TMS320C64x+™ DSPs (including the TMS320DM6437 device) are the highest-performance fixed-point DSP generation in the TMS320C6000™ DSP platform. The DM6437 device is based on the third-generation high-performance, advanced VelociTI™ very-long-instruction-word (VLIW) architecture developed by Texas Instruments (TI), making these DSPs an excellent choice for digital media applications. The C64x+™ devices are upward code-compatible from previous devices that are part of the C6000™ DSP platform. The C64x™ DSPs support added functionality and have an expanded instruction set from previous devices.

With performance of up to 4800 million instructions per second (MIPS) at a clock rate of 600 MHz, the C64x+ core offers solutions to high-performance DSP programming challenges. The DSP core possesses the operational flexibility of high-speed controllers and the numerical capability of array processors. The C64x+ DSP core processor has 64 general-purpose registers of 32-bit word length and eight highly independent functional units—two multipliers for a 32-bit result and six arithmetic logic units (ALUs). The eight functional units include instructions to accelerate the performance in video and imaging applications. The DSP core can produce four 16-bit multiply-accumulates (MACs) per cycle for a total of 2400 million MACs per second (MMACS), or eight 8-bit MACs per cycle for a total of 4800 MMACS.

The DM6437 also has application-specific hardware logic, on-chip memory, and additional on-chip peripherals similar to the other C6000 DSP platform devices. The DM6437 core uses a two-level cache-based architecture. The Level 1 program memory/cache (L1P) consists of a 256K-bit memory space that can be configured as mapped memory or direct mapped cache, and the Level 1 data (L1D) consists of a 640K-bit memory space —384K-bit of which is mapped memory and 256K-bit of which can be configured as mapped memory or 2-way set-associative cache. The Level 2 memory/cache (L2) consists of a 1M-bit memory space that is shared between program and data space. L2 memory can be configured as mapped memory, cache, or combinations of the two.

The peripheral set includes:
  • 2 configurable video ports;
  • A 10/100 Mb/s Ethernet MAC (EMAC) with a management data input/output (MDIO) module;
  • A 4-bit transmit, 4-bit receive VLYNQ interface;
  • An Inter-Integrated Circuit (I2C) Bus interface;
  • Two MultiChannel Buffered Serial Ports (McBSPs);
  • A Multichannel Audio Serial Port (McASP0) with 4 serializers;
  • 2 64-bit general-purpose timers each configurable as 2 independent 32-bit timers;
  • 1 64-bit watchdog timer;
  • A user-configurable 16-bit Host-Port Interface (HPI);
  • Upto 111-pins of General-Purpose Input/Output (GPIO) with programmable interrupt/event generation modes, multiplexed with other peripherals;
  • 2 UARTs with hardware handshaking support on 1 UART;
  • 3 Pulse Width Modulator (PWM) peripherals;
  • 1 high-end Controller Area Network (CAN) controller [HECC];
  • 1Pperipheral Component Interconnect (PCI) [33 MHz]; and
  • 2 glueless external memory interfaces: an asynchronous external memory interface (EMIFA) for slower memories/peripherals, and a higher speed synchronous memory interface for DDR2.

No comments: