Design and implementation of a neural-network-based image compression engine, developed as a Final Year Project by Jesu Joseph and Shibu Menon at Nanyang Technological University. The project received the highest possible grade and strong commendation from the research center.
17. STEPS
STEP 1: Find the closest (winning) neuron c:
||X(t) - W_c(t)|| = min_i {||X(t) - W_i(t)||}
STEP 2: Update the weight of the winning neuron and of the neurons in its topological neighborhood N_c(t):
W_i(t+1) = W_i(t) + α(t)·{X(t) - W_i(t)} for i ∈ N_c(t)
Iterate STEP 1 and STEP 2.
Parallel architecture for image compression
Introduction | Algorithm | Architecture | Results | Conclusion | Q&A
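The two steps above can be sketched in software. This is a minimal illustration of the self-organizing-map iteration, not the project's parallel hardware implementation; the 1-D neighborhood and the function name are assumptions for the example.

```python
import numpy as np

def som_step(X, W, alpha, radius):
    """One iteration of STEP 1 and STEP 2 (illustrative sketch only).
    X: input vector; W: (num_neurons, dim) codebook, updated in place."""
    # STEP 1: the winning neuron c minimizes ||X - W_i||
    dists = np.linalg.norm(W - X, axis=1)
    c = int(np.argmin(dists))
    # STEP 2: update the winner and its topological neighborhood N_c
    for i in range(len(W)):
        if abs(i - c) <= radius:   # 1-D neighborhood (an assumption)
            W[i] += alpha * (X - W[i])
    return c
```

In training, this step is iterated over all input vectors while alpha and the neighborhood radius shrink over time.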
25. ARCHITECTURAL NOVELTIES
Implementation of 7-bit learning: 7-bit learning, the mapping of pixels to an octal space, and the encoding of the MSB plane along with the image are new theoretical ideas that we implemented in hardware. This mode should theoretically produce better-quality images than 8-bit mode, because the neurons are packed more closely in a smaller space, so each pixel elicits a stronger response from the network structure.
Implementation of the 8-bit and the 7-bit learning algorithms: The same hardware can process an image in both 7-bit and 8-bit modes; a single push-button switch on the FPGA board selects the mode for a cycle. This is useful because some images give better output in 7-bit mode than in 8-bit mode, or vice versa, and the two can be compared in later studies. The design also anticipates future upgrades: a module could be added that computes the mean-square error of both the 7-bit and 8-bit images and selects the better one.
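The 7-bit idea can be illustrated with a small sketch. The assumption here, inferred from the description above, is that the MSB of each 8-bit pixel is stored as a separate bit plane and the network learns on the remaining 7 bits; the function names are hypothetical.

```python
def split_msb_plane(pixels):
    """Split 8-bit pixels into an MSB plane and 7-bit values (sketch of
    the assumed scheme, not the project's hardware encoder)."""
    msb_plane = [(p >> 7) & 1 for p in pixels]   # one bit per pixel
    low7 = [p & 0x7F for p in pixels]            # values in 0..127
    return msb_plane, low7

def merge_msb_plane(msb_plane, low7):
    # Reconstruct the original 8-bit pixels from the two parts
    return [(m << 7) | p for m, p in zip(msb_plane, low7)]
```

The 7-bit values occupy half the range of the 8-bit originals, which is why the codebook neurons sit closer together in that space.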
26. ARCHITECTURAL NOVELTIES
Integration of the encoding hardware with the learning hardware: Integrating encoding and learning ensures faster compression and reduces the hardware overhead. This was done with the future practical application of the hardware in mind: real-time video compression rather than stand-alone images only.
Implementation of a variable learning rate: A variable learning rate (using rates of 1/2, 1/4, ...) is a novel feature of this architecture. It ensures that neighbors are updated according to their distance from the winning neuron rather than by a fixed amount: five ranges of distance from the winner are distinguished, with the update factors derived through theoretical calculations.
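A sketch of the variable-rate idea: rates that are powers of two (1/2, 1/4, ...) reduce the hardware multiply to a right shift. The five distance thresholds below are illustrative placeholders, not the project's theoretically calculated values, and the function names are hypothetical.

```python
def neighbor_alpha(dist):
    """Learning rate for a neighbor at the given distance from the winner
    (illustrative sketch; thresholds are assumptions)."""
    ranges = [(1, 1/2), (2, 1/4), (4, 1/8), (8, 1/16), (16, 1/32)]
    for limit, alpha in ranges:
        if dist <= limit:
            return alpha
    return 0.0  # outside the neighborhood: no update

def update_weight(w, x, dist):
    # In hardware, multiplying by 1/2**k is a right shift by k bits
    return w + neighbor_alpha(dist) * (x - w)
```

Restricting the rates to powers of two is what makes the scheme cheap on an FPGA: each distance range maps to a fixed shift amount instead of a multiplier.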
27. ARCHITECTURAL NOVELTIES
Implementation of a learning rate that depends on the frequency count: The frequency-count value is calculated so that all neurons get an equal chance of being the winner. At the same time, the algorithm ensures that a neuron that has won most often is not updated as much as the less lucky ones. This is not seen in other similar algorithms.
Implementation of neighbor updating: Updating the neighbors together with the winner is another novel feature of our algorithm. It makes the design more complicated, but the output quality is considerably improved compared with other architectures.
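The frequency-count idea can be sketched as follows. The scaling rule used here (rate inversely proportional to the win count) is a guess chosen for illustration; the slide does not state the project's actual formula, and the class and function names are hypothetical.

```python
class Neuron:
    def __init__(self, weight):
        self.weight = weight
        self.wins = 0  # frequency-count register

def winner_update(neuron, x, base_alpha=0.5):
    """Update a winning neuron with a rate that shrinks as its win count
    grows (assumed scaling rule, for illustration only)."""
    neuron.wins += 1
    alpha = base_alpha / neuron.wins   # hypothetical: frequent winners move less
    neuron.weight += alpha * (x - neuron.weight)
```

Whatever the exact rule, the effect described above is the same: frequent winners receive smaller updates, so rarely winning neurons catch up and every neuron gets a fair share of the input space.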
37. Synthesis
Synthesis flow (diagram): technology libraries, Verilog code, and constraints are fed to the synthesis tool (Xilinx ISE Series 4.1i), which produces the prototype model, the schematic, and the optimized net-list; in-signal and out-signal files are used for simulation.
38. Synthesis
==================
Chip top-optimized
==================
Summary Information:
--------------------
Type: Optimized implementation
Source: top, up to date
Status: 0 errors, 0 warnings, 0 messages
Export: exported after last optimization
Chip create time: 0.000000 s
Chip optimize time: 598.734000 s
FSM synthesis: ONEHOT
Target Information:
-------------------
Vendor: Xilinx
Family: VIRTEX
Device: V800HQ240
Speed: -4
Chip Parameters:
----------------
Optimize for: Speed
Optimization effort: Low
Frequency: 50 MHz
Is module: No
Keep io pads: No
Number of flip-flops: 3129
Number of latches: 0
40. FPGA Implementation
Upload the configuration files and the image to the on-board memory.
Upload the FPGA bit file to the CPLD.
BAR LED 1 glows: the FPGA is configured.
Press Push Button 1 (START) to start the learning process.
BAR LED 2 glows: 2 loops completed.
BAR LED 3 glows: 4 loops completed.
BAR LED 4 glows: 6 loops completed.
BAR LED 5 glows: 10 loops completed.
BAR LED 6 glows: encoding completed.
Download the image and convert it to TIFF format.
43. Conclusion
1. The 7-bit process performs better than the 8-bit process.
2. Suitable for real-time encoding and streaming of video images (about 12 seconds at 5 MHz).
3. Use of the frequency-count register gives better images.
4. The more loops, the better the image (for 8-bit, beyond 5 loops), similar to human learning.
44. Recommendations
1. The algorithm can be modified to improve the learning time.
2. Real-time video compression with 2 parallel learning chips.
3. Both 7-bit and 8-bit modes in the same hardware.
4. MSB-plane compression.
45. Q & A