Page 233

Publication Type	technical report
School or College	College of Engineering
Department	Computing, School of
Creator	Gu, Jun
Title	Parallel Algorithms and Architectures for Very Fast AI Search
Date	1989-08
Description	A wide range of problems in natural and artificial intelligence, computer vision, computer graphics, database engineering, operations research, symbolic logic, robot manipulation and hardware design automation are special cases of Consistent Labeling Problems (CLP). CLP has long been viewed as an efficient computational model based on a unit constraint relation containing 2N-tuples of units and labels which specifies which N-tuples of labels are compatible with which N-tuples of units. Due to high computation cost and design complexity, most currently best-known algorithms and computer architectures have usually proven infeasible for solving the consistent labeling problems. Efficiency in CLP computation during the last decade has only been improved a few times. This research presents several parallel algorithms and computer architectures for solving CLP within a parallel processing framework. For problems of practical interest, 4 to 10 orders of magnitude of efficiency improvement can be easily reached. Several simple wafer scale computer architectures are given which implement these parallel algorithms at a surprisingly low cost.
Type	Text
Subject	parallel algorithms; CLP; consistent labeling problems; computer programming; computer science
Language	eng
Bibliographic Citation	Gu, J. (1989). Parallel algorithms and architectures for very fast AI search.
Series	University of Utah Computer Science Technical Report
Relation is Part of	ARPANET
Format Medium	application/pdf
Format Extent	116,876,932 bytes
File Name	Gu-Parallel_Algorithms.pdf
Conversion Specifications	Original scanned with Kirtas 2400 and saved as 400 ppi uncompressed TIFF. PDF generated by Adobe Acrobat Pro X for CONTENTdm display
ARK	ark:/87278/s69s3s9x
Setname	ir_computersa
ID	99969
Reference URL	https://collections.lib.utah.edu/ark:/87278/s69s3s9x

Page Metadata

Title	Page 233
Setname	ir_computersa
ID	99881
OCR Text	cessor arrays. The complete architecture can be globally synchronized or self-time controlled. The parallel mDRA algorithm performed on the parallel mDRA architecture is illustrated in Figure 6.40. It has optimal time complexity, i.e., O(nm). Meanwhile, its convergence property has been greatly improved. Actual algorithm runs and simulations indicate that this algorithm is many orders of magnitude faster than the parallel DRA5 algorithm. Three advanced parallel mDRA architectures were designed during 1988 [68]. Some implementation issues for the parallel mDRA architecture are discussed in the next sections. 6.7 Wafer-Scale Integration of Parallel DRA Architectures. VLSI circuits offer a wonderful computing medium with incredible computing power and permit much spatial parallelism within a 2-dimensional plane, while in any sequential uniprocessor machine only 1-dimensional serial computation is possible. In order to map the optimal parallel DRA5 and mDRA algorithms onto a VLSI architecture to solve large-size engineering problems, one has to deal with the following two critical challenges: (1) I/O Problem. I/O design in the DRA5 and mDRA implementations is important. It may become a bottleneck for the entire system if we still follow the track of conventional chip-level design. (2) Extension to Large-Scale Computation. When the problem size (n and m) increases, or one selects a very large granule size in the processor implementation, the complete design has to be implemented on separate chips, thus increasing the performance penalties resulting from off-chip communication. This is due mainly to the time required to drive the package pins and also the expense of initializing
Reference URL	https://collections.lib.utah.edu/ark:/87278/s69s3s9x/99881