Knowledge-based Modeling: An Expert Centered Data Driven Approach
Motivation
The key to solving engineering problems is to develop accurate models. For many complex engineering problems, first principle models cannot be developed effectively. In such cases, the traditional approach is to develop an empirical model based on experimental or historical data through techniques such as multiple linear regression, neural networks, or support vector machine. The drawback is that prior knowledge cannot be effectively incorporated into the model building process. The resultant model is strongly biased towards the data collected and its robustness is questionable when data is sparse. An alternative is to develop a symbolic expert system by interviewing domain experts and convert their knowledge into IF-THEN rules. Since expert systems are not developed based on actual data, they are biased towards the experts’ heuristics and are usually rigid and brittle (adversely affected by noise). In addition, acquisition of domain knowledge is a tedious and time consuming process, commonly known as the knowledge acquisition bottleneck. We believe the integration of knowledge embedded in data and those possessed by experts can lead to a superior modeling approach. This motivates us to develop a knowledge-based modeling approach, where automatic data analysis tools are constructed to assist experts in building and optimizing models.
Goal
The goal is to develop a generic framework for knowledge-based modeling and a set of enabling tools that allow experts to quickly build an accurate and robust model in specific domains. We have selected IF-THEN linguistic rules (originated from fuzzy logic) as a unified knowledge representation scheme because of their compatibility to expert heuristic reasoning and their adaptability to actual data. The task is then two-fold:
(1) Extract easy to understand IF-THEN linguistic rules from dataset so experts can incorporate their knowledge by adding and revising these rules, thus achieve knowledge integration and alleviate the knowledge acquisition bottleneck.
(2) Automatically tune the integrated rules to obtain a knowledge-based model with optimal accuracy and interpretability, thus ensure model robustness and its user acceptance.
Technical Approach
The framework for knowledge-based
modeling is shown in Figure 1. The set
of enabling tools, including cluster analysis, rule extraction, approximate
reasoning, and rule tuning, are collectively maintained by a digital knowledge
assistant (DKA). The model building
process is as follows. It starts with experts
identifying input and output parameters of the problem domain. The DKA then conducts an initial analysis of
the collected dataset to characterize input and output parameters in terms of
cluster analysis. This allows the experts
to visualize the data distribution. They
may refine the clusters and provide linguistic terms (such as small, medium,
and large) to describe the parameters based on their expertise and the observed
clusters. The DKA subsequently extract
linguistic IF-THEN rules from the dataset and present them to the experts. The experts may add, delete, or revise these
rules to form an integrated knowledge base.
Now, the DKA can utilize an approximate reasoning technique (originated
from fuzzy inference) to operate on the rules and compare the result to
collected data. It then uses non-linear
optimization techniques to fine tune these rules based on accuracy and
interpretability requirements specified by the experts. The enabling techniques are briefly
summarized as follows.

Figure 1. Generic framework for knowledge-based modeling.
Cluster Analysis
The purpose of clustering is to separate a dataset into a number of groups (called clusters) based on some measure of similarity, in order to observe interesting structures of the dataset and draw conclusions. Cluster analysis is a well-established research area where many algorithms are available. Commonly used clustering algorithms include k-means clustering and fuzzy c-means clustering. Theses algorithms require the number of clusters to be determined. A recent algorithm that does not have this restriction is Yager and Filev’s (1994) mountain clustering, which was further improved by Chiu (1995) and termed subtractive clustering. The basic idea of subtractive clustering is as follows. First, a mountain function is established such that a data point will have a higher mountain value if it has more nearby data points. The data point with the highest mountain value is then treated as the first cluster center. Mountain values for all data points are then updated to bring down the mountain value of the first cluster center and that of its nearby points. In this way, the second cluster center (which has the maximum updated mountain value) will be found somewhere else. This procedure repeats until the current maximum mountain value falls below a certain level compared to the original maximum mountain value. It is possible to use a single adjustable parameter, i.e., cluster radius, to control the number of clusters in subtractive clustering.
Rule Extraction
Rule extraction can be done in a number of different ways. A popular method is decision tree induction. The principle is as follows. The root of the decision tree is established and an input parameter, say P, is selected based on a certain criterion (e.g., information gain). Say P is described using n linguistic terms. Then n edges are established, with each edge corresponds to one linguistic term. The original dataset is thus partitioned into n new datasets based on A. If the data samples within a new dataset all have identical outcomes, then stop growing that branch of the tree. Otherwise, establish a new node and select another input parameter to grow the tree. Each leaf in the constructed decision tree would represent a rule. An alternative is to use the result of cluster analysis, namely, represent each cluster with a prototype and treat it as a rule. When this approach is used, care must be taken to control linguistic term/cluster mapping in order to ensure rule interpretability.
Approximate Reasoning
Approximate reasoning, also known as fuzzy inference, operates on qualitative linguistic rules to generate quantitative results. It requires each linguistic term to be quantified using a membership function so that quantitative inputs can be fuzzified to invoke linguistic rules. It also requires a defuzzification mechanism to convert rule inference results to quantitative outputs. Although fuzzy inference follows a standard process (Jang et al. 1997), there are considerable freedom in choosing membership functions and the deffuzification mechanism. To ensure computation efficiency, the weighted average method is used for deffuzification. To introduce greater flexibility in quantifying linguistic terms, we use an intersected sigmoid membership function. It is a unique combination of two sigmoid functions, one open left and the other open right. Compared to the commonly used Gaussian membership function, it can be adjusted to approximate a much wider data distribution.
Rule Tuning
Rule tuning is accomplished through the use of an Adaptive Mamdani Fuzzy Model (AMFM) as shown in Figure 2. Taken a leaf out of neuro-fuzzy modeling, AMFM converts the approximate reasoning process into a five-layer network and map the rules into the network architecture. Membership functions and rules are then automatically adjusted through error back-propagation and non-linear optimization.

Figure 2. Adaptive
Mamdani fuzzy model for rule tuning.
In addition
to these enabling techniques, we are also working on data processing techniques
including data cleansing and dimensionality reduction. Data cleansing refers to filling in missing
values for incomplete data samples so they can be included in model building,
thus maximizing data utilization. Dimensionality
reduction is the determination of critical parameters that are necessary and
sufficient for model building, which is critical to improve model accuracy and
robustness, especially when data is sparse.
Applications
The knowledge-based modeling approach has been proven successfully in various real-world design and manufacturing applications. These applications are briefly summarized in Table 1.
Table 1. Summary of applications.
|
Application |
Input |
Output |
Result |
Collaborator |
|
Engine Nacelle Forming |
Part Geometry, Part
Material Type, Die Type |
Process Method: Bare Punch,
Blow Down, Blow Up |
Extracted rules for forming
process planning to shorten new operator learning curve from 6 months to 1
month (estimated) |
|
|
Atomizer Design |
Orifice Diameter, Ratio of
Length to Width of Slots, Ratio of Spin Chamber to Orifice Diameter |
Fuel Flow, Spray Angle, Sauter Mean Diameter |
Similar response surfaces compared
to those produced by a regression model, with improved R2 (2% in
average) and F-statistics (68% in average) after data cleansing |
|
|
Mesh Generation |
2-Dimensional Part Geometry |
Quadrilateral Multi-Block |
A software tool that allows
users to automatically generate multi-block for quadrilateral meshes |
|
|
Seamless Tubing |
Disc Speeds, Motor Loads,
Temperature, Disc Position, Plug, Reeler, Tube Weight, Tube Length |
Process Yield |
Reduced yield prediction
model parameters by 80% (from 10 to 2) while maintaining a 93% predictive
accuracy |
|
|
Thermal Paint Calibration |
Thermal image mapped into
R/G/B values |
Surface Temperature |
Achieved 97.75% accuracy in
mapping image to temperature |
|
|
Temperature Profiling |
Oven Power, Plastic Preform Depth |
Preform Temperature |
Generated temperature
profile comparable to that obtained using theoretical model with dramatically
reduced model development time |
|
Technical Contact
Dr. Samuel H. Huang, Associate Professor and Director
Intelligent
Department of Mechanical, Industrial and Nuclear Engineering
Phone: (513)556-1154 Fax: (513)556-3390
E-mail: sam.huang@uc.edu
Project Team
|
Member |
Title |
Time |
|
John Shi |
Post-Doc |
January 2002 – present |
|
Ranganath Kothamasu |
Ph.D. |
September 1999 – present |
|
Kai Hu |
M.S. |
March 2004 – present |
|
Hao Xing |
Ph.D. |
September 1998 – August 2003 |
|
Saurabh Dwivedi |
M.S. |
January 2001 – December 2003 |
|
Niharika Rapur |
M.S. |
September 2002 – August 2003 |
|
Sumit Maloo |
M.S. |
January 2001 – December 2001 |
|
Nikhil Pujari |
M.S. |
January 2001 – December 2001 |
|
Guruprasad Banbekar |
M.S. |
September 1999 – August 2001 |
|
Yogesh Shiralkar |
M.S. |
January 2000 – December 2000 |
Acknowledgement
We would like to thank the Ohio Aerospace Institute (OAI), NASA, and the Ohio Department of Development (ODOD) for supporting the following projects:
We would also like
to thank the following companies for research collaboration through student
internships and engineering support for real-world applications.
Publications
1. S. Dwivedi, S. H. Huang, J. Shi, and W. H. VerDuin, “Yield Prediction for Seamless Tubing Processes: A Computational Intelligence Approach,” accepted by Computers in Industry.
2. S. H. Huang, “Dimensionality Reduction in Automatic Knowledge Acquisition,” IEEE Transactions on Knowledge and Data Engineering, Vol. 15, No. 6, 2003, pp. 1364-1373.
3. H. Xing, S. H. Huang, and J. Shi, “Rapid Development of Knowledge-based System Via Integrated Knowledge Acquisition,” Artificial Intelligence for Engineering Design, Analysis and Manufacturing, Vol. 17, No. 3, 2003, pp. 221-234.
4. S. H. Huang, R. Kothamasu, Y. C. Shiralkar, and D. Bogstad, “Prediction of Plastic Preform Temperature Profile and Modeling Perspective,” International Journal of Manufacturing Science and Technology, Vol. 4, No. 2, 2003, pp. 56-83.
5. S. H. Huang and H. Xing, “Extract Intelligible and Concise Fuzzy Rules from Neural Networks,” Fuzzy Sets and Systems, Vol. 132, No. 2, 2002, pp. 233-243.
6. S. H. Huang, H. Xing, and M. Benjamin, “Automated Knowledge Acquisition for Design and Manufacturing: The Case of Micromachined Atomizer,” Journal of Intelligent Manufacturing, Vol. 12, No. 4, 2001, pp. 377-391.
7. S. H. Huang, H. Xing, and G. Wang, “Intelligent Classification of Drop Hammer Forming Process Method,” International Journal of Advanced Manufacturing Technology, Vol. 18, No. 2, 2001, pp. 89-97.
8. S. H. Huang and M. R. Endsley, “Providing Understanding of the Behavior of Feedforward Neural Networks,” IEEE Transactions on Systems, Man, and Cybernetics, Vol. 27, No. 3, 1997, pp. 465-474.
9. S. H. Huang, H.-C. Zhang, S. Sun, and H. Li, “Function Approximation and Neural-Fuzzy Approach to Machining Process Selection,” IEEE Transactions on Components, Packaging, and Manufacturing Technology - Part C: Manufacturing, Vol. 19, No. 1, 1996, pp. 9-18.
10. S. H. Huang and H.-C. Zhang, “Neural-Expert Hybrid Approach for Intelligent Manufacturing: A Survey,” Computers in Industry, Vol. 26, No. 2, 1995, pp. 107-126.
11. H.-C. Zhang and S. H. Huang, “Applications of Neural Networks in Manufacturing: A State-of-the-Art Survey,” International Journal of Production Research, Vol. 33, No. 3, 1995, pp. 705-728.
12. H.-C. Zhang and S. H. Huang, “A Fuzzy Approach for Process Plan Selection,” International Journal of Production Research, Vol. 32, No. 6, 1994, pp. 1265-1279.
13. S. H. Huang and H.-C. Zhang, “Artificial Neural Networks in Manufacturing: Concepts, Applications, and Perspectives,” IEEE Transactions on Components, Packaging, and Manufacturing Technology - Part I, Vol. 17, No. 2, 1994, pp. 212-228.
14. S. H. Huang, J. Shi, S. Maloo,
and E. Steinthorsson, “Automatic Generation of
Quadrilateral Multi-Block Topology for FEA/CFD Applications,” 2002 IEEE International Conference on
Industrial Technology,
15. R. Kothamasu, S. H.
Huang, and
16. Marinescu, R. Felix, R. Kothamasu,
S. H. Huang, “Prediction of 3-D Surface Texture Using Neuro-Fuzzy
Techniques,” 2002 International CIRP Design Seminar,
17. S. H. Huang, R. Kothamasu,
and H. Xing, “Automatic Generation of Pragmatic and Intelligible Fuzzy
Rules,” IEEE Systems, Man and Cybernetics 2001,
18. S. H. Huang, H. Xing, and R. Kothamasu, “An Integrated Approach to Manufacturing
Knowledge Acquisition: With Application to Process Planning for Drop Hammer
Forming,” IJCAI-01 Workshop on Artificial Intelligence and Manufacturing,
19. P. Svenmarck, S. H.
Huang, and E. P. Blasch, “Transparent Neurofuzzy Models of Human Performance,” Proceedings of
the Fifth Annual Conference on Human Interaction with Complex Systems (HICS),
April 30-May 2, Beckman Institute, University of Illinois, 2000.
20. S. H. Huang and H. Xing, “Heuristic
Manufacturing Knowledge Modeling: A Neural Network/Fuzzy Logic Hybrid
Approach,” the 4th International Conference on Frontiers of Design and
Manufacturing, pp. 80-90,
21. S. H. Huang and H.-C. Zhang, “On the Use of
Knowledge-Based Connectionist Model in CAPP systems,” Manufacturing Science
and Engineering, PED-Vol. 68-1, ASME, 1994, pp.
65-74.
22. S. H. Huang and H.-C. Zhang, “Neural Networks
in Manufacturing,” Robotics and Manufacturing: Proceedings of the Fifth
International Symposium on Robotics and Manufacturing, pp. 743-748, August,
1994
23.
S. H.
Huang and H.-C. Zhang, “Overview of Neural Networks in Manufacturing,” Proceedings
of 1993 IEEE/CHMT International Electronics Manufacturing Technology Symposium,
Santa Clara, CA, pp. 177-190, October, 1993.
24. S. H. Huang and H.-C. Zhang, “A Neural-Expert
Hybrid Knowledge Acquisition Model for Process Planning,” AUTOFACT'93
Conference Proceedings,