Linear Separability in Descent Procedures for Linear Classifiers

Basu, Mitra; Ho, Tin Kam

doi:10.1007/978-1-84628-172-3_4

Mitra Basu³ &
Tin Kam Ho⁴

Part of the book series: Advanced Information and Knowledge Processing ((AI&KP))

1139 Accesses

Summary

Determining linear separability is an important way to understand structures present in data.We review the behavior of several classical descent procedures for determining linear separability and training linear classifiers in the presence of linearly nonseparable input. We compare the adaptive procedures to linear programming methods using many pairwise discrimination problems from a public database. We found that the adaptive procedures have serious implementational problems that make them less preferable than linear programming.

Access provided by Autonomous University of Puebla. Download to read the full chapter text

Chapter PDF

Supervised learning via smoothed Polya trees

Article 12 October 2018

Linear classifiers and selection of informative features

Article 01 July 2017

Model Selection for Classification with a Large Number of Classes

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

M. Basu, Q. Liang. The fractional correction rule: a new perspective. Neural Network, 11, 1027–1039, 1998.
Article Google Scholar
C. Blake, E. Keogh, C.J. Merz. UCI repository of machine learning databases [http://www.ics.uci.edu/~mlearn/MLRepository.html]. Irvine, CA: University of California, Department of Information and Computer Science, 1998.
Google Scholar
H.D. Block, S.A. Levin. On the boundedness of an iterative procedure for solving a system of linear inequalities. Proc. AMS, 26, 229–235, 1970.
Article MathSciNet MATH Google Scholar
K.P. Bennett, O.L. Mangasarian. Robust linear programming discrimination of two linearly inseparable sets. Optimization Methods and Software, 1, 23–24, 1992.
Google Scholar
A.R. Butz. Perceptron type learning algorithms in nonseparable situations. Journal of Mathematical Analysis and Applications, 17, 560–576, 1967.
Article MathSciNet MATH Google Scholar
R.O. Duda, P.E. Hart. Pattern Classification and Scene Analysis. New York: Wiley, 1973.
MATH Google Scholar
R. Fourer, D.M. Gay, B.W. Kernighan. AMPL: A Modeling Language for Mathematical Programming. South San Francisco: The Scientific Press, 1993.
Google Scholar
S.I. Gallant. Neural Network Learning & Expert Systems. Cambridge, MA: MIT Press, 1993.
MATH Google Scholar
F. Glover. Improved linear programming models for discriminant analysis. Decision Sciences, 21(4), 771–785, 1990.
Google Scholar
R.G. Grinold. Comment on “Pattern Classification Design by Linear Programming”. IEEE Transactions on Computers, C-18(4), 378–379, April 1969.
Google Scholar
R.G. Grinold. Mathematical programming methods of pattern classification. Management Science, 19(3), 272–289, 1972.
MathSciNet MATH Google Scholar
M.H. Hassoun, J. Song. Adaptive Ho-Kashyap rules for perceptron training. IEEE Transactions on Neural Networks, 3, 51–61, 1992.
Article Google Scholar
Y.C. Ho, R.L. Kashyap. An algorithm for linear inequalities and its applications. IEEE Transactions on Electronic Computers, 14, 683–688, 1965.
Google Scholar
S.-C. Huang, Y.-F. Huang. Bounds on the number of hidden neurons in multilayer perceptrons. IEEE Transactions on Neural Networks, 2, 47–55, 1991.
Article Google Scholar
T.Y. Kwok, D.Y. Yeung. Objective functions for training new hidden units in constructive neural networks. IEEE Transactions on Neural Networks, 8(5), 1131–1148, 1997.
Article Google Scholar
O.L. Mangasarian. Linear and nonlinear separation of patterns by linear programming. Operations Research, 13, 444–452, 1965.
Article MathSciNet MATH Google Scholar
O.L. Mangasarian. Nonlinear Programming, New York: McGraw-Hill, 1969.
MATH Google Scholar
C.H. Mays. Adaptive Threshold Logic. Ph.D. thesis, Stanford Electronics Labs, Stanford, CA, 1963.
Google Scholar
M. Minsky, S. Papert. Perceptrons, expanded edition. Cambridge, MA: MIT Press, 1988.
MATH Google Scholar
B.A. Murtagh, M.A. Saunders. Large-scale linearly constrained optimization. Mathematical Programming, 14, 41–72, 1978.
Article MathSciNet MATH Google Scholar
N.J. Nilsson. The Mathematical Foundations of Learning Machines. San Mateo, CA: Morgan Kaufmann, 1990.
MATH Google Scholar
R. Reed. Pruning algorithms—a survey. IEEE Transactions on Neural Networks, 4, 740–747, 1993.
Article Google Scholar
F. Rosenblatt. Principles of Neurodynamics: Perceptron and the Theory of Brain Mechanism. Washington, D.C.: Spartan Press, 1962.
Google Scholar
V.P. Roychowdhury, K.Y. Siu, T. Kailath. Classification of linearly nonseparable patterns by linear threshold elements. IEEE Transactions on Neural Networks, 6(2), 318–331, March 1995.
Article Google Scholar
M.A. Sartori, P.J. Antsaklis, A simple method to derive bounds on the size and to train multilayer neural networks. IEEE Transactions on Neural Networks, 2, 467–471, 1991.
Article Google Scholar
K.Y. Siu, V.P. Roychowdhury, T. Kailath. Discrete Neural Computation. Englewood Cliffs, NJ: Prentice Hall, 1995.
MATH Google Scholar
F.W. Smith. Pattern classifier design by linear programming. IEEE Transactions on Computers, C-17(4), 367–372, April 1968.
Google Scholar
S. Tamura, T. Masahiko. Capabilities of a four-layered feedforward neural network: four layer versus three. IEEE Transactions on Neural Networks, 8, 251–255, 1997.
Article Google Scholar
J.T. Tou, R.C. Gonzalez. Pattern Recognition Principles. Reading, MA: Addison-Wesley, 1974.
MATH Google Scholar
A.W. Tucker. Dual systems of homogeneous linear relations. In H.W. Kuhn, A.W. Tucker, eds., Linear Inequalities and Related Systems. Annals of Mathematics Studies Number 38, Princeton, NJ: Princeton University Press, pages 3–18, 1956.
Google Scholar
V.N. Vapnik, A.J. Chervonenkis. Theory of Pattern Recognition (in Russian). Nauka, Moscow, 1974; German translation: W.N. Wapnik, A.J. Tschervonenkis. Theorie der Zeichenerkennung. Berlin: Akademia, 1979.
Google Scholar
V.N. Vapnik. Statistical Learning Theory. New York: Wiley, 1998.
MATH Google Scholar
B. Widrow, M.E. Hoff, Jr. Adaptive switching circuits. Tech. Report 1553-1, Stanford Electronics Labs, Stanford, CA, 1960.
Google Scholar
B. Widrow, M.A. Lehr. 30 years of adaptive neural networks: perceptron, madeline, and backpropagation. Proceedings of the IEEE, 78, 1415–1442, 1990.
Article Google Scholar
B. Widrow, S.D. Stearns. Adaptive Signal Processing. Englewood Cliffs, NJ: Prentice Hall, 1985.
MATH Google Scholar
M.H. Wright, Interior methods for constrained optimization. In A. Iserles, ed. Acta Numerica, pages 341–407, Cambridge: Cambridge University Press, 1992.
Google Scholar

Download references

Author information

Authors and Affiliations

National Science Foundation, 4201 Wilson Blvd., Arlington, VA, 22230, USA
Mitra Basu
Mathematical & Algorithmic Sciences Research Center, Bell Laboratories, Lucent Technologies, Murray Hill, NJ, 07974-0636, USA
Tin Kam Ho

Authors

Mitra Basu
View author publications
You can also search for this author in PubMed Google Scholar
Tin Kam Ho
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Electrical Engineering Department, City College, City University of New York, USA
Mitra Basu PhD
Bell Laboratories, Lucent Technologies, New Jersey, USA
Tin Kam Ho BBA, MS, PhD

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Basu, M., Ho, T.K. (2006). Linear Separability in Descent Procedures for Linear Classifiers. In: Basu, M., Ho, T.K. (eds) Data Complexity in Pattern Recognition. Advanced Information and Knowledge Processing. Springer, London. https://doi.org/10.1007/978-1-84628-172-3_4

Download citation

DOI: https://doi.org/10.1007/978-1-84628-172-3_4
Publisher Name: Springer, London
Print ISBN: 978-1-84628-171-6
Online ISBN: 978-1-84628-172-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Linear Separability in Descent Procedures for Linear Classifiers

Summary

Chapter PDF

Similar content being viewed by others

Supervised learning via smoothed Polya trees

Linear classifiers and selection of informative features

Model Selection for Classification with a Large Number of Classes

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Publish with us

Navigation

Linear Separability in Descent Procedures for Linear Classifiers

Summary

Chapter PDF

Similar content being viewed by others

Supervised learning via smoothed Polya trees

Linear classifiers and selection of informative features

Model Selection for Classification with a Large Number of Classes

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Share this chapter

Publish with us

Search

Navigation