Патент США № | 9978013 |
---|---|
Автор(ы) | Kaufhold |
Дата выдачи | 22 мая 2018 г. |
The present invention is directed to systems and methods for detecting objects in a radar image stream. Embodiments of the invention can receive a data stream from radar sensors and use a deep neural network to convert the received data stream into a set of semantic labels, where each semantic label corresponds to an object in the radar data stream that the deep neural network has identified. Processing units running the deep neural network may be collocated onboard an airborne vehicle along with the radar sensor(s). The processing units can be configured with powerful, high-speed graphics processing units or field-programmable gate arrays that are low in size, weight, and power requirements. Embodiments of the invention are also directed to providing innovative advances to object recognition training systems that utilize a detector and an object recognition cascade to analyze radar image streams in real time. The object recognition cascade can comprise at least one recognizer that receives a non-background stream of image patches from a detector and automatically assigns one or more semantic labels to each non-background image patch. In some embodiments, a separate recognizer for the background analysis of patches may also be incorporated. There may be multiple detectors and multiple recognizers, depending on the design of the cascade. Embodiments of the invention also include novel methods to tailor deep neural network algorithms to successfully process radar imagery, utilizing techniques such as normalization, sampling, data augmentation, foveation, cascade architectures, and label harmonization.
Авторы: | John Patrick Kaufhold (Arlington, VA) | ||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|
Патентообладатель: |
|
||||||||||
Заявитель: | Deep Learning Analytics, LLC (Arlingon, VA) |
||||||||||
ID семейства патентов | 55074839 | ||||||||||
Номер заявки: | 14/794,376 | ||||||||||
Дата регистрации: | 08 июля 2015 г. |
Document Identifier | Publication Date | |
---|---|---|
US 20160019458 A1 | Jan 21, 2016 | |
Application Number | Filing Date | Patent Number | Issue Date | ||
---|---|---|---|---|---|
62025075 | Jul 16, 2014 | ||||
Класс патентной классификации США: | 1/1 |
Класс совместной патентной классификации: | G01S 7/417 (20130101); G06N 3/0454 (20130101); G01S 13/90 (20130101); G01S 13/904 (20190501) |
Класс международной патентной классификации (МПК): | G01S 13/90 (20060101); G01S 7/41 (20060101); G06N 3/04 (20060101) |
Область поиска: | ;342/25F |
7587064 | September 2009 | Owechko |
2008/0243383 | October 2008 | Lin |
2010/0103029 | April 2010 | Khatwa |
2010/0109938 | May 2010 | Oswald |
2011/0029471 | February 2011 | Chakradhar |
2013/0343641 | December 2013 | Mnih |
2014/0293091 | October 2014 | Rhoads |
T D. Ross, S. W. Worrell, V. J. Velten, J. C. Mossing, and M. L. Bryant, "Standard SAR AIR evaluation experiments using the MSTAR public release data set," in Aerospace/Defense Sensing and Controls, 1998, pp. 566-573. cited by applicant . J. C. Mossing and T. D. Ross; "Evaluation of SAR ATR algorithm performance sensitivity to MSTAR extended 1 operating conditions," in Aerospace/Defense Sensing and Controls, 1998, pp. 554-565. cited by applicant . T. D. Ross, J. J. Bradley, L. J. Hudson, and M. P. O'Connor, "SAR ATR: so what's the problem? An MSTAR perspective," in AeroSense'99, 1999; pp. 662-672. cited by applicant . A. B. Muccio and T. B. Scruggs, "Moving Target Indicator (MTI) Applications for Unmanned Aerial Vehicles (UAVS)," in Radar Conference, 2003. Proceedings of the International, 2003, pp. 541-546. cited by applicant . J. P. How, C. Fraser, K. C. Kulling, L. F. Bertuccelli, O. Toupet, L. Brunet, A. Bachrach, and N. Roy, "Increasing autonomy of UAVs," Robot. Autom. Mag. IEEE, vol. 16, No. 2, pp. 43-51, 2009. cited by applicant . Y. LeCun, F. J. Huang, and L. Bottou, "Learning methods for generic object recognition with invariance to pose and lighting," in Computer Vision and Pattern Recognition, 2004. CVPR 2004. Proceedings of the 2004 IEEE Computer I Society Conference on, 2004, vol. 2, pp. II-97. cited by applicant . A. Krizhevsky, I. Sutskever, and G. Hinton, "imagenet classification with deep convolutional neural networks," In Advances in Neural Information Processing Systems 25, 2012. pp. 1106-1114. cited by applicant . L. Deng, G. Hinton, and B. Kingsbury, "New types of deep neural network learning for speech recognition and related applications: An overview," in Proc. ICASSP, 2013. cited by applicant . A.-R. Mohamed, T. N. Sainath, G. Dahl, B. Ramahhadran, G. E. Hinton, and M. A. Picheny, "Deep belief networks using discriminative features for phone recognition," in Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on, 2011, pp. 5060-5063. cited by applicant . D. Ciresan, A. Giusti, and J. Schmidhuber, "Deep neural networks segment neuronal membranes in electron micioscopy images," in Advances in Neural Information Processing Systems 25, 2012, pp. 2852-2860. cited by applicant . D. Ciresan, U. Meier, and J. Schmidhuber, "Multi-column deep neural networks for image classification," in Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on, 2012, pp. 3642-3649. cited by applicant . J. Ngiarn, A. Khosla, M. Kim, J. Nam, H. Lee, and A. Ng, "Multimodal deep learning," in Proceedings of the 28th International Conference on Machine Learning (ICML-11), 2011, pp. 689-696. cited by applicant . D. M. Blei, T. L. Griffiths, and M. I. Jordan, "The nested chinese restaurant process and bayesian nonparametric inference of topic hierarchies," J. ACM JACM, vol. 57, No. 2, p. 7, 2010. cited by applicant . J. Paisley, C. Wang, D. M. Biel, and M. I. Jordan, "Nested hierarchical Dirichlet Processes," ArXiv Prepr. ArXiv1210.6738v4, May 2, 2014. cited by applicant . G. E. Hinton, "Training products of experts by minimizing contrastive divergence," Neural Comput., vol. 14, No. 8, pp. 11771-1800, 2002. cited by applicant . G. E. Hinton and R. R. Salakhutdinov, "Reducing the dimensionality of data with neural networks," Science, vol. 313, No. 5786, pp. 504-507,2006. cited by applicant . R. Salakhutdinov, A. Mnih, and G. Hinton, "Restricted Boltzmann machines for collaborative filtering," In Proceedings of the 24th international conference on Machine learning, 2007, pp. 791-798. cited by applicant . M. Ranzato, F. J. Huang, Y.-L. Boureau, and Y. Lecun, "Unsupervised learning of invariant feature hierarchies with applications to object recognition," in Computer Vision and Pattern Recognition, 2007. CVPR'07. IEEE Conference on, 2007, pp. 1-8. cited by applicant . P. Vincent, H. Larochelle, I. Lajoie, Y. Bengio, and P.-A. Manzagol, "Stacked denoising autoencoders: Learning useful representations in a deep network with a local denoising criterion," J. Mach. Learn, Res., vol. 9999, pp. 3371-3408, 2010. cited by applicant . G. E. Hinton, N. Srivastava, A. Krizhevsky, I. Sutskever, and R. R. Salakhutdinov, "Improving neural networks by preventing co-adaptation of feature detectors," ArXiv Prepr. ArXiv1207.0580v1, 2012. cited by applicant . G. E. Dahl, T. N. Sainath, and G. E. Hinton, "Improving Deep Neural Networks for LVCSR using Rectified Linear Units and Dropout," in Proc, ICASSP, 2013. cited by applicant . "ImageNet Large Scale Visual Recognition Competition 2012 (ILSVRC2012)." [Online]. Available: http://www.Image-net.org/challenges/LSVRC/2012/results.html. cited by applicant . S. Suvorova and J. Schroeder, "Automated Target Recognition Using the Karhunen--Loeve Transform with Invariance," Digit. Signal Process., vol. 12, No. 2, pp. 295-306, 2002. cited by applicant . T. D. Ross and L. C. Goodwon, "Improved automatic target recognition (ATR) value through enhancements and accommodations," in Defense and Security Symposium ; 2006; p. 62370T-62370T. cited by applicant . X. Yu, Y. Li, and L. C. Jiao, "SAR automatic target recognition based on classifiers fusion," in Multi-Platform/Multi-Sensor Remote Sensing Mapping (M2RSM), 2011 International Workshop 2011, 1-5. cited by applicant . "GeForce GTX 580 | Specifications | GeForce." [Online]. Available: http://www.geforce.com/hardware/desktop-gpus/geforce-gtx-580/specificatio- ns. cited by applicant . "GeForce GTX 780 | Specifications | GeForce." [Online]. Available: http://www.geforce.com/hardware/desktop-gpus/geforce-gtx-780/specificatio- ns. cited by applicant . "Parallel Programming and Computing Platform | CUDA | NVIDIA | NVIDIA." [Online]. Available: http://www.nvidia.com/object/cuda_home_new.htmi. cited by applicant . "Cuda-convnet--High-performance C++/CUDA implementation of convolutional neural networks--Google Project Hosting." [Online]. Available : http://code.google.com/p/cuda-convnet/. cited by applicant . G. E. Hinton and R. S. Zemel, "Autoencoders, minimum description length, and Helmholtz free energy," Adv. Neural Inf. Process. Syst., pp. 3-3, 1994. cited by applicant . P. Vincent, H. Larochelle, Y. Bengio, and P.-A. Manzagol, "Extracting and composing robust features with denoising autoencoders," in Proceedings of the 25th international conference on Machine learning, 2008, pp. 1096-1103. cited by applicant . G. E. Hinton, P. Dayan, and M. Revow, "Modeling the manifolds of images of handwritten digits," Neural Netw. IEEE Trans. On, vol. 8, No. 1, pp. 65-74, 1997. cited by applicant . R. Salakhutdinov and G. Hinton, "Semantic hashing," Int. J. Approx. Reason., vol. 50, No. 7, pp. 969-978, 2009. cited by applicant . R. Salakhutdinov and G. E. Hinton, "Deep boltzmann machines," in International Conference on Artificial Intelligence and Statistics, 2009, pp. 448-455. cited by applicant . D. Erhan, P.-A. Manzagol, Y. Bengio, S. Bengio, and P. Vincent, "The difficulty of training deep architectures and the effect of unsupervised pre-training," in International Conference on Artificial Intelligence and Statistics, 2009, pp. 153-160. cited by applicant . D. Erhan, Y. Bengio, A. Courville, P.-A. Manzagoi, P. Vincent, and S. Benglo, "Why does unsupervised pre-training help deep learning?," J. Mach. Learn, Res., 11, 625-660,2010. cited by applicant . B. M. Lake, R. Salakhutdirlov, J. Gross, and J. B. Tenenbaum, "One shot learning of simple visual concepts," in Proceedings of the 33rd Annual Conference of the Cognitive Science Society, 2011. cited by applicant . R. Salakhutdinov, A. Torralba, and J. Tenenbaum, "Learning to share visual appearance for multiclass object detection," in Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on, 2011, pp. 1481-1488. cited by applicant . "Yann LeCun--Google+ - +Alex Krizhevsky's talk at the ImageNet ECCV workshop . . . " [Online]. Available: https://plus.google.com/104362980539466846301/post/JBBFfv2XgWM. cited by applicant . S. Bengio, L. Deng, H. Larochelle, H. Lee, and R. Salakhutdinov, "Guest Editors' Introduction: Special Section on Learning Deep Architectures." IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 35. No. 8, Aug. 2013. cited by applicant . "Google Hires Brains that Helped Supercharge Machine Learning | Wired Enterprise | Wred.com." [Online]. Available: http://www.wired.com/wiredenterprise/2013/03/google_hinton/. cited by applicant . "Facebook is working on `deep learning` neural networks to learn even more about your personal life | ExtremeTech."[Online]. Available: http://www.extremetech.com/computing/167179-facebook-is-working-on-deep-l- earning-neural-net works-to-learn-even-more-about-your-personal-life. cited by applicant . "Facebook's `Deep Learning` Guru Reveals the Future of Al | Wired.com," Wired Enterprise. [Online]. Available: http://www.wired.com/wiredenterprise/2013/12/facebook-yann-lecun-qa/. cited by applicant . "New Techniques from Google and Ray Kurzweil Are Taking Artificial Intelligence to Another Level | MIT Technology Review." Available: http://www.technologyreview.com/featuredstory/513696/deep-learning/. cited by applicant . "Texas Advanced Computing Center--Stampede." [Online]. Available: http://www.tacc.utexas.edu/stampede/. cited by applicant . R. Caruana and A. Niculescu-Mizii, "An empirical comparison of supervised learning algorithms," in Proceedings of the 23rd international conference on Machine learning, 2006, pp. 161-168. cited by applicant . D. C. Ciresan, U. Meier, L. M. Gambardella, and J. Schmidhuber, "Deep, big, simple neural nets for handwritten digit recognition," Neural Comput., vol. 22, No. 12, pp. 3207-3220, 2010. cited by applicant . X. Glorot and Y. Benglo, "Understanding the difficulty of training deep feedforward neural networks," in International Conference on Artificial Intelligence and Statistics, 2010, pp. 249-256. cited by applicant . Y. Bengio, "Learning deep architectures for Al," Found. Trends.RTM. Mach. Learn., vol. 2, No. 1, pp. 1-127, 2009. cited by applicant . D. C. Ciresan, U. Meier, and J. Schmidhuber, "Transfer learning for Latin and Chinese characters with deep neural networks," in Neural Networks (IJCNN), The 2012 International Joint Conference on, 2012, pp. 1-6. cited by applicant . Yangoing, Jia, et al., "Caffee--Deep Learning Framework by the BVLC," [Online]. Available: http://caffe.berkeleyvision.org/tutorial/layers.html, 17 pgs. cited by applicant . Keller, John, "DARPA TRACE program using advanced algorithms, embedded computing for radar target recognition," Jul. 24, 2015 [Online]. Available http://www.militaryaerospace.com/articles/2015/07/hpec-radar-ta- rget-recognition.html, 3 pgs. cited by applicant . Fatica, Massimiliano, et al., "Synthetic Aperture Radar imaging on a CUDA-enabied mobile platform," IEEE, 2014, 5 pgs. cited by applicant . Cui, Zongyong, et al., "Hierarchical Recognition System for Target Recognition from Sparse Representations," Mathematical Problems in Engineering, vol. 2015, Article ID527095, 7 pgs. cited by applicant . Clemente, Carmine, et al., "Processing of synthetic Aperture Radar data with GPGPU," IEEE Xplore Abstract, Oct. 7, 2009. [Online]. Available: http://ieeexplore.ieee.org/xpl/articieDetails.jsp?reload=true&arnumber=53- 36272, 2 pgs. cited by applicant . Xie, Huiming, et al., "Multilayer feature learning for polarimetric synthetic radar data classification," IEEE Xplore Abstract, Jul. 13, 2014. [Online]. Available: http://ieeexplore.ieee.org/xpl/articleDetails.jsp?arnumber=6947062, 2 pgs. cited by applicant . Xu, Hui, et al., "A new algorithm of SAR target recognition based on advance deep learning neural network," Computer Modelling & New Technologies, vol. 18, No. 12A, 2014, pp. 25-30. cited by applicant . Morgan, David A.E., "Deep convolutional neural networks for ATR from SAR imagery," SPIE Proceedings, May 13, 2015, 2 pgs. cited by applicant . Gong, Maoguo, et al., "Multi-objective Sparse Feature Learning Model for Deep Neural Networks," IEEE Transactions on Neural Networks and Learning Systems, vol. 26, No. 12, Dec. 2015, pp. 3263-3277. cited by applicant . "GeForce GTX TITAN | Specifications | GeForce." [Online]. Available: http://www.geforce.com/hardware/desktop-gpus/geforce-gtx-titan/specificat- ions. cited by applicant . "CUDA Performance--Nvidia GeForce GTX 780 Review: Titan's Baby Brother Is Born." [Online]. Available: http://www.tomshardware.com/reviews/geforce-gtx-780-performance-review,35- 16-26.html. cited by applicant . "Deep Learning." [Online]. Available: http://www.cs.toronto.edu/.about.rsalakhu/isbi.html. cited by applicant . R. Salakhutdinov, J. Tenenbaum, and A. Torralba, "One-shot learning with a hierarchical nonparametric bayesian model," 2010. cited by applicant . "AWS for US Federal Government." [Online]. Available: http://aws.amazon.com/federal/. cited by applicant . L. Wan, M. Zeiler, S. Zhang, Y. L. Cun, and R. Fergus, "Regularization of neural networks using dropconnect," in Proceedings of the 30th International Conference on Machine Learning (ICML-13), 2013, pp. 1058-1066. cited by applicant . I. J. Goodfellow, D. Warde-Farley, P. Lamblin, V. Dumoulin, M. Mirza, R. Pascanu, J. Bergstra, F. Bastien, and Y. Bengio, "Pylearn2: a machine learning research library," ArXiv Prepr. ArXiv13084214, 2013. cited by applicant . Simonyan & Zisserman, "Very Deep Convolutional Networks for Large-Scale Image Recognition," ICLR 2015. cited by applicant . J.C. Duchi, E. Hazan, and Y. Singer, "Adaptive Subgradient Methods for Online Learning and Stochastic Optimization," Journal of Machine Learning Research, 2011. cited by applicant . Adadelta, as in Zeiler, M. D., "Adadelta: An Adaptive Learning Rate Method," CoRR, abs/1212.5701, 2012). cited by applicant . A. Hannun, C. Case, J. Casper, B. Catanzaro, G. Diamos, E. Elsen, R. Prenger, S. Satheesh, S. Sengupta, A. Coates, and A. Y. Ng, "Deep Speech: Scaling Up End-To-End Speech Recognition," CoRR, abs/1412.5567, 2014). cited by applicant . Le, Q., Ranzato, M., Monga, R., Devin, M., Chen, K., Corrado, G., Dean, J., and Ng, A., "Building High-Level Features Using Large Scale Unsupervised Learning," ICML, 2012). cited by applicant . V. Nair and G. E. Hinton, "Rectified linear units improve restricted Boltzmann machines," In Proc. 27th International Conference on Machine Learning, 2010. cited by applicant . G. Hinton, O. Vinyals, and J. Dean, "Distilling the Knowledge in a Neural Network," Neural Information Processing Systems: Workshop Deep Learning and Representation Learning Workshop, 2014. cited by applicant . M. Zeiler, R. Fergus, "Stochastic Pooling for Regularization of Deep Convolutional Neural Networks," ArXiv Prepr.ArXiv1301.3557v1 Jan. 16, 2013. cited by applicant . C. Szegedy, W. Liu, Y. Jia, P. Sermanet, S. Reed, D. Anguelov, D. Erhan, V. Vanhoucke, A. Rabinovich, "Going Deeper With Convolutions," ArXiv Prepr. ArXiv1409.4842v1, Sep. 17, 2014. cited by applicant . S. Wager, W. Wang, P. Liang, "Dropout Training as Adaptive Regularization.". cited by applicant . C. Szegedy, A. Toshev, D. Erhan, "Deep Neural Networks for Object Detection.". cited by applicant . DARPA Broad Agency Announcement "Target Recognition and Adaption in Contested Environments (TRACE)," Strategy Technology Office, DARPA-BAA-15-09, Dec. 1, 2014. cited by applicant . L. Deng, D. Yu, "Deep Learning Methods and Applications," Foundations and Trends in Signal Processing, vol. 7:3-4. cited by applicant. |