Mostrar el registro sencillo del ítem

Ponencia

dc.creatorTapiador Morales, Ricardoes
dc.creatorRíos Navarro, José Antonioes
dc.creatorLinares Barranco, Alejandroes
dc.creatorKim, Minkyues
dc.creatorKadetotad, Deepakes
dc.creatorSeo, Jae-sunes
dc.date.accessioned2020-02-13T11:45:20Z
dc.date.available2020-02-13T11:45:20Z
dc.date.issued2017
dc.identifier.citationTapiador Morales, R., Rios Navarro, A., Linares Barranco, A., Kim, M., Kadetotad, D. y Seo, J. (2017). Comprehensive Evaluation of OpenCL-Based CNN Implementations for FPGAs. En IWANN 2017: 14th International Work-Conference on Artificial Neural Networks (271-282), Cadiz, España: Springer.
dc.identifier.isbn978-3-319-59146-9es
dc.identifier.issn0302-9743es
dc.identifier.urihttps://hdl.handle.net/11441/93008
dc.description.abstractDeep learning has significantly advanced the state of the art in artificial intelligence, gaining wide popularity from both industry and academia. Special interest is around Convolutional Neural Networks (CNN), which take inspiration from the hierarchical structure of the visual cortex, to form deep layers of convolutional operations, along with fully connected classifiers. Hardware implementations of these deep CNN architectures are challenged with memory bottlenecks that require many convolution and fully-connected layers demanding large amount of communication for parallel computation. Multi-core CPU based solutions have demonstrated their inadequacy for this problem due to the memory wall and low parallelism. Many-core GPU architectures show superior performance but they consume high power and also have memory constraints due to inconsistencies between cache and main memory. OpenCL is commonly used to describe these architectures for their execution on GPGPUs or FPGAs. FPGA design solutions are also actively being explored, which allow implementing the memory hierarchy using embedded parallel BlockRAMs. This boosts the parallel use of shared memory elements between multiple processing units, avoiding data replicability and inconsistencies. This makes FPGAs potentially powerful solutions for real-time classification of CNNs. In this paper both Altera and Xilinx adopted OpenCL co-design frameworks for pseudo-automatic development solutions are evaluated. A comprehensive evaluation and comparison for a 5-layer deep CNN is presented. Hardware resources, temporal performance and the OpenCL architecture for CNNs are discussed. Xilinx demonstrates faster synthesis, better FPGA resource utilization and more compact boards. Altera provides multi-platforms tools, mature design community and better execution times.es
dc.description.sponsorshipMinisterio de Economía y Competitividad TEC2016-77785-Pes
dc.formatapplication/pdfes
dc.language.isoenges
dc.publisherSpringeres
dc.relation.ispartofIWANN 2017: 14th International Work-Conference on Artificial Neural Networks (2017), p 271-282
dc.rightsAttribution-NonCommercial-NoDerivatives 4.0 Internacional*
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0/*
dc.subjectDeep learninges
dc.subjectConvolutional Neural Networks (CNN)es
dc.subjectHardware Accelerationes
dc.subjectOpenCLes
dc.subjectFPGAes
dc.subjectCaffees
dc.subjectXilinxes
dc.subjectAlteraes
dc.titleComprehensive Evaluation of OpenCL-Based CNN Implementations for FPGAses
dc.typeinfo:eu-repo/semantics/conferenceObjectes
dcterms.identifierhttps://ror.org/03yxnpp24
dc.type.versioninfo:eu-repo/semantics/submittedVersiones
dc.rights.accessRightsinfo:eu-repo/semantics/openAccesses
dc.contributor.affiliationUniversidad de Sevilla. Departamento de Arquitectura y Tecnología de Computadoreses
dc.relation.projectIDTEC2016-77785-Pes
dc.relation.publisherversionhttps://link.springer.com/chapter/10.1007/978-3-319-59147-6_24es
dc.identifier.doi10.1007/978-3-319-59147-6_24es
dc.contributor.groupUniversidad de Sevilla. TEP-108: Robótica y Tecnología de Computadores Aplicada a la Rehabilitaciónes
idus.format.extent12es
dc.publication.initialPage271es
dc.publication.endPage282es
dc.eventtitleIWANN 2017: 14th International Work-Conference on Artificial Neural Networkses
dc.eventinstitutionCadiz, Españaes
dc.relation.publicationplaceBerlines

FicherosTamañoFormatoVerDescripción
Comprehensive Evaluation of ...5.320MbIcon   [PDF] Ver/Abrir  

Este registro aparece en las siguientes colecciones

Mostrar el registro sencillo del ítem

Attribution-NonCommercial-NoDerivatives 4.0 Internacional
Excepto si se señala otra cosa, la licencia del ítem se describe como: Attribution-NonCommercial-NoDerivatives 4.0 Internacional