Dataconomy
  • News
    • Artificial Intelligence
    • Cybersecurity
    • DeFi & Blockchain
    • Finance
    • Gaming
    • Startups
    • Tech
  • Industry
  • Research
  • Resources
    • Articles
    • Guides
    • Case Studies
    • Glossary
    • Whitepapers
  • Newsletter
  • + More
    • Conversations
    • Events
    • About
      • About
      • Contact
      • Imprint
      • Legal & Privacy
      • Partner With Us
Subscribe
No Result
View All Result
  • AI
  • Tech
  • Cybersecurity
  • Finance
  • DeFi & Blockchain
  • Startups
  • Gaming
Dataconomy
  • News
    • Artificial Intelligence
    • Cybersecurity
    • DeFi & Blockchain
    • Finance
    • Gaming
    • Startups
    • Tech
  • Industry
  • Research
  • Resources
    • Articles
    • Guides
    • Case Studies
    • Glossary
    • Whitepapers
  • Newsletter
  • + More
    • Conversations
    • Events
    • About
      • About
      • Contact
      • Imprint
      • Legal & Privacy
      • Partner With Us
Subscribe
No Result
View All Result
Dataconomy
No Result
View All Result

Researchers at Microsoft Showcase Latest Deep Learning Approach to Image Recognition

byEileen McNulty
October 31, 2014
in Artificial Intelligence, News

Researchers at Microsoft Research Asia have discovered a solution to the excruciatingly slow object detection that is characteristic to existing “deep convolutional neural networks” (CNNs) which has been published in Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition, a research paper written by Kaiming He and Jian Sun, along with a couple of academics serving internships at the Asia lab: Xiangyu Zhang of Xi’an Jiaotong University and Shaoqing Ren of the University of Science and Technology of China.

“Image recognition involves two core tasks: image classification and object detection,” Mr. He explains. “In image classification, the computer is taught to recognize object categories, such as “person,” “cat,” “dog,” or “bike,” while in object detection, the computer needs to provide the precise positions of the objects in the image.” The second task, Sun adds, is the more difficult of the two. “We need,” he says, “to answer ‘what and where’ for one or more objects in an image.”

Image recognition has gained rapidly from the use of deep neural networks and deep learning, along with the availability of prodigious data sets. Here, particularly, such networks are called CNNs, inspired by biological processes of the human brain, explains Microsoft Research.

However the algorithms are too slow for object detection in practice having to be applied thousands of times on a single image, for detecting a few objects. The new solution speeds up the process by almost 100 times, with impeccable accuracy.

They outline a new network structure that uses “spatial pyramid pooling” (SPP) technique to generate a descriptor from a region of any size.

The researchers, although quite proud of their breakthrough, believe that the field needs further exploration. “One of the important next steps,” Mr. Sun notes, “is to obtain much larger and richer training data. That will significantly impact the research in this direction.”

“Our work is the fastest deep-learning system for accurate object detection,” Mr. He said. “The speed is getting very close to the requirement for consumer usage.”

Read more here.

Follow @DataconomyMedia

(Image source: Microsoft)

Stay Ahead of the Curve!

Don't miss out on the latest insights, trends, and analysis in the world of data, technology, and startups. Subscribe to our newsletter and get exclusive content delivered straight to your inbox.

Tags: Computer VisionMicrosoft

Related Posts

EU launches €107M RAISE virtual institute to accelerate AI-driven science

EU launches €107M RAISE virtual institute to accelerate AI-driven science

November 4, 2025
AMD confirms critical RDSEED flaw in Zen 5 CPUs

AMD confirms critical RDSEED flaw in Zen 5 CPUs

November 4, 2025
Google rolls out redesigned Quick Share app for Windows

Google rolls out redesigned Quick Share app for Windows

November 4, 2025
WhatsApp for Mac adds chat themes with 38 color options

WhatsApp for Mac adds chat themes with 38 color options

November 4, 2025
Gemini now powers Google Translate’s “Advanced” mode

Gemini now powers Google Translate’s “Advanced” mode

November 4, 2025
Coca-Cola’s new AI-generated Christmas ad shows why generative video still struggles with realism

Coca-Cola’s new AI-generated Christmas ad shows why generative video still struggles with realism

November 4, 2025
Please login to join discussion

LATEST NEWS

EU launches €107M RAISE virtual institute to accelerate AI-driven science

AMD confirms critical RDSEED flaw in Zen 5 CPUs

Google rolls out redesigned Quick Share app for Windows

WhatsApp for Mac adds chat themes with 38 color options

Gemini now powers Google Translate’s “Advanced” mode

Coca-Cola’s new AI-generated Christmas ad shows why generative video still struggles with realism

Dataconomy

COPYRIGHT © DATACONOMY MEDIA GMBH, ALL RIGHTS RESERVED.

  • About
  • Imprint
  • Contact
  • Legal & Privacy

Follow Us

  • News
    • Artificial Intelligence
    • Cybersecurity
    • DeFi & Blockchain
    • Finance
    • Gaming
    • Startups
    • Tech
  • Industry
  • Research
  • Resources
    • Articles
    • Guides
    • Case Studies
    • Glossary
    • Whitepapers
  • Newsletter
  • + More
    • Conversations
    • Events
    • About
      • About
      • Contact
      • Imprint
      • Legal & Privacy
      • Partner With Us
No Result
View All Result
Subscribe

This website uses cookies. By continuing to use this website you are giving consent to cookies being used. Visit our Privacy Policy.