Dataconomy
  • News
    • Artificial Intelligence
    • Cybersecurity
    • DeFi & Blockchain
    • Finance
    • Gaming
    • Startups
    • Tech
  • Industry
  • Research
  • Resources
    • Articles
    • Guides
    • Case Studies
    • Whitepapers
  • AI toolsNEW
  • Newsletter
  • + More
    • Glossary
    • Conversations
    • Events
    • About
      • Who we are
      • Contact
      • Imprint
      • Legal & Privacy
      • Partner With Us
Subscribe
No Result
View All Result
  • AI
  • Tech
  • Cybersecurity
  • Finance
  • DeFi & Blockchain
  • Startups
  • Gaming
Dataconomy
  • News
    • Artificial Intelligence
    • Cybersecurity
    • DeFi & Blockchain
    • Finance
    • Gaming
    • Startups
    • Tech
  • Industry
  • Research
  • Resources
    • Articles
    • Guides
    • Case Studies
    • Whitepapers
  • AI toolsNEW
  • Newsletter
  • + More
    • Glossary
    • Conversations
    • Events
    • About
      • Who we are
      • Contact
      • Imprint
      • Legal & Privacy
      • Partner With Us
Subscribe
No Result
View All Result
Dataconomy
No Result
View All Result

Researchers at Microsoft Showcase Latest Deep Learning Approach to Image Recognition

byEileen McNulty
October 31, 2014
in Artificial Intelligence, News
Home News Artificial Intelligence
Share on FacebookShare on TwitterShare on LinkedInShare on WhatsAppShare on e-mail

Researchers at Microsoft Research Asia have discovered a solution to the excruciatingly slow object detection that is characteristic to existing “deep convolutional neural networks” (CNNs) which has been published in Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition, a research paper written by Kaiming He and Jian Sun, along with a couple of academics serving internships at the Asia lab: Xiangyu Zhang of Xi’an Jiaotong University and Shaoqing Ren of the University of Science and Technology of China.

“Image recognition involves two core tasks: image classification and object detection,” Mr. He explains. “In image classification, the computer is taught to recognize object categories, such as “person,” “cat,” “dog,” or “bike,” while in object detection, the computer needs to provide the precise positions of the objects in the image.” The second task, Sun adds, is the more difficult of the two. “We need,” he says, “to answer ‘what and where’ for one or more objects in an image.”

Image recognition has gained rapidly from the use of deep neural networks and deep learning, along with the availability of prodigious data sets. Here, particularly, such networks are called CNNs, inspired by biological processes of the human brain, explains Microsoft Research.

However the algorithms are too slow for object detection in practice having to be applied thousands of times on a single image, for detecting a few objects. The new solution speeds up the process by almost 100 times, with impeccable accuracy.

They outline a new network structure that uses “spatial pyramid pooling” (SPP) technique to generate a descriptor from a region of any size.

The researchers, although quite proud of their breakthrough, believe that the field needs further exploration. “One of the important next steps,” Mr. Sun notes, “is to obtain much larger and richer training data. That will significantly impact the research in this direction.”

“Our work is the fastest deep-learning system for accurate object detection,” Mr. He said. “The speed is getting very close to the requirement for consumer usage.”

Read more here.

Follow @DataconomyMedia

(Image source: Microsoft)

Stay Ahead of the Curve!

Don't miss out on the latest insights, trends, and analysis in the world of data, technology, and startups. Subscribe to our newsletter and get exclusive content delivered straight to your inbox.

Tags: Computer VisionMicrosoft

Related Posts

Google Workspace adds password-protected Office file editing

Google Workspace adds password-protected Office file editing

January 20, 2026
Claim: NVIDIA green-lit pirated book downloads for AI training

Claim: NVIDIA green-lit pirated book downloads for AI training

January 20, 2026
Tesla restarts Dojo3 supercomputer project as AI5 chip stabilizes

Tesla restarts Dojo3 supercomputer project as AI5 chip stabilizes

January 20, 2026
OpenAI targets “practical adoption” for 2026 strategy

OpenAI targets “practical adoption” for 2026 strategy

January 20, 2026
Nvidia hits 200 teraFLOP emulated FP64 for scientific computing

Nvidia hits 200 teraFLOP emulated FP64 for scientific computing

January 19, 2026
Walmart maintains Apple Pay ban in U.S. stores for 2026

Walmart maintains Apple Pay ban in U.S. stores for 2026

January 19, 2026
Please login to join discussion

LATEST NEWS

Google Workspace adds password-protected Office file editing

Claim: NVIDIA green-lit pirated book downloads for AI training

Tesla restarts Dojo3 supercomputer project as AI5 chip stabilizes

OpenAI targets “practical adoption” for 2026 strategy

Nvidia hits 200 teraFLOP emulated FP64 for scientific computing

Walmart maintains Apple Pay ban in U.S. stores for 2026

Dataconomy

COPYRIGHT © DATACONOMY MEDIA GMBH, ALL RIGHTS RESERVED.

  • About
  • Imprint
  • Contact
  • Legal & Privacy

Follow Us

  • News
    • Artificial Intelligence
    • Cybersecurity
    • DeFi & Blockchain
    • Finance
    • Gaming
    • Startups
    • Tech
  • Industry
  • Research
  • Resources
    • Articles
    • Guides
    • Case Studies
    • Whitepapers
  • AI tools
  • Newsletter
  • + More
    • Glossary
    • Conversations
    • Events
    • About
      • Who we are
      • Contact
      • Imprint
      • Legal & Privacy
      • Partner With Us
No Result
View All Result
Subscribe

This website uses cookies. By continuing to use this website you are giving consent to cookies being used. Visit our Privacy Policy.