Article: Facebook AI Open Sources AugLy: A New Python Library For Data Augmentation To Develop Robust Machine Learning Models

Free, Open Data, Open Software

Facebook has recently open-sourced AugLy, a new Python library that aims to help AI researchers use data augmentations to evaluate and improve the robustness of their machine learning models. AugLy provides sophisticated data augmentation tools for creating samples to train and test different systems.

AugLy is a new open-source data augmentation library that combines four modalities (audio, image, video, and text), a combination that is becoming increasingly significant in several AI research fields. It offers over 100 data augmentations based on the real-life ways people alter images and videos on platforms like Facebook and Instagram.
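
To make the idea of testing robustness with augmentations concrete, here is a minimal, self-contained sketch in plain Python. It does not use AugLy's API: the function names `swap_adjacent_chars`, `toy_classifier`, and `robustness` are hypothetical, and the "model" is a trivial keyword check standing in for a real classifier. The sketch only illustrates the general pattern of perturbing inputs and checking whether predictions stay the same.

```python
import random


def swap_adjacent_chars(text, rate=0.1, seed=0):
    """Simulate typos by swapping neighbouring letters (hypothetical augmentation)."""
    rng = random.Random(seed)  # seeded so each variant is reproducible
    chars = list(text)
    i = 0
    while i < len(chars) - 1:
        both_letters = chars[i].isalpha() and chars[i + 1].isalpha()
        if both_letters and rng.random() < rate:
            chars[i], chars[i + 1] = chars[i + 1], chars[i]
            i += 2  # skip the pair that was just swapped
        else:
            i += 1
    return "".join(chars)


def toy_classifier(text):
    """Stand-in model (an assumption): flags text that contains the word 'spam'."""
    return "spam" in text.lower()


def robustness(model, samples, augment, n_variants=20):
    """Fraction of augmented variants whose prediction matches the clean input."""
    agree = 0
    total = 0
    for sample in samples:
        clean_prediction = model(sample)
        for k in range(n_variants):
            agree += model(augment(sample, seed=k)) == clean_prediction
            total += 1
    return agree / total


if __name__ == "__main__":
    samples = ["this is spam", "hello there"]
    score = robustness(toy_classifier, samples, swap_adjacent_chars)
    print(f"prediction agreement under simulated typos: {score:.2f}")
```

A score near 1.0 would suggest the model's predictions survive this particular perturbation; a lower score points to inputs worth adding to the training set in augmented form.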

Article: Open source software is an incredibly valuable asset, says Leonid Radvinsky

Language, Open Data

Leonid Radvinsky believes that open source software is the key to providing accessible technology to developers all over the world. These projects are maintained by a global network of contributing developers, and anyone interested can contribute to these technologies and use them in their projects at no cost.

Open source software is the foundation of a vibrant developer community, but it cannot be entirely fueled by passion. Developer time is a valuable resource, and contributors to open source software deserve to be paid for their work. Since open source software is available for free, people are understandably curious about how these projects are funded. Luckily, there are several funding options that open source projects can benefit from. Investors like Leonid Radvinsky are helping the open source movement by providing funding and encouraging widespread adoption of the technologies.

Article: How cryptographic ledgers are helping geospatial researchers deal with information overload

Geospatial, Open Data

Of all the potential use cases for geospatial services, location-based real-time monitoring applications may be the fastest growing. Some experts expect these to be the biggest drivers of the Earth Observation field in the coming years, which could end up creating an unprecedented amount of data. Existing GIS solutions have long had to deal with increasingly large datasets, but this could portend the creation of exponentially larger ones.

Computer industry representatives believe that blockchain-based solutions could be used to manage these geospatial datasets regardless of their physical size. Agricultural supply chain managers have been turning to distributed cryptographic ledgers to manage GIS data collected in that industry. Programmers might soon start to apply these to the Earth observation industry, which has been one of the biggest creators of information in recent years.

Article: Researchers open-source benchmarks measuring quality of AI-generated code

Language, Open Data

The applications of computer programming are vast in scope. And as computers become ubiquitous, the demand for quality code draws an ever-growing number of aspiring programmers to the profession. After years of study to become proficient at coding, experts learn to convert abstract ideas into concrete, executable programs. But what if AI could do the same?

In recent years, large-scale AI language models have shown promise in generalizing to tasks including writing code, implying that humans’ work may one day be supplemented by AI systems. But while some studies show that language models can translate code and fix compilation issues, there’s been little work on rigorously testing the coding ability of models given general coding problems.
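
One common way to test generated code is to run each candidate program against a set of input/output test cases and report the share that pass. The summary above does not describe how the benchmark itself scores models, so the sketch below is an assumption for illustration only; the helper names `passes` and `pass_rate` and the candidate programs are hypothetical.

```python
def passes(source, func_name, test_cases):
    """Return True if the candidate defines func_name and satisfies every test case.

    WARNING: exec() runs arbitrary code. A real evaluation harness would
    sandbox candidates and enforce time limits; this sketch does neither.
    """
    namespace = {}
    try:
        exec(source, namespace)  # may raise SyntaxError or any runtime error
        func = namespace[func_name]
        return all(func(*args) == expected for args, expected in test_cases)
    except Exception:
        # Syntax errors, missing functions, and crashes all count as failures.
        return False


def pass_rate(candidates, func_name, test_cases):
    """Fraction of candidate programs that pass all test cases."""
    return sum(passes(c, func_name, test_cases) for c in candidates) / len(candidates)


if __name__ == "__main__":
    # Hypothetical model outputs for the task "write add(a, b)".
    candidates = [
        "def add(a, b):\n    return a + b",  # correct
        "def add(a, b):\n    return a - b",  # wrong logic
        "def add(a, b) return a + b",        # does not compile
    ]
    tests = [((1, 2), 3), ((0, 0), 0)]
    print(f"pass rate: {pass_rate(candidates, 'add', tests):.2f}")
```

Scoring on hidden test cases rather than on textual similarity to a reference solution rewards programs that actually work, which is the property a coding benchmark is meant to measure.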

Article: NIWC Atlantic Prototype Rolls Open Source Data, Sentiment into One Dashboard

Language, Open Data, Open Decision-Support

Naval Information Warfare Center (NIWC) Atlantic recently unveiled a new dashboard technology that can deliver increased battlespace awareness to Marines by collecting and making sense of endless streams of open source information. Called Integrated Visualizations for Operations Running in an Information Environment (IVORIE), the unique capability carefully pulls from disparate open sources and integrates thousands of pieces of news, social media and other online data into one geospatially configured interface.

The end state is a computer monitor with a vivid and detailed display that projects a clearer picture on the information front for commanders and decision-makers. NIWC Atlantic demonstrated IVORIE for Marines and others at Camp Lejeune during Naval Integration in Contested Environments (NICE) Advanced Naval Technology Exercise (ANTX), which ended in mid-April.

Article: Data-driven environmental decision-making and action in armed conflict

Access, Geospatial, Open Data, Open Decision-Support

A digital revolution through a myriad of earth observation data and open-source investigations is reshaping our understanding of the environmental causes and consequences of armed conflicts. From spatio-temporal analysis to near-real-time monitoring of conflicts and resulting harm from scorched earth tactics, environmental data can quickly be incorporated in humanitarian action and reconstruction efforts.

In other words, the scope and severity of environmental damage in conflict is now better understood and more foreseeable. How can this transformative development influence military conduct to strengthen the protection of civilians and the environment in armed conflict?

Article: The Five Ways To Build Machine Learning Models

Free, Language, Libre, Open Data, Open Software

Machine learning is powering most of the recent advancements in AI, including computer vision, natural language processing, predictive analytics, autonomous systems, and a wide range of other applications. Machine learning systems are core to enabling each of the seven patterns of AI.

In order to move up the data value chain from the information level to the knowledge level, we need to apply machine learning that will enable systems to identify patterns in data and apply what they learn from those patterns to new, never-before-seen data. Machine learning is not all of AI, but it is a big part of it.

Article: Facebook Launches AI That Understands Language Without Labels

Language, Open Data

Decades ago, asking for basic directions or information in an unfamiliar language would have been an arduous task. The advent of solutions such as Google Translate alleviated that problem to an extent. But is this enough? For example, India alone is home to 23 official languages and thousands of unofficial ones, yet Google Translate supports just 11 of India’s languages.

Other speech recognition technologies might support even fewer languages. Also, languages such as Basque and Swahili are far likelier to have more limited AI speech recognition capabilities than Hindi, English and Mandarin. Hence, the paths opened up by speech recognition technology are only available to a small fraction of the countless languages spoken all over the world. This is because most AI for speech recognition belongs to a category called supervised learning.

Article: Foursquare adds geospatial analytics and visualizations power with Unfolded acquisition

Geospatial, Open Data

Foursquare, a leading independent location technology company known for its city guides, announced its acquisition of Unfolded, a geospatial analytics platform.

With the addition of Unfolded’s capabilities to its technology stack, enterprises and brands can soon come to Foursquare to not only access its location data, but work with that data in an integrated platform for merging, enriching, analysing, and visualising spatial data – in whatever environment they choose. “Welcoming Unfolded to the team makes the Foursquare platform more powerful, robust, and accessible to our clients and partners,” said Gary Little, President and CEO of Foursquare.

Article: The race to understand the exhilarating, dangerous world of language AI

Innovation, Language, Open Data, Open Space

To start, Google plans to integrate LaMDA into its main search portal, its voice assistant, and Workspace, its collection of cloud-based work software that includes Gmail, Docs, and Drive. But the eventual goal, said Pichai, is to create a conversational interface that allows people to retrieve any kind of information (text, visual, audio) across all of Google’s products just by asking.

LaMDA’s rollout signals yet another way in which language technologies are becoming enmeshed in our day-to-day lives. But Google’s flashy presentation belied the ethical debate that now surrounds such cutting-edge systems. LaMDA is what’s known as a large language model (LLM)—a deep-learning algorithm trained on enormous amounts of text data.
