Want Better AI? Get Input From a Real (Human) Expert

Scientists at the Department of Energy’s Pacific Northwest National Laboratory have put forth a new way to evaluate an AI system’s recommendations. They bring human experts into the loop to review how the machine-learning system performed on a set of data. The experts learn which types of data the system typically classifies correctly, and which types lead to confusion and system errors. Armed with this knowledge, the experts then offer their own confidence scores on future system recommendations.

New Data on LLM Accuracy

Juan Sequeda, Principal Scientist at data.world, recently published a research paper, “A Benchmark to Understand the Role of Knowledge Graphs on Large Language Model’s Accuracy for Question Answering on Enterprise SQL Databases.” He and his co-authors benchmarked LLM accuracy in answering questions over real business data.

NEW Survey: Orgs are Ready for More GenAI, but Ill-equipped

83% of organizations are in the exploration or experimentation phase of their GenAI journey, but implementation and risk management challenges stand in the way of expansion. This is according to a new Dataiku and Forrester survey of AI decision-makers: “Accelerate Generative AI Applications With Platform Capabilities.”

How CISOs Can Enable Productization of Valuable Data Assets

In this contributed article, Ellie Dobson, VP Product at Apheris, notes that organisations know data is often the most valuable asset they own. But unlocking the full potential of these data assets is difficult in the face of privacy and security concerns, a problem compounded by a lack of existing infrastructure that allows for collaboration across organisations.

IT Survey Finds Enterprises Identify Automation and Generative AI as Top Business Priorities

Digitate, a leading provider of SaaS-based enterprise software for IT and business operations, released the results of its new study, “AI and Automation: Laying the Foundation for the Autonomous Enterprise,” revealing that 90% of IT decision-makers plan to deploy more automation, including AI, in the next 12 months.

Lightning AI Introduces Lightning AI Studios, its Enterprise-Grade Platform for Rapid Prototyping and Deploying AI Products

Lightning AI, the company behind PyTorch Lightning, with over 91 million downloads, announced the introduction of Lightning AI Studios, the culmination of 3 years of research into the next-generation development paradigm for the age of AI.

New Report on Potential Impact of AI on Future of Work

LinkedIn has just released a new special report, “Future of Work Report: AI at Work,” that looks at emerging AI trends in the workplace and examines the potential impact of AI on the future of work.

REPORT: The C-suite Weighs in on Gen AI

Thomson Reuters’ “Future of Professionals C-Suite Report” reveals that while the majority of the C-suite is already using or planning to use Gen AI at work, an overwhelming 87% of execs are concerned with how it will be deployed.

Deci Unveils DeciLM-7B: A Leap Forward in Language Model Performance and Inference Cost Efficiency

Deci, the deep learning company harnessing AI to build AI, unveiled the latest addition to its suite of innovative generative AI models, DeciLM-7B, a 7-billion-parameter large language model. Building upon the success of its predecessor, DeciLM-6B, DeciLM-7B is setting new benchmarks in the large language model (LLM) space, outperforming prominent open-source models such as Llama2 7B and Mistral 7B in both accuracy and efficiency.

Delphina Raises $7.5 Million to Build Copilot for Data Science

Delphina, an LLM-powered copilot for data science, announced the closing of a $7.5 million seed round co-led by Costanoa Ventures and Radical Ventures with participation from 20+ prominent angel investors, including Stanford professor Fei-Fei Li. The funding will be used to expand its team and accelerate product development.