Generative AI in Your Desk Drawer: How to Get There

Andy Oram praxagora

Previous articles in this series have shown how generative AI can be used for administrative and back-office functions in health care. Now we’ll look at how models are trained for these specific purposes.

Training Generative AI Models

Every industry has to develop domain-specific models, and health care has the extra burden of protecting personally identifying data. These requirements raise three questions: when the general-purpose solutions offered by major tech companies are appropriate, where health care organizations can get large enough data sets to develop models, and how vanilla LLMs (also called foundation models) can be enhanced by narrow data sets from a health care provider.

Harman Dhawan, founder and CEO of Bikham, says that now there are “fairly cheap” LLMs that providers can build on and customize. Not only are there well-known options from OpenAI and Google, but some LLMs are open source.

Jean-Claude Saghbini, President of the Lumeris Value-Based Care Enablement business, says, “Vanilla LLMs can certainly be used within specific solution designs that allow you to constrain and control the output. But in all cases, the use of AI for back-office work requires organizational guardrails. Team members have to be trained on how to use AI responsibly, and that means deploying a change management process to train and adopt this technology safely and effectively. Privacy concerns are an important consideration, particularly when using publicly facing AI platforms.”

Seek AI helps customers connect their structured data to LLMs, according to founder and CEO Sarah Nagy. She says that “training data does not necessarily need to be large to be effective.”

She adds, “It is best to start small, employing just the most important datasets, when working with LLMs. One reason for this is to get used to the novel workflows resulting from LLMs. Once acquainted with these workflows, the organization can expand to additional datasets.”

Iodine Software, according to chief product and technology officer Priti Shah, has the necessary business associate agreements (BAAs) to get patient data from its customers. She says that 27% of all U.S. patient admissions flow through Iodine solutions, including real-time data.

When you remember that the now-discredited IBM Watson was trained on research papers, you can understand why using actual patient data is crucial.

Melvin Lai, senior associate at Silicon Foundry, says that use cases vary, but that “training on a dataset ranging from hundreds of gigabytes to several terabytes of text data should yield a well-functioning LLM. ChatGPT-3 was trained on approximately 45 terabytes of text data. Models focused on specific tasks or domains typically require less data to develop, but this raises the importance of curating the quality of input.”

Nick Stepro, chief product and technology officer of Arcadia, says, “As one example, a patient’s A1C may be formatted in many ways within an EHR. Training a model to identify those variables and consistently map them correctly ensures the most valuable and useful output. Programmers should train models to deliver an output in a specific format every time. This makes the application more reliable and dependable, providing the consistency users expect.”
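To make Stepro’s A1C example concrete, here is a minimal Python sketch of mapping differently formatted A1C entries to one consistent output. The sample strings, the normalize_a1c function, and the output keys are hypothetical illustrations, not Arcadia’s method. A production system would more likely rely on a trained model than on a regular expression, but the goal is the same: the same output format every time.

```python
import re
from typing import Optional

# Hypothetical pattern: an A1C label ("A1C", "HbA1c", "Hemoglobin A1c"),
# up to ten non-digit characters (":", "=", spaces), then the number.
_A1C_PATTERN = re.compile(
    r"(?:hemoglobin\s+|hb\s*)?a1c\D{0,10}?(\d{1,2}(?:\.\d+)?)",
    re.IGNORECASE,
)


def normalize_a1c(text: str) -> Optional[dict]:
    """Return an A1C reading in one fixed format, or None if none is found."""
    match = _A1C_PATTERN.search(text)
    if match is None:
        return None
    value = float(match.group(1))
    # Assumed plausibility range; rejects values that are probably
    # another measurement or a typing error.
    if not 3.0 <= value <= 20.0:
        return None
    return {"test": "HbA1c", "value": round(value, 1), "unit": "%"}


if __name__ == "__main__":
    for raw in ["HbA1c: 7.2%", "Hemoglobin A1c = 8 %", "A1C 6.8", "Glucose 110"]:
        print(raw, "->", normalize_a1c(raw))
```

Whatever the input looks like, the caller receives either None or a dictionary with the same three keys, which is what lets downstream code depend on the result.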

SS&C Blue Prism, according to Anna Twomey, senior director of healthcare, develops a generative AI model as follows. The company starts with either a vanilla foundation model or one based only on medical records. In traditional machine learning parlance, the results of the models are called vector tables and consist of rules such as “six percent of the decision depends on age, eight percent on the presence of diabetes,” etc. SS&C Blue Prism then analyzes each client’s own data to apply a customized vector table.
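As a rough illustration of what a table of such rules could look like, the Python sketch below stores per-feature weights and combines them into a single score. Only the two weights (six percent for age, eight percent for diabetes) come from the example above. The feature names, the 0-to-1 scaling, and the weighted_score function are assumptions made for illustration, not SS&C Blue Prism’s implementation.

```python
from typing import Dict

# Hypothetical weight table. Only these two weights appear in the article's
# example; a real table would cover many more features.
WEIGHTS: Dict[str, float] = {"age": 0.06, "has_diabetes": 0.08}


def weighted_score(features: Dict[str, float], weights: Dict[str, float]) -> float:
    """Sum each feature value (assumed scaled to 0..1) times its weight.

    Features missing from the input contribute nothing to the score.
    """
    return sum(weight * features.get(name, 0.0) for name, weight in weights.items())


if __name__ == "__main__":
    patient = {"age": 0.5, "has_diabetes": 1.0}  # made-up, already scaled
    print(weighted_score(patient, WEIGHTS))
```

Customizing the table for a client would then amount to replacing the values in WEIGHTS with ones derived from that client’s data.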

For audits and compliance, the tool can calculate metrics from the Healthcare Effectiveness Data and Information Set (HEDIS). These help an organization track how well it’s carrying out treatment, identify gaps in patient communications, and fill these gaps. Figure 1 shows a typical screen from SS&C Blue Prism.

Figure 1. SS&C Blue Prism interface, shown as a flowchart view on a laptop.

Erik Barnett, North America Advisory Healthcare & Life Sciences Lead at Avanade, says that their clients normally run the service on internal data. For instance, staff can create a presentation by searching existing company documents, optionally accepting data from the Web as well.

Abhishek Sharma, principal of business transformation at Sagility, says the company uses generative AI to create synthetic data for specific machine learning models serving payers and providers when real data is lacking. He advises health care institutions to combine generative AI with other digital assets and deep domain expertise to create a holistic solution.

Vignesh Ravikumar, partner at Sierra Ventures, predicts that industries will move over time to smaller, more specialized LLMs.

Deirdre Leone, chief customer officer at ContractPodAi, believes that success for generative AI in contract development depends on domain-specific models: specialized LLMs trained to understand complex legal situations while protecting sensitive patient information, so as to avoid inaccuracies and misuse. She says, “With this specialized information, a legal team can confidently draw up contracts and oversee them throughout their life cycle in more productive and efficient ways than before.”

Cameron Andrews, founder and CEO of Sirona Medical, writes to me, “Choosing LLMs is like hiring people: Some are smarter, some are more specialized, and some are more expensive than others. Health care organizations should focus on their IT infrastructure first, to ensure that they have the tools to pick, swap, combine, and tune LLMs easily and quickly or identify vendors and partners that do.”

Akshay Sharma, chief AI officer at Lyric, says the company uses a combination, which he calls an “orchestra,” of relatively small language models (SLMs) that it can fine-tune and run on cheaper GPUs and even CPUs. Using its own data as input, Lyric can develop special models for tasks in payment integrity, such as reasoning about fraud, waste, and abuse or handling coordination of benefits.

By analyzing claims data and identifying patterns that may indicate fraudulent activities, healthcare providers can reduce the risk of financial loss and improve billing accuracy.
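One simple pattern of this kind is possible duplicate billing: more than one claim for the same patient, provider, procedure, and date of service. The Python sketch below flags such groups for review. The Claim fields, the sample records, and the find_possible_duplicates function are hypothetical. Real payment integrity systems, including the models described above, look at far richer signals.

```python
from collections import defaultdict
from dataclasses import dataclass
from typing import Dict, List, Tuple


@dataclass(frozen=True)
class Claim:
    """Hypothetical, heavily simplified claim record."""
    claim_id: str
    patient_id: str
    provider_id: str
    procedure_code: str
    service_date: str  # ISO date, e.g. "2024-01-05"


def find_possible_duplicates(claims: List[Claim]) -> List[List[str]]:
    """Group claim IDs that share patient, provider, procedure, and date.

    Returns only the groups with more than one claim. A match is a prompt
    for human review, not proof of fraud.
    """
    groups: Dict[Tuple[str, str, str, str], List[str]] = defaultdict(list)
    for claim in claims:
        key = (claim.patient_id, claim.provider_id,
               claim.procedure_code, claim.service_date)
        groups[key].append(claim.claim_id)
    return [ids for ids in groups.values() if len(ids) > 1]


if __name__ == "__main__":
    sample = [
        Claim("C1", "P1", "D1", "99213", "2024-01-05"),
        Claim("C2", "P1", "D1", "99213", "2024-01-06"),
        Claim("C3", "P1", "D1", "99213", "2024-01-05"),
    ]
    print(find_possible_duplicates(sample))
```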

David Kereiakes, managing partner at Windham Venture Partners, says that organizations should include end users in the design process, drawing on them for testing and feedback.

CitiusTech has recently announced a testing platform to evaluate generative AI quality, the CitiusTech Gen AI Quality & Trust Solutions. Sridhar Turaga, senior vice president, data and analytics, noted that, “Up to now, there have been no established technology-agnostic and platform-agnostic solutions that measure the quality and trust of healthcare generative AI, end-to-end. Approaches used in building and evaluating LLMs and foundation models are useful, but have not been designed specifically for healthcare.”

The CitiusTech solution enables clients to measure their models for accuracy, calibration, robustness, fairness, bias, toxicity, and efficiency. Multiple health care innovators beta-tested the approach, which can be integrated into existing MLOps, DataOps, and quality management solutions.
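Two of the measures in that list, accuracy and calibration, can be illustrated with a short Python sketch. It uses a common textbook formulation (expected calibration error over equal-width confidence bins) and is not a description of how the CitiusTech platform computes its scores. The function names and sample numbers are hypothetical.

```python
from typing import List, Sequence


def accuracy(correct: Sequence[bool]) -> float:
    """Fraction of model answers that were judged correct."""
    return sum(correct) / len(correct)


def expected_calibration_error(
    confidences: Sequence[float],
    correct: Sequence[bool],
    n_bins: int = 10,
) -> float:
    """Weighted average gap between stated confidence and actual accuracy.

    Answers are grouped into equal-width confidence bins. A well-calibrated
    model that reports 80% confidence should be right about 80% of the time.
    """
    n = len(confidences)
    bins: List[List[int]] = [[] for _ in range(n_bins)]
    for i, conf in enumerate(confidences):
        # A confidence of exactly 1.0 is placed in the last bin.
        idx = min(int(conf * n_bins), n_bins - 1)
        bins[idx].append(i)

    ece = 0.0
    for members in bins:
        if not members:
            continue
        avg_conf = sum(confidences[i] for i in members) / len(members)
        bin_acc = sum(correct[i] for i in members) / len(members)
        ece += (len(members) / n) * abs(bin_acc - avg_conf)
    return ece


if __name__ == "__main__":
    confs = [0.9, 0.9, 0.6, 0.6]       # model's stated confidence per answer
    right = [True, True, True, False]  # whether each answer was correct
    print(accuracy(right), expected_calibration_error(confs, right))
```

A lower calibration error means the model’s stated confidence is a better guide to how often it is actually right, which matters when staff decide which outputs need a second look.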

The final article in this series will take on the crucial issue of helping small providers, already strained past their limits to meet current patient needs, derive the benefits that this series has ascribed to generative AI.

About the author

Andy Oram

Andy is a writer and editor in the computer field. His editorial projects have ranged from a legal guide covering intellectual property to a graphic novel about teenage hackers. A correspondent for Healthcare IT Today, Andy also writes often on policy issues related to the Internet and on trends affecting technical innovation and its effects on society. Print publications where his work has appeared include The Economist, Communications of the ACM, Copyright World, the Journal of Information Technology & Politics, Vanguardia Dossier, and Internet Law and Business. Conferences where he has presented talks include O'Reilly's Open Source Convention, FISL (Brazil), FOSDEM (Brussels), DebConf, and LibrePlanet. Andy participates in the Association for Computing Machinery's policy organization, named USTPC, and is on the editorial board of the Linux Professional Institute.
