TrimFunction
New Member
- Joined
- Jan 9, 2018
- Messages
- 18
background: I'm building a reference file of branded product names. My aim is to have a clean list that can be used to search through a free text fields from a CRM system.
I want to find brand-names, but do not want common words like "account" or "management", since that will return almost every entry. the list of terms i search for is usually about 300+, using {=INDEX($B$2:$B$300,MATCH(1,COUNTIF(I3,"*"&$A$2:$A$300&"*"),0))}.
Question: What is the best way to "scrub" out the brand names from free text?
my current process is start with the list:
[TABLE="width: 311"]
<colgroup><col></colgroup><tbody>[TR]
[TD]Original Product Name[/TD]
[/TR]
[TR]
[TD]Accounts Receivable, JD Edwards EnterpriseOne[/TD]
[/TR]
[TR]
[TD]IBM Kenexa Employee Assessments[/TD]
[/TR]
[TR]
[TD]Catalog Management, PeopleSoft[/TD]
[/TR]
[TR]
[TD]SAP Ariba Invoice Management
Text to columns, sort, then manually read through to determine which are "brand" names:
Column A Column B
[TABLE="width: 194"]
<colgroup><col><col></colgroup><tbody>[TR]
[TD]Accounts[/TD]
[TD]SKIP-Common[/TD]
[/TR]
[TR]
[TD]Ariba[/TD]
[TD]USE[/TD]
[/TR]
[TR]
[TD]Assessments[/TD]
[TD]SKIP-Common[/TD]
[/TR]
[TR]
[TD]Catalog[/TD]
[TD]SKIP-Common[/TD]
[/TR]
[TR]
[TD]Edwards[/TD]
[TD]USE[/TD]
[/TR]
[TR]
[TD]Employee[/TD]
[TD]SKIP-Common[/TD]
[/TR]
[TR]
[TD]EnterpriseOne[/TD]
[TD]USE[/TD]
[/TR]
[TR]
[TD]IBM[/TD]
[TD]USE[/TD]
[/TR]
[TR]
[TD]Invoice[/TD]
[TD]SKIP-Common[/TD]
[/TR]
[TR]
[TD]JD[/TD]
[TD]USE[/TD]
[/TR]
[TR]
[TD]Kenexa[/TD]
[TD]USE[/TD]
[/TR]
[TR]
[TD]Management[/TD]
[TD]SKIP-Common[/TD]
[/TR]
[TR]
[TD]Management,[/TD]
[TD]SKIP-Common[/TD]
[/TR]
[TR]
[TD]PeopleSoft[/TD]
[TD]USE[/TD]
[/TR]
[TR]
[TD]Receivable,[/TD]
[TD]SKIP-Common[/TD]
[/TR]
[TR]
[TD]SAP[/TD]
[TD]USE[/TD]
[/TR]
</tbody>[/TABLE]
I've been using countifs to determine how frequent the word occurs and LEN to quickly go through short words, but this is not efficient.
Is there a better way to do this?[/TD]
[/TR]
[TR]
[TD][/TD]
[/TR]
</tbody>[/TABLE]
I want to find brand-names, but do not want common words like "account" or "management", since that will return almost every entry. the list of terms i search for is usually about 300+, using {=INDEX($B$2:$B$300,MATCH(1,COUNTIF(I3,"*"&$A$2:$A$300&"*"),0))}.
Question: What is the best way to "scrub" out the brand names from free text?
my current process is start with the list:
[TABLE="width: 311"]
<colgroup><col></colgroup><tbody>[TR]
[TD]Original Product Name[/TD]
[/TR]
[TR]
[TD]Accounts Receivable, JD Edwards EnterpriseOne[/TD]
[/TR]
[TR]
[TD]IBM Kenexa Employee Assessments[/TD]
[/TR]
[TR]
[TD]Catalog Management, PeopleSoft[/TD]
[/TR]
[TR]
[TD]SAP Ariba Invoice Management
Text to columns, sort, then manually read through to determine which are "brand" names:
Column A Column B
[TABLE="width: 194"]
<colgroup><col><col></colgroup><tbody>[TR]
[TD]Accounts[/TD]
[TD]SKIP-Common[/TD]
[/TR]
[TR]
[TD]Ariba[/TD]
[TD]USE[/TD]
[/TR]
[TR]
[TD]Assessments[/TD]
[TD]SKIP-Common[/TD]
[/TR]
[TR]
[TD]Catalog[/TD]
[TD]SKIP-Common[/TD]
[/TR]
[TR]
[TD]Edwards[/TD]
[TD]USE[/TD]
[/TR]
[TR]
[TD]Employee[/TD]
[TD]SKIP-Common[/TD]
[/TR]
[TR]
[TD]EnterpriseOne[/TD]
[TD]USE[/TD]
[/TR]
[TR]
[TD]IBM[/TD]
[TD]USE[/TD]
[/TR]
[TR]
[TD]Invoice[/TD]
[TD]SKIP-Common[/TD]
[/TR]
[TR]
[TD]JD[/TD]
[TD]USE[/TD]
[/TR]
[TR]
[TD]Kenexa[/TD]
[TD]USE[/TD]
[/TR]
[TR]
[TD]Management[/TD]
[TD]SKIP-Common[/TD]
[/TR]
[TR]
[TD]Management,[/TD]
[TD]SKIP-Common[/TD]
[/TR]
[TR]
[TD]PeopleSoft[/TD]
[TD]USE[/TD]
[/TR]
[TR]
[TD]Receivable,[/TD]
[TD]SKIP-Common[/TD]
[/TR]
[TR]
[TD]SAP[/TD]
[TD]USE[/TD]
[/TR]
</tbody>[/TABLE]
I've been using countifs to determine how frequent the word occurs and LEN to quickly go through short words, but this is not efficient.
Is there a better way to do this?[/TD]
[/TR]
[TR]
[TD][/TD]
[/TR]
</tbody>[/TABLE]