Extract all occurrences of specific characters from strings

There is something similar in my code.

mystring.Split(new[]{"/","*"}, StringSplitOptions.RemoveEmptyEntries);

      

however, I actually want to separate mystring

from two arrays, one containing the highlighted items above, and the other array holding the separators above in the order they appear on the line.

I could use .IndexOf

to keep searching until I fetch all of them, but somehow I think that would be redundant. Is there a way to do this in .NET? If possible, I want to avoid LINQ.

Thank.

+3


source to share


5 answers


Something like:



var separators = new char[] { '/', '*' };
var words = new List<string>();
var delimiters = new List<string>();
var idx = source.IndexOfAny(separators);
var prevIdx = 0;
while (idx > -1)
{
    if (idx - prevIdx > 0)
        words.Add(source.Substring(prevIdx, idx - prevIdx));

    prevIdx = idx + 1;
    delimiters.Add(source.Substring(idx, 1));
    idx = source.IndexOfAny(separators, idx + 1);
}

      

+4


source


If I understand the questionnaire correctly, it wants the actual separated items as well as the delimiters.

I think the following code will work:



        List<string> SeparatedItems = new List<string>();
        List<string> Delimiters = new List<string>();

        string sTestString = "mytest/string*isthis**and not/this";

        string sSeparatedItemString = String.Empty;
        foreach(char c in sTestString) {

            if(c == '/' || c == '*') {

                Delimiters.Add(c.ToString());
                if(sSeparatedItemString != String.Empty) {

                    SeparatedItems.Add(sSeparatedItemString);
                    sSeparatedItemString = String.Empty;  
                }
            }
            else {

                sSeparatedItemString += c.ToString();    
            }   

        }

        if(sSeparatedItemString != String.Empty) {

            SeparatedItems.Add(sSeparatedItemString);
        }

      

+2


source


Try the following:

var items = new List<string>();
var delimiters = new List<string>();
items.AddRange(Regex.Split(text, @"(?<=/)|(?=/)|(?<=\*)|(?=\*)"));

for (int i = 0; i < items.Count; )
{
    string item = items[i];
    if (item == "*" || item == "/")
    {
        delimiters.Add(item);
        items.RemoveAt(i);
    }
    else if (item == "")
    {
        items.RemoveAt(i);
    }
    else
    {
        i++;
    }
}

      

+1


source


You can treat Regex expression using named groups. Try a nested named group. Outside includes splitter grab and inner content grab only.

0


source


Since you are working in .NET 2.0, I would say using IndexOf

is one of the easiest ways to solve the problem:

public static int CountOccurences(string input, string pattern)
{
    int count = 0;
    int i = 0;

    while (i = input.IndexOf(pattern, i) != -1)
        count++;
    return count;
}

      

Rob Smyth's solution will work as well, but I find this to be the simplest and most straightforward.

0


source







All Articles