TagsProvider is a tool for extracting HTML tags from a string, in event-driven way. Helps to extract text, structured data, from a specific site.
Go to file
Alexander Kozachenko 0ed3dd52f1 Tags provider rework.
Extend PipelineMessagePublisher directly
2023-12-08 04:43:09 +03:00
pack Tags provider rework. 2023-12-08 04:43:09 +03:00
src Tags provider rework. 2023-12-08 04:43:09 +03:00
tests Removed obsolete TagsProviderListener 2023-12-08 00:53:59 +03:00
Changelog.md moved to src 2023-11-06 19:32:19 +03:00
License moved to src 2023-11-06 19:32:19 +03:00
Readme.md Updated Readme for 2.0.0-rc1.3 2023-12-02 20:02:38 +03:00
src.code-workspace Release v1.0.2 2023-11-07 16:51:25 +03:00
src.sln Release v1.0.2 2023-11-07 16:51:25 +03:00

ProSol.Html.TagsProvider

TagsProvider is a tool for extracting HTML tags from a string, in event-driven way. Helps to extract text, structured data, from a specific site.

How to use?

Install the package:

dotnet add package ProSol.Html.TagsProvider --version 2.0.0-rc1.3

Fetch some html:
```csharp
var url = "https://en.wikipedia.org/wiki/Food_energy";
var html = HtmlSource.GetHtmlAsync(url).Result;

Process all a tag:

var provider = new TagsProvider();
provider.Subscribe(new ConsoleLogObserver(html), "a");
provider.Process(html);

That's it! The provider notifies about any tag met and its data:

  • name,
  • range of entire tag,
  • range of inner content.

More demos here.